An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/bropenguin847/python-learning

Learning Python for Scientific Programming. Intended for beginners with some knowledge of Python.

array-manipulations convolution jupyter matlab matplotlib-pyplot numpy pandas-python pep8 python python3 scientific-programming scipy spyder-ide vscode

Last synced: 13 Apr 2026

https://github.com/ot-code/coca-cola-stock-prediction

This repo compares four predictive models—Linear Regression, ARIMA, XGBoost, and LSTM—to forecast Coca‑Cola FEMSA stock closing prices using Python and five years of historical data.

arima csv linear-regression lstm-neural-networks mae matplotlib mse numpy pandas python r2 scikit-learn seaborn tensorflow-keras xgboost

Last synced: 13 Apr 2026

https://github.com/aritrakar/ai-projects

🤖A centralized collection of links to my AI projects roughly divided by topic.

numpy pandas python pytorch tensorflow

Last synced: 13 Apr 2026

https://github.com/5hraddha/eda-instacart-customers-shopping-habits

In this Exploratory Data Analysis (EDA) project we'll clean up the data and prepare a report that gives insight into the shopping habits of Instacart customers.

data-visualization exploratory-data-analysis instacart matlpotlib numpy pandas

Last synced: 13 Apr 2026

https://github.com/ediloaz/linear-regression-with-lists-numpy-and-pytorch

This was done like the homework #01 of Machine Learning Course, of Computer Science Master.

arrays-raw jupyter-notebook machine-learning numpy python pytorch

Last synced: 13 Apr 2026

https://github.com/legalaspro/rnn_gru_lstm_experiments

RNN, GRU, LSTM implementation using PyTorch and Numpy

gru ltsm numpy pytorch rnn

Last synced: 21 Apr 2026

https://github.com/tkc/node-neural-network

Built with only pure typescript, neural network iris sample.

javascript machine-learning neural-network node numpy typescript

Last synced: 09 Apr 2026

https://github.com/jersonrn/termviewer3d

Interactive terminal environment that allows users to explore 3D models in real-time.

3d ascii cupy gpu linear-algebra numpy python

Last synced: 19 May 2026

https://github.com/theblueberrycompendium/keras-movie-classification

This is an basic application that uses Keras

ai keras-tensorflow numpy python3 simple-project

Last synced: 13 Apr 2026

https://github.com/elvisrock/government_analysis

Esse projeto, tem como objetivo, analisar, explorar algumas técnicas e propor uma sugestão ao governo de forma geral, sem qualquer viés politico.

data-science machine-learning numpy pandas python sklearn

Last synced: 05 Apr 2026

https://github.com/sweta-kaundilya/finance-data-project

This data project focuses on exploratory data analysis of bank stock prices throughout financial crisis all the way to early 2016.

financial-analysis jupyter-notebook matplotlib numpy pandas-python seaborn

Last synced: 16 May 2026

https://github.com/farhad-here/height-distribution-analysis

Statistical comparison of height distributions in two groups using mean, standard deviation, and boxplots.

coefficient-of-variation data-analysis interquartile-ranges matplotlib mean numpy python scipy standard-deviation variance

Last synced: 13 Apr 2026

https://github.com/swatisinghit/treadmill-customer-profiling-for-aerofit

Create comprehensive customer profiles for each AeroFit treadmill product through descriptive analytics. Develop two-way contingency tables and analyze conditional and marginal probabilities to discern customer characteristics, facilitating improved product recommendations and informed business decisions.

analytics conditional-probability data-analysis data-science data-visualization eda numpy pandas probability statistics

Last synced: 08 May 2026

https://github.com/sducournau/ign_lidar_hd_dataset

🏗️ Comprehensive Python library for processing IGN LiDAR HD data into machine learning-ready datasets for Building Level of Detail (LOD) classification. Features GPU/CPU processing, smart data management, and complete ML pipeline integration.

building-classification data-processing dataset france geospatial gis ign lidar lidar-hd numpy point-cloud scikit-learn

Last synced: 20 Jan 2026

https://github.com/christabelsakyi/tsec

This project implements a time series forecasting solution for predicting electricity consumption based on historical data. It provides multiple model options, a Flask API for deployment, and visualization tools.

facebook numpy pandas python sklearn time-series-analysis

Last synced: 07 Oct 2025

https://github.com/raveenaross/bulldozer-price-prediction

Predicting the sale price of bulldozers using machine learning

matplotlib numpy pandas sklearn time

Last synced: 07 Oct 2025

https://github.com/ricardorobledo/malicious_server_hack_detection

Predictive model to detect malicious hacking patterns in banking servers. Utilizes advanced Machine Learning techniques such as SMOTE, Gradient Boosting, and probability calibration to predict attacks befor

anaconda cibersecurity imbalanced-data imbalanced-learning imblearn kaggle matplotlib numpy pandas pandas-library python3 sklearn

Last synced: 14 Apr 2026

https://github.com/davityak03/object-detection-in-images-using-yolo

This project demonstrates object detection using the YOLO (You Only Look Once) model, providing a solution for detecting and labeling multiple objects within images or video streams. This implementation covers image preprocessing, model inference, and visualization of detected objects.

computer-vision matplotlib-pyplot numpy object-detection opencv-python pandas python yolo

Last synced: 18 Jan 2026

https://github.com/piyushxbhardwaj/credit-card-fraud-detection

In this project, we focus on developing a machine learning model to detect fraudulent transactions within credit card data. Credit card fraud poses a significant challenge to financial institutions and consumers globally, with traditional rule-based detection systems often struggling to adapt to new fraud tactics.

matplotlib numpy pandas python scikit-image seaborn

Last synced: 16 Feb 2026

https://github.com/rachakondaganesh/superstore-sales-data-analysis-project

Analyzed retail sales data to uncover key business insights using Python and Power BI. Explored patterns in profit, sales, and customer segments across regions. Built interactive dashboards to visualize trends, identify top-performing categories, and highlight areas for improvement in shipping and discount strategies.

dashboard exploratory-data-analysis matplotlib-pyplot numpy pandas powerbi seaborn

Last synced: 02 May 2026

https://github.com/ahmedabdalkreem/lung-cancer

It help to know the patient have lung cancer or not and show what depenent in this disease that lead to this disease.

matplotlib numpy pandas python sklearn-library

Last synced: 14 Apr 2026

https://github.com/gherrada22/spanish-sign-translator

This project uses Python and the OpenCV and MediaPipe libraries to detect and translate signs in real time.

language matplotlib mediapipe numpy opencv pycharm-ide python python3 spanish-language toml

Last synced: 18 Jan 2026

https://github.com/bhavnanahar/coral_project

ChatGPT said: This project analyzes and predicts the health of coral reefs in the Florida Keys by examining coral cover and species richness trends over time.

colab-notebook matplotlib numpy pandas plotly python random-forest scikit-learn seabo

Last synced: 14 Apr 2026

https://github.com/dmarks84/coursework_project_image-text-recognition

Project for University of Michigan Python Programming Specialization -- Read in documents with images and text, and utilized CV libraries/packages to extract specific types of images and text, pairing them together

classification computer-vision image-classification numpy pandas programming python text-classification

Last synced: 14 Apr 2026

https://github.com/ianhaggerty/final-capstone

This represents the final capstone project in my HyperionDev Data Science (fundamentals) course. A dataset of Amazon customer reviews is analysed using natural language processing.

amazon data-analytics data-science data-visualization dataset matplotlib nlp nlp-machine-learning numpy pandas plotly reviews seaborn spacy tabulate textblob wordcloud

Last synced: 19 Jan 2026

https://github.com/rakesh3596/apple-phone-sales-analysis

Apple Phone Sales Analysis using Python tools like Pandas, Matplotlib, Numpy, Plotly, Apple csv file

csv-files csv-reader matplotlib numpy pandas plotly python

Last synced: 14 Apr 2026

https://github.com/thekartikeyamishra/colorextractor

A Python-based project that extracts the dominant colors from an image using the KMeans clustering algorithm. This project is useful for applications like image processing, color palette generation, or analyzing the most prominent colors in images.

kmeans-clustering matplotlib matplotlib-pyplot numpy p pillow python sklearn

Last synced: 05 May 2026

https://github.com/tnel7/data-science-yandex-practicum

Проекты из Яндекс Практикума "Специалист по Data Science"

data-science machine-learning nlp-machine-learning numpy pandas seaborn sklearn

Last synced: 19 Jan 2026

https://github.com/chandkund/housing-price-prediction

Predict housing prices using the Boston Housing Dataset. Covers data loading, cleaning, preprocessing, EDA, normalization, standardization, and regression models (Linear Regression, Decision Tree, Random Forest, Extra Trees). Evaluated with Mean Squared Error (MSE). Tech: Python, Pandas, NumPy, Scikit-learn, Seaborn, Matplotlib.

data-science data-visualization matplotlib numpy pandas pyhton sklearn sklearn-library sklearn-metrics

Last synced: 21 Jan 2026

https://github.com/badranalyst/time-series-analysis-of-global-trends-in-diet-gym-and-finance

This project analyzes global trends in diet, gym, and finance over time using time series data. The analysis is performed using Python libraries like Pandas, Matplotlib, and Seaborn to visualize trends and identify patterns in these sectors across various countries.

data-analysis dataset matplotlib-pyplot numpy pandas python seaborn time-series

Last synced: 14 Apr 2026

https://github.com/ljadhav25/false-alarm-detection

The data was first pre-processed and analysis libraries like Numpy and Pandas were used to make it ready to be utilized by a machine learning algorithm.Problems like standard scaling, categorical data and missing values were handled with appropriate techniques. Then, we used LogisticRegression model.

flask flask-api logestic-regression numpy pandas pkl pycharm-ide python

Last synced: 19 Jan 2026

https://github.com/akshay-kamath/anomaly_detection

This project addresses the challenge of identifying anomalies in sales data, aiming to optimize sales strategies by uncovering deviations from expected profit patterns.

anomaly anomaly-detection cblof flask isolation-forest knn matplotlib numpy pandas pyod python sklearn

Last synced: 19 Jan 2026

https://github.com/saylie/cat-dog-classifier-

The Cat-Dog Classifier is a deep learning-based image classification project that can differentiate between images of cats and dogs. It uses a Convolutional Neural Network (CNN) model trained on labeled datasets to predict whether a given image belongs to a cat or a dog.

ai javascript keras numpy opencv python react tensorflow

Last synced: 14 Apr 2026

https://github.com/massimilianovisintainer/rock-vs-mine-prediction

Rock vs. Mine classification using Logistic Regression

machine-learning model numpy pandas sklearn

Last synced: 21 Jan 2026

https://github.com/soumyapro/parkinson-disease-prediction

This project predicts Parkinson's disease using machine learning models.

logistic-regression numpy pandas scikit-learn svc xgboost

Last synced: 19 Jan 2026

https://github.com/charancholaveti/imdb-sentimentanalysis

IMDB Movie Review Sentiment Analysis-It will give the sentiment of the review whether it is Positive or Negative

deeplearning keras numpy rnn-tensorflow tensorflow

Last synced: 14 Apr 2026

https://github.com/ricardorobledo/paymentcardfrauddetection2025

Comparative analysis of probabilistic classification models for credit card fraud detection, focusing on model calibration and threshold optimization in highly imbalanced datasets.

imbalanced-learn matplotlib numpy pandas python3 scikit-learn search

Last synced: 14 Apr 2026

https://github.com/dhruvrajsinhzala24/house-price-predictor-ml-model

A machine learning project to predict house prices using linear regression and random forest models. The project leverages housing data to train and evaluate models, comparing their performance using metrics like MSE and R². Visualizes actual vs. predicted prices to assess model accuracy.

linear-regression machine-learning matplotlib ml numpy pandas python random-forest-regression sklearn

Last synced: 14 Apr 2026

https://github.com/sabbir45ali/cam-board

This is Python based project which is a virtual board and we write there using our finger and clear it and color changing.

numpy pandas python3

Last synced: 14 Apr 2026

https://github.com/zahediparsa/nlp_ta_uok

This repository contains exercises designed for students in the Natural Language Processing (NLP) course at the University of Kurdistan, taught by Dr. Fatemeh Daneshfar. The course took place from February 2024 to July 2024.

cbow jupyter-notebook matplotlib nlp nltk numpy pandas python tf-idf

Last synced: 19 Jan 2026

https://github.com/angelarreola/ai_notes

Notas de la materia "Inteligencia Artificial" para su posterior extraccion mediante algun modelo de lenguaje que nos permita dar respuestas personalizadas con base a la informacion presente en este repositorio.

ai matplotlib numpy pandas phaserjs python scikit-learn

Last synced: 21 Jan 2026

https://github.com/abhishekyadav915/multiple_disease_prediction_using_ml

Developed a web-based application for predicting the likelihood of multiple diseases using machine learning models with accuracy up to 95%.

colab-notebook logistic-regression numpy pandas python3 sklearn spyder-python-ide streamlit-web streamlitcloud svm-classifier

Last synced: 05 May 2026

https://github.com/gerlero/generala

🎲 Probabilistic tool for the dice game of Generala

generala numpy python

Last synced: 27 Jan 2026

https://github.com/avik-pal/kaggle-titanic

Predicting whether a given set of people survive on the Titanic

machine-learning numpy pandas scikit-learn scikitlearn-machine-learning

Last synced: 14 Apr 2026

https://github.com/matkorussovich/student-performance-analysis

Este repositorio contiene un análisis del desempeño académico de estudiantes, realizado como parte del módulo "Introducción al Data Science" en el Máster en Data Science de la Universidad Europea de Madrid.

jupyter-notebook matplotlib-pyplot numpy pandas python

Last synced: 14 Apr 2026

https://github.com/karthik9273/revolutionizing-gold-rate-forecasting-for-small-businesses-with-machine-learning

The "Gold Price Prediction" project focuses on predicting the prices of gold using machine learning techniques. By leveraging popular Python libraries such as NumPy, Pandas, Scikit-learn (sklearn), Matplotlib, Seaborn, Random Forest Regressor, and others, this project provides a comprehensive solution for accurate price estimation.

data-science google-colab-notebook jupyter-notebook machine-learning matplotlib numpy pandas-dataframe python seaborn sklearn

Last synced: 06 May 2026

https://github.com/mjul/scipy-lab

Scientific computation with Python

matplotlib numpy python scipy

Last synced: 04 May 2026

https://github.com/mbarbetti/hello-snakemake

Simple implementation of snakemake to sample and plot data

hello-world matplotlib numpy python snakemake

Last synced: 14 Apr 2026

https://github.com/sunilvarma-l/liverdiseaseprediction

"Streamlit app to predict liver disease risk using a machine learning model based on patient input data."

machine-learning matplotlib numpy pandas pickle python scikit-learn seaborn streamlit

Last synced: 13 Apr 2026

https://github.com/neelays/xor-xnor_neural_network

NumPy neural network to approximate XOR/XNOR

numpy

Last synced: 15 May 2026

https://github.com/sharif-minhaz/rag-system

Ingest and vectorize content upon publication, store embedding, then retrieve and augment user queries with context to generate high-quality responses.

faiss flask mysql2 nodejs numpy rag react transformers

Last synced: 14 Apr 2026

https://github.com/saivarun2611/potato_disease_classifier

This project is a Potato Disease Classifier that uses a Convolutional Neural Network (CNN) to classify potato leaves as healthy, suffering from early blight, or late blight. The model helps in early detection of diseases, which can be crucial in maintaining crop health and yield.

cnn keras matplotlib numpy pandas python3 tensorflow

Last synced: 14 Apr 2026

https://github.com/sun-data/ndoptimize

Numba-accelerated minimization of objective functions.

astropy minimization numba numpy optimization scientific-computing

Last synced: 27 Jan 2026

https://github.com/shuyib/pimadlkeras

Using keras specify-compile-fit- predict workflow on this binary classification problem to investigate if i'll get better predictions.

diabetes hidden-layers keras-neural-networks matplotlib-figures model-validation numpy pima-indians-dataset prediction predictive-modeling python-3-5 seaborn

Last synced: 15 Mar 2026

https://github.com/mlicamele/neural-network

Project focused on exploring the computations behind neural networks by building one from scratch with only numpy and testing it with the MNIST dataset.

gradient-descent matrix-computations neural-networks numpy python

Last synced: 12 Apr 2026

https://github.com/chirindaopensource/search_benford_law_compatibility

End-to-End Python scalable forensic accounting toolkit implementing Benford's Law analysis for FTSE financial data. Delivers automated anomaly detection with Chi-Squared/MAD testing, comprehensive validation pipelines, and risk-based prioritization of investigative resources. Replicates Ausloos et al.'s (2025) methodology with full reproducibility.

academic-research anomaly-detection benfords-law chi-squared-test data-validation econometrics financial-analysis financial-data forensic-accounting fraud-detection ftse goodness-of-fit jupyter-notebook numpy pandas python reproducible-research risk-management scipy statistical-testing

Last synced: 12 Apr 2026

https://github.com/harmanveer-2546/reducing-data-entries

Way to delete data entries from csv/excel file using. For excel file, use excel instead of csv in the code.

csv data data-entry delete-data excel numpy pandas python

Last synced: 05 May 2026

https://github.com/hellymodikalpesh/numpy

This repository is designed as a comprehensive resource for learning the NumPy library. It covers nearly all functions and features of NumPy, providing a thorough exploration of its capabilities for numerical computing and data analysis.

jupyter-notebook numpy

Last synced: 28 Jan 2026

https://github.com/adriantomin/bulldozer-price-prediction

Predicting the Sale Price of Bulldozers Using Machine Learning 🚜💰 This project uses machine learning to predict bulldozer sale prices based on historical data from the Kaggle Bluebook for Bulldozers competition. The goal is to minimize the RMSLE between actual and predicted prices.

data-science jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn

Last synced: 23 Jan 2026

https://github.com/rezasharifi82/lesson_extractor

Extract all possible "معارف" courses @ Ferdowsi University of Mashhad ,FUM

beautifulsoup4 ipynb jupyter-notebook numpy pandas python3 regex

Last synced: 06 Feb 2026

https://github.com/harmanveer2546/heart

Predicting the presence of heart disease based on several health-related factors and Performing - i.) Data Cleaning ii.) Data Pre-Processing iii.) EDA iv.) Compare 5 different classification algorithms (Logistic Regression, Decision Tree, Random Forest, KNN and SVC)

data-preprocessing decision-tree eda knn logistic-regression machine-learning numpy pandas random-forest roc-auc-curve svc

Last synced: 03 May 2026

https://github.com/alejandro945/insurance-risk

This project aims to predict the risk of insurance claims using a dataset from Kaggle. The dataset consists of 26 columns and 205 rows, providing various features related to insurance risk. By analyzing this data, we seek to build predictive models that can help insurers assess the risk of claims.

data-analytics ipython-notebook numpy pandas python

Last synced: 06 Feb 2026

https://github.com/hansalemaos/cv2_stack_images

Stacks 2 images (horizontal/vertical)

concat cv2 images numpy python stack

Last synced: 28 Jan 2026

https://github.com/ishinzoo/songrecommendation

This project is a machine learning-based system that recommends songs based on the user's detected emotions. The application uses facial expression recognition to determine the user's current emotional state and suggests songs that align with that emotion. This system can be particularly useful for personalized music streaming services, helping use

machine-learning mediapipe numpy opencv os python tenserflow

Last synced: 25 Feb 2026

https://github.com/keyurparalkar/breast-cancer-detection

Predict whether the cancer is benign or malignant

gradient-descent logistic-regression machine-learning numpy

Last synced: 26 Apr 2026

https://github.com/amruta33/credit_card_analysis

The loan providing companies find it hard to give loans to the people due to their insufficient or non-existent credit history. Because of that, some consumers use it as their advantage by becoming a defaulter.

numpy pandas python3

Last synced: 15 Apr 2026

https://github.com/ankitpundir7/group-face-emotion-recognition

This project can identify and categorize human emotions in both static and dynamic context. Duration : Jun 2023 - Jul 2023

convolutional-neural-networks deep-learning keras machine-learning-algorithms numpy opencv python tensorflow

Last synced: 07 Feb 2026

https://github.com/offthetab/lung_cancer_prediction

ML for Lung Cancer Prediction dataset from kaggle.

matplotlib numpy pandas python seaborn sklearn

Last synced: 25 Feb 2026

https://github.com/anshpg/exploration-in-image-processing-digit-image-generation

This project, developed by Anshuman Pattnaik, explores image processing techniques using Python libraries such as pandas, numpy, matplotlib, and cv2 (OpenCV). The primary objective of the project was to delve into image processing with a focus on creating a unique dataset and algorithm for image generation.

cv2 image-generation image-processing ipynb-jupyter-notebook matplotlib-pyplot numpy opencv pandas

Last synced: 01 Feb 2026

https://github.com/luizabash/wmap-tt-eda

EDA of the WMAP 5-Year TT Power Spectrum data, the analysis includes summary statistics, data visualization, polynomial regression for model fitting, and hypothesis testing

astrophysics matplotlib numpy pandas scipy

Last synced: 01 Feb 2026

https://github.com/sarowarahmed/advertising-sales-app

📈 Advertising Sales Predictor: A web app powered by a Machine Learning model, built with Numpy, Pandas, Scikit-learn, and Streamlit, to forecast sales based on TV, Newspaper, and Online Advertising. Deployed on Streamlit Cloud for real-time, easy-to-use predictions.

advertising app machine-learning multiple-linear-regression numpy pandas sales scikit-learn streamlit

Last synced: 07 Feb 2026

https://github.com/sharkb8t/credit-risk-classification

Demonstrates my abilities to use Jupyter Notebook with scikit-learn to train and evaluate a machine learning model.

jupyter-notebook numpy pandas pathlib python scikit-learn

Last synced: 15 Apr 2026

https://github.com/giihcastro/trabalho-fisica

3D VPython Electromagnetic Particle Motion Simulator

numpy python vpython

Last synced: 15 Apr 2026

https://github.com/ryancswallace/npdb

Parallel NumPy-like interface for large n-dimensional arrays on disk.

mapreduce-designpatterns numpy numpy-arrays parallel-computing python

Last synced: 09 Feb 2026

https://github.com/hansalemaos/all_against_all

Each item of a list against all others

list numpy

Last synced: 02 Apr 2026

https://github.com/sahilk12nayak/data-capstone-projects-using-python-libraries

In this data project we will focus on exploratory data analysis. This project utilizes Python libraries such as NumPy, Pandas, Matplotlib, and Seaborn for data manipulation, analysis, and visualization.

matplotlib numpy pandas seaborn

Last synced: 15 Apr 2026

https://github.com/cherukuri-thanu/conjoint-analysis-netflix

This repository contains configuration files for performing conjoint analysis on Netflix survey data.

conjoint-analysis logistic-regression matplotlib-pyplot netflix numpy pandas squantify statsmodels

Last synced: 16 Apr 2026

https://github.com/karthikarajagopal44/numpy-beginner-to-advanced

This repository is designed to help you master NumPy, the fundamental package for scientific computing with Python. Whether you are just getting started or looking to refine your skills with advanced techniques, this repo covers a comprehensive range of topics and functionalities. It covers the range of topics from beginner to advance.

data-science numpy numpy-library numpy-tutorial python

Last synced: 16 Apr 2026

https://github.com/mouradhamzaoui/tweets_sentiments_emotions_analysis

Les réseaux sociaux, et en particulier Twitter, sont des plateformes essentielles où les utilisateurs expriment leurs sentiments et émotions au quotidien. Analyser les tweets peut fournir des informations précieuses sur l’humeur collective et les préoccupations des gens. Ce projet utilise un dataset de tweets disponible sur Kaggle

nltk numpy pandas sklearn

Last synced: 16 Apr 2026

https://github.com/shakhthi/mlproject

This project is an end-to-end ML solution that predicts math scores from student attributes. It starts with ingesting and preprocessing data, followed by feature engineering to pinpoint key predictors. A robust model is then trained and fine-tuned before being seamlessly deployed to deliver real-time predictions.

jupyter-notebook matplotlib-pyplot numpy pandas python3 seaborn sklearn

Last synced: 16 Apr 2026

https://github.com/tzepart/choosing-oil-region

Choosing and initialisation a machine learning model that will help determine the region where production will bring the greatest profit with the least risk of loss.

matplotlib ml numpy pandas seaborn sklearn

Last synced: 16 Apr 2026