An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/kenwuqianghao/ml-zoomcamp

Code and homework for ML Zoomcamp

machinelearning numpy pandas python3 tensorflow

Last synced: 06 Apr 2026

https://github.com/ashithapallath/comparison-of-ssd-sad-in-motion-estimation-of-video-coding

Course assignment for Algorithm and Massive Datasets comparing SAD and SSD for motion estimation. Includes analysis of accuracy, speed (Python/NumPy), and error sensitivity. Highlights trade-offs for speed-critical vs. precision-focused applications.

l1-normalization l2-norm numpy pytho video-cod

Last synced: 04 Apr 2025

https://github.com/sarthakm402/rabbi

A Ml library that can be used ot clean,and use models to predict on data

numpy optuna pandas pypi-package python sklearn

Last synced: 20 Apr 2026

https://github.com/hariprasath-v/machinehack_analytics_olympiad_2023

Create a machine learning model to determine the likelihood of a customer defaulting on a loan based on credit history, payment behavior, and account details.

binaryclassification catboost exploratory-data-analysis machine-learning numpy pandas python scikit-learn shap

Last synced: 08 Apr 2026

https://github.com/mzayles/vendas_ficticias_dataprep

💻📊✅ Curso de Programação em Python para Data Science | Analisando e tratando dados fictícios.

numpy pandas python

Last synced: 15 Jun 2025

https://github.com/priyanshulathi/cancer-diagnosis-prediction-model

A Machine Learning project to predict cancer malignancy using K-Nearest Neighbor, Support Vector Machine, and Decision Tree algorithms.

machine-learning numpy pandas python scikit-learn

Last synced: 03 Jan 2026

https://github.com/barraharrison/seoul-bike-sharing

Performing EDA on a kaggle dataset to look at the distribution of Seoul's bike-sharing system

jupyterlab matplotlib numpy pandas python scikit-learn seaborn

Last synced: 23 Jul 2025

https://github.com/alphacrypto246/insurance-charges-prediction

The Predicting Insurance Charges project uses Decision Tree Regression to predict insurance charges based on features like age, sex, BMI, and smoking habits. It involves data preprocessing, feature scaling, and model evaluation with metrics like MAE and R².

machine-learning numpy pandas scikit-learn scikitlearn-machine-learning

Last synced: 03 May 2026

https://github.com/mgitrov/coce

A deep learning-based project aiming to classify images out of 10 classes.

computer-vision convolutional-neural-networks deep-learning docker fastapi keras matplotlib numpy pillow regularization

Last synced: 06 Apr 2026

https://github.com/eesunmoon/genai_cor-recom

[Project] Outfit Coordination Recommender System using KoAlpaca

data-analysis fine-tuning generative-ai huggingface keyword llm numpy pandas python python3 selenium

Last synced: 06 Apr 2026

https://github.com/elijah-1994/modelling-airbnbs-property-listing-dataset

Training and Evaluating classification and regression model use cases on the Airbnb dataset.

data-science machine-learning numpy pandas plotly python pytorch sklearn

Last synced: 10 Apr 2026

https://github.com/lfgodoi/rpm-deep-guesser

A deep learning-based RPM estimator based on spectral features extracted from vibration signals of rotating machines.

condition-monitoring deep-learning docker flask machine-learning neural-networks numpy python pytorch scipy signal-processing spectral-analysis

Last synced: 02 Mar 2025

https://github.com/shreyasdankhade/portfolio_optimatization_project

The Portfolio Optimization Project uses optimization techniques to balance risk and return, helping investors make efficient asset allocation decisions.

flask flask-application matplotlib numpy pandas pandas-python porfolio-optimization portfolio python

Last synced: 06 Apr 2026

https://github.com/ashkarin/rawimage

Read and write RAW images (or data arrays) without a headache.

image-processing imagej ndarray numpy raw-data raw-image readers writers

Last synced: 28 Apr 2026

https://github.com/dmarks84/ind_project_superstore-sales-time-series-analysis--kaggle

Independent Project - Kaggle Dataset-- I worked on the Superstore Sales Dataset, performing (as Part 1) data cleaning and preparation and exploratory data analysis. The main task was to make predictions for future sales based on time-series analysis, which is found in Part 2.

chloropleth data-modeling data-visualization eda linear-regression matplotlib numpy pandas python seaborn sklearn statistics statsmodels supervised-ml time-series-analysis

Last synced: 09 Apr 2026

https://github.com/t-lak/decision-tree

This project implements a basic Decision Tree classifier. It supports visualizing the tree and calculating performance metrics (accuracy, precision, F1-score, and recall).

decision-trees graphviz metrics numpy python3 scikit-learn

Last synced: 07 May 2026

https://github.com/harsha-yuvaraj/Data-Compression-and-Decompression-Tool

A specialized lossless file compression and decompression tool designed mainly for plain text, including programming files.

data-structures huffman-compression-algorithm lossless-compression-algorithm numpy python tkinter

Last synced: 02 May 2025

https://github.com/bassemhalim/webrtc-demo

webrtc demo using aiortc library

numpy opencv python3 webrtc

Last synced: 09 Apr 2026

https://github.com/eljandoubi/disasterresponsepipeline

Project aim is to build a Natural Language Processing (NLP) model to categorize messages on a real time basis.

flask nltk numpy pandas plotly scikit-learn scipy sqlalchemy

Last synced: 09 Apr 2026

https://github.com/armahdavi/analytics_statistics_ML_plotting_dust_extraction_hvac_filters_ph2

PhD Technical Paper 1 - Phase 2 - Mahdavi & Siegel (2020) (Aerosol Science & Technology; AS&T) - Sharing all the data pipelines, processing codes, descriptive statistics, statistical modellings, and plotting/visualizations - Project Miestone: 2017 - 2020 - Full-length article is available

data-pipelines data-science data-visualization machine-learning matplotlib-pyplot numpy pandas-dataframe python scipy-stats sklearn statistics

Last synced: 17 Sep 2025

https://github.com/armahdavi/data_pipeline_analytics_statistics_ML_PM_PSD_residential_QFF

Sharing all the data pipelines and processing codes, statistical modellings, descriptive statistics, plot visualizations, and machine learning from Mahdavi & Siegel (2021) (Indoor Air) Project Miestone: 2017 - 2020 Full-length article: https://onlinelibrary.wiley.com/doi/abs/10.1111/ina.12782

data-science data-visualization dust hvac indoor-air-quality jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats spyder spyder-python-ide statistics

Last synced: 17 Sep 2025

https://github.com/elijahondiek/shadowcoach-formtracker

A computer vision-based workout form tracking application that uses AI to monitor exercise form, count repetitions, and provide real-time feedback.

mediapipe-pose numpy opencv-python

Last synced: 18 Apr 2026

https://github.com/justinzhang17/comp-5801-queens

Reinforcement learning task of teaching an agent to play the game queens/star-battle

gymnasium numpy pygame python

Last synced: 15 May 2026

https://github.com/abeertechcamus/documentdata

The dataset was cleaned and queried using Python inside Jupyter Notebook and visualizes using PowerBI Document Data Analysis Projects

dax jupyter-notebook numpy pandas powerbi python

Last synced: 04 May 2026

https://github.com/dakshsammi/arkaid

Arkaid is a game performance analytics platform developed for the Information Integration Architecture Course - CSE656 (IIIT Delhi). It uses a data warehouse approach to analyze gaming data from multiple sources and provides insights via an AI-driven interface.

ai airflow aws database-management datawarehouse docker etl flask information-integration numpy openai postgresql python togetherai

Last synced: 09 Apr 2026

https://github.com/harmanveer-2546/campus-placement-prediction

Student Campus-Placement Prediction ML Project using Python

decision-trees machine-learning matplotlib numpy python random-forest scipy seaborn

Last synced: 08 Mar 2026

https://github.com/roluochke/real-estate-price-prediction-app

This is a data science project for a real estate price prediction website.

flask html-css-javascript jupyter-notebook matplotlib numpy python sklearn

Last synced: 09 Apr 2026

https://github.com/gilevatanya/yandex-practicum-projects

Кейсы решенные на курсах Яндекс Практикума.

bert bootstrap catboost keras lightgbm matplotlib nltk numpy pandas postgresql python pytorch scikit-learn scipy seaborn sql

Last synced: 06 Jan 2026

https://github.com/dsowles/miniproject1davidsowles2025

INF601 Mini Project 1

numpy yfinance

Last synced: 16 May 2026

https://github.com/riyajain255/customer-segmentation-for-e-commerce

This project analyzes online retail data to segment customers using K-Means clustering and build classification models to predict those segments based on purchasing behavior.

customer-segmentation data-analysis kmeans-clustering logistic-regression machine-learning matplotlib numpy pandas python random-forest scikit-learn seaborn-plots

Last synced: 02 Apr 2026

https://github.com/abyshergill/ultralytics_yolo_object_detection_testing_gui

This is a desktop application for real-time object detection using the YOLO (You Only Look Once) model build with ultralytics libarary. It's built with PyQt5 for the graphical user interface and OpenCV for video and image processing. The application allows users to load a YOLO model, adjust detection settings, and perform object detection on media.

numpy opencv-python pytorch qt5-gui threading ultralytics yolo

Last synced: 08 May 2026

https://github.com/prachi005748/website-performance-data-analysis-project

Briefly describe the objective of the project—e.g., analyzing website performance metrics over time, uncovering trends in user engagement, or evaluating channel-wise traffic quality.

data-analyst data-cleaning data-preprocessing data-visualization data-visualization-python exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn storytelling

Last synced: 01 May 2026

https://github.com/samiksha29-patil/e-commerce-sales-insights-dashboard

This project focuses on analyzing e-commerce sales data through data visualization. It highlights customer behavior, popular sales channels, product category trends, and city-wise performance to provide actionable business insights.

analytics customer-insights data-visualization ecommerce matplotlib numpy pandas python sales-analysis seaborn

Last synced: 03 May 2026

https://github.com/princeoncada/quant-pca-risk

Applies Principal Component Analysis (PCA) to daily returns of 20 US equities (2015–2025) to uncover hidden risk factors. Explores variance explained, scree, loadings, factor returns, covariance reconstruction, and Varimax rotation. Results show 3–5 PCs capture ~75% of portfolio risk.

correlation-analysis covariance-matrix dimensionality-reduction factor-models matplotlib numpy pandas pca portfolio-risk principal-component-analysis python quantitative-finance time-series-analysis variance varimax

Last synced: 06 May 2026

https://github.com/kushalagarwalla/netflix-movie-data-analysis

🚀 Netflix Data Analytics Project 🎬📊 | Analyzed 9K+ movies to uncover insights on genres, popularity, votes & release trends. Includes EDA, KPIs & visualizations using Python (Pandas, NumPy, Matplotlib, Seaborn). Supports data-driven content & engagement strategy.

data-analysis data-visualization jupyter-notebook numpy pandas python seaborn

Last synced: 06 May 2026

https://github.com/samiksha29-patil/flipkart-mobiles-data-analysis-visualization-in-python

This project analyzes Flipkart Mobiles Dataset to extract useful insights about mobile phones, their pricing, ratings, discounts, and customer reviews. The analysis and visualization are done using Python to understand market trends and customer preferences.

data-analysis data-visualization matplotlib numpy pandas python seaborn

Last synced: 04 May 2026

https://github.com/makcoder-2004/exploratory-data-analysis-projects

This repository features Exploratory Data Analysis (EDA) projects using Python, Pandas, NumPy, Matplotlib, and Seaborn to clean, explore, and visualize data. The goal is to uncover patterns, trends, and insights through statistical analysis and clear visualizations, preparing datasets for deeper analysis or modeling.

csv-files dataframes jyputer-notebook matplotlib numpy pandas seaborn

Last synced: 07 May 2026

https://github.com/kandil2001/lid-cavity-evolution

Lid Cavity Evolution is an open-source CFD suite for the lid-driven cavity problem, featuring MATLAB, Python, and parallel solvers with benchmark comparisons.

benchmarking cfd cfd-simulation lid-driven-cavity matlab mpi numpy open-source openfoam openmp parallel-computing python scientific-computing simple-algorithm star-ccm

Last synced: 06 May 2026

https://github.com/4ngelojr/ai-ml-cheatsheets

🗂️ Access essential AI and ML concepts with quick-reference cheatsheets for effective learning and project implementation.

bigdata cheatsheet computer-science convolutional-neural-networks data-science deep-learning dl excel hadoop keras linux neural-networks numpy pig plotly probability sql statistics

Last synced: 01 May 2026

https://github.com/labex-labs/numpy-for-beginners

This comprehensive course covers the fundamental concepts and practical techniques of NumPy, the essential library for numerical computing in Python. Learn to create, manipulate, and analyze arrays efficiently.

array-manipulation array-slicing beginner-friendly course data-analysis data-science data-structures fast-computation hands-on labex labs linear-algebra matrix-operations numerical-computing numpy programming python python-programming scientific-computing vectorized-operations

Last synced: 20 Jun 2026

https://github.com/vitor-ace/sunspots-data-analysis

This is a Jupyter Notebook which works with Data Analysis logic and libraries implementation with Python.

data-analysis data-visualization debbuging error-handling file-handling matplotlib-pyplot numpy pandas python

Last synced: 06 May 2026

https://github.com/jaketherealone/machinelearningtoolset

Collection of primary tools used in the field of machine learning - Written for numpy projects

machine-learning numpy python toolset

Last synced: 19 Jun 2026

https://github.com/rohanbanerjee1234567-cell/prediction-of-expected-salary-using-machine-learning

Here is my first Project Repository where I have made a Machine Learning Project using Python. The Problem statement was to train a model based on the given Dataset and from there we need to Predict the Expected Salary of an Employee who will have similar profiles.

exploratory-data-analysis linearregression matplotlib-pyplot numpy pandas randomforest randomforestregressor scikit-learn scikitlearn-machine-learning searborn visualization

Last synced: 27 Apr 2026

https://github.com/radithsandeepa/robot-path-visualizer

Welcome to Robot Path Visualizer! This tiny project uses Python, Pygame, and NumPy to replay a precomputed trajectory (x, y, θ). It rotates a robot sprite frame-by-frame and draws a trail, making it handy for diff-drive path playback, simulation demos, and quick visual debugging of robotics experiments.

animation numpy pygame python robot-simulation robotics trajectory-visualization

Last synced: 18 Sep 2025

https://github.com/shotadft/vecposlib

This is a library that enables vector calculations in Python.

math math-library numpy python python-lib python-library python3

Last synced: 18 Sep 2025

https://github.com/lovesaroha/learning-neural-networks

Various concepts of neural networks applied in python (numpy) to help people get started with AI.

batch-normalization dropout gradient-descent logistic-regression neural-network numpy python regularization

Last synced: 08 May 2026

https://github.com/giacomolat/realestateai-solutions---a-forecasting-model-for-the-housing-market

This project applies regularization techniques (Ridge, Lasso, and Elastic Net) to improve real estate price forecasting. This project focuses on reducing overfitting and increasing the stability of regression models' predictions

cross-validation elasticnet-regression lasso-regression-model machine-learning-algorithms matplotlib matplotlib-pyplot numpy pandas python regularization-methods regularization-to-avoid-overfitting ridge-regression-model seaborn standardization

Last synced: 01 May 2026

https://github.com/abhishek-patidar066/synthetic-datasets

Synthetic datasets are artificially generated data used for training machine learning models, simulating real-world data while ensuring privacy.

clustering datasets jupyter-notebook libraries matplotlib-pyplot numpy pandas-dataframe python random sklearn

Last synced: 15 Apr 2026

https://github.com/crvargasm/metnumun2021i

Universidad Nacional de Colombia - Métodos Numéricos ~ Grupo 1 2021-I

matplotlib numerical-analysis numpy pandas python quantopian

Last synced: 12 Apr 2026

https://github.com/anushkapal28/maatri-care

MaatriCare is an AI-based healthcare application designed to provide personalized nutrition advice and maternal care recommendations for pregnant women, lactating mothers, and women in general.

gemini-api jupyter-notebook labelencoder numpy pandas python3 random-forest streamlit svm-training

Last synced: 18 Sep 2025

https://github.com/akash-207/ecommerce-monetization-analysis

“Revenue and monetization analysis of Brazilian e-commerce data (Olist dataset). Includes KPIs, trends, and business insights.”

business-analytics matplotlib mysql numpy pandas python seaborn

Last synced: 09 Apr 2026

https://github.com/noxs1d/cv-emotion-classification

This project is a deep learning-based system for detecting human emotions from facial images using TensorFlow, OpenCV, and NumPy.

cnn cnn-classification dockerfile keras numpy opencv tensorflow transfer-learning

Last synced: 09 Apr 2026

https://github.com/spacebakery/crunchie-munchies-project

Analyze Data with Pyhton

numpy python statistics

Last synced: 27 Jul 2025

https://github.com/prath0m/python-numpy

Python basic Numpy Program

machine-learning numpy python python3

Last synced: 09 Apr 2026

https://github.com/chernyakid/russian-film-distribution

Исследование российского кинопроката

jupyter numpy pandas python seaborn

Last synced: 15 May 2026

https://github.com/ourway/simple-cnn

A simple CNN implementation using numpy

convolutional-neural-networks numpy

Last synced: 10 May 2026

https://github.com/jain1shh/solar-flare-prediction

This repository contains code and data for predicting solar flare energy ranges using machine learning, based on NASA's RHESSI mission data. It includes preprocessing of FITS files into a unified CSV dataset and implements models like Gradient Boosting, Random Forest, and Decision Tree classifiers, achieving accuracies up to 87%.

data-visualization machine-learning numpy pandas python scikit-learn solar-flare-prediction

Last synced: 09 Apr 2026

https://github.com/alejoduarte23/fast_fdd

Fast implementation of frequency domain decomposition (FDD) in python with multiple identification techniques

numpy scipy-signal

Last synced: 10 May 2026

https://github.com/mayankmittal29/tensortinker_statistical_methods_in_ai

This repository contains implementations of various machine learning algorithms from scratch, including Multi-Layer Perceptron (MLP), Gaussian Mixture Models (GMM), Principal Component Analysis (PCA), Autoencoders, and Variational Autoencoders.

autoencoder-mnist cupy gmm-clustering image-segmentation matplotlib-pyplot mlp-classifier mlp-regressor mnist-dataset numpy pandas pca python3 pytorch roc-auc seaborn torch variational-autoencoder

Last synced: 09 Apr 2026

https://github.com/nguyenanhtuan1912/datatable-image-to-text

Repository này là đồ án môn học Computer Vision

computer-vision javascript nodejs numpy opencv python tesseract

Last synced: 09 Apr 2026

https://github.com/vyjayanthipolapragada/image_classifier_cnn_data_augmentation

A deep learning project using Convolutional Neural Networks (CNNs) to classify CIFAR-10 images. The model leverages data augmentation, batch normalization, and ReLU activation to improve performance and generalization. Includes training and evaluation scripts for multi-class image classification.

adam-optimizer convolutional-neural-networks data-augmentation deep-learning image-classification jupyter-notebook neural-networks numpy optimizer pandas python pytorch relu-layer

Last synced: 09 Apr 2026

https://github.com/paul-bokelman/nn

Basic neural network in python

machine-learning neural-networks numpy python

Last synced: 09 May 2026

https://github.com/v-mayya/resale-revolution

Empower Hacks Hackathon Submission

html matplotlib numpy pandas python sqlite streamlit

Last synced: 09 Apr 2026

https://github.com/mirzaazwad/tymbert

TYMBert is our submission for NCIM 2025, a spam classifier that makes use of knowledge distillation to compress the model while preserving accuracy

bert huggingface-transformers knowledge-distillation machine-learning matplotlib numpy pandas python3 scikit-learn tiny-bert torch

Last synced: 09 Apr 2026

https://github.com/jcardonamde/datasets_ml

This project analyzes cab and limousine travel data in New York City. This with the goal of predicting the total duration of trips within the city. Machine learning models were used.

data-science machine-learning machine-learning-algorithms matplotlib numpy pandas pipelines python seaborn sklearn

Last synced: 09 Apr 2026

https://github.com/shafaq-aslam/predicting-heart-disease-risk-with-logistic-regression-techniques

Develop a predictive model using logistic regression techniques to assess heart disease risk based on patient health metrics and data analysis.

data-analysis heart-disease logistic-regression machine-learning machine-learning-models matplotlib numpy pandas python scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/stefagnone/text_adventure_game

A text-based adventure game project using Python fundamentals

matplotlib numpy pandas python r scikit-learn seaborn sql

Last synced: 09 Apr 2026

https://github.com/Khushi130404/CatNet

CatNet is a simple machine learning project that classifies images as either a cat or not a cat using logistic regression. The dataset consists of labeled images of cats and non-cats, preprocessed and used to train a binary classification model.

h5py matplotlib numpy pillow scipy

Last synced: 19 Sep 2025

https://github.com/n-t-raghava/the_sweet_16

This project detects faces in real-time from a webcam feed or an uploaded image and predicts age (in bins of 5 years) and gender (Male/Female). The model is based on OpenCV’s Deep Neural Network (DNN) module and pre-trained models for face, age, and gender detection.

caffemodel deep-learning flask numpy opencv python

Last synced: 09 Apr 2026

https://github.com/kiok46/subset-sum-problem

Solving the Subset Sum Problem using Python, Pandas and Numpy.

numpy pandas python subset-sum

Last synced: 05 May 2026

https://github.com/shauryashaurya/marty_mcfly

Code, text and notebooks on a tutorial for Introduction to Machine Learning using open sources

anaconda jupyter-notebooks machine-learning machine-learning-tutorials notebook numpy python regression scikit-learn scipy tutorial

Last synced: 09 Apr 2026

https://github.com/nandit123/python_on_excel

Data Analysis using python libraries on excel data

csv data-analysis data-science fill fluctuations graph numpy python python-library

Last synced: 16 May 2026

https://github.com/haseebulhassan437/lstm-next-word-predictor

A deep learning-based LSTM model for predicting the next word in a sequence using natural language processing techniques.

keras-tensorflow lstm numpy tensorflow

Last synced: 18 Apr 2026

https://github.com/bagusperdanay7/absa-with-bilstm-undergraduate-thesis

My undergraduate thesis program, Aspect-Based Sentiment Analysis Towards Matket Place Application Review Using Bidirectional Long Short-Term Memory used Python, Keras and Tensorflow

ai aspect-based-sentiment-analysis bilstm deep-learning gensim imbalanced-learning ipython-notebook keras machine-learning matplotlib natural-language-processing nltk numpy pandas python scikit-learn seaborn tensorflow

Last synced: 11 Apr 2026

https://github.com/rajireddy15/employee-attrition-prediction-hr-analytics-

Employee Attrition Prediction (HR Analytics) helps organizations analyze employee data, identify factors driving turnover, and predict attrition using machine learning and visual dashboards, enabling data-driven HR decisions and retention strategies.

data-cleaning data-collection data-manipulation data-preprocessing data-science data-visualization eda feature-engineering imbalanced-data machine-learning mysql-database numpy pandas

Last synced: 04 May 2026

https://github.com/ghaniketrajputp005/adobe-gensolve-hackathon-2024

This Project aims to identify, regularize, and beautify curves in 2D Euclidean space.

cv2 keras-tensorflow numpy os sklearn

Last synced: 07 Feb 2026

https://github.com/xkomil/datasciencesummerstudy

I want to document in this repository all my studying data science topics

data-science data-visualization machine-learning meteostat numpy pandas seaborn sklearn streamlit-webapp

Last synced: 05 May 2026

https://github.com/adijo/multilayeredperceptron

An implementation of a multi-layered perceptron.

deep-learning machine-learning numpy

Last synced: 01 May 2026