An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/elifftosunn/textdataclean

Kirli veri çekildiğinde ön işleme adımlarına gerek kalmadan model eğitimi için hazır hale getirmek amacıyla yapılan uygulamadır.

corpus deasciifier morphological-analysis ngram nltk numpy pandas sentence-embedding sentence-tokenizer stemmer stopwords string turkish turkish-sentence-tokenizer word-tokenizer

Last synced: 20 May 2026

https://github.com/kr1shnasomani/speedvision

Vehicle speed detection using Computer Vision, Deep Learning and Neural Network

bytetrack computer-vision deep-learning neural-network numpy opencv pytorch supervision tqdm yolo

Last synced: 05 Mar 2025

https://github.com/jt4v4res/neural-networks-and-deep-learning

Some neural networks implementation with purpose of understanding the use of this learning algorithm and deep learning.

deep-learning machine-learning machine-learning-algorithms neural-networks numpy perceptron-learning-algorithm

Last synced: 16 Jan 2026

https://github.com/davgiles/ML-Projects

This repository contains my projects from the Data Science & Business Analytics Post-Graduate Program through UT Austin.

eda matplotlib numpy pandas python scikit-learn scipy seaborn visualization xgboost

Last synced: 20 Jan 2026

https://github.com/csengupta1101/python-concepts

All basic to advanced Python Concepts Will be covered here in depth.

data-science django dsa-algorithm flask machine-learning numpy pandas python rest-api software-development

Last synced: 08 Apr 2026

https://github.com/pramodyasahan/house-price-prediction

This repository contains the code for a machine learning model aimed at predicting housing prices. The model is based on the RandomForestRegressor algorithm from the scikit-learn library and utilizes feature selection, preprocessing, and pipeline techniques for improved performance.

machine-learning numpy pandas python scikit-learn

Last synced: 08 Apr 2026

https://github.com/annygutierrez/ai-chatbot-python

Deep learning chatbot: Chatbot model to make predictions and chat back and forth with our user.

nltk numpy tensorflow tflearn

Last synced: 06 May 2026

https://github.com/harrisonwelch/pythondatascience

Repo of code from the linked-in lesson "Python: Data Analysis"

data data-science matplotlib notes numpy python tutorial

Last synced: 12 Apr 2026

https://github.com/thekartikeyamishra/aipoweredmarketingassistant

AI-Powered Marketing Assistant, an advanced tool designed to enhance your digital marketing campaigns using the power of machine learning (ML) and large language models (LLMs). This project empowers small businesses and MSMEs to create compelling content, analyze campaigns, and strategize effectively.

artificial-intelligence llm matplotlib numpy openai pandas python scikit-learn streamlit

Last synced: 08 Apr 2026

https://github.com/byte-sourcerer/array_contract

Contract programming for Pytorch, Numpy

contracts ndarray numpy python python3 pytorch tensor

Last synced: 14 Sep 2025

https://github.com/01one/python-online-with-pyodide

Run and edit python code including matplotlib and numpy on the browser with pyodide and codemirror

matplotlib-python numpy pyodide python-browser python-online python-online-compiler

Last synced: 29 Mar 2025

https://github.com/mikhailkhorokhorin/the-loop

Student project of team "полетики-самолетики"

3dsmax krpc ksp math matplotlib numpy python

Last synced: 20 May 2026

https://github.com/andriikot/smartmatch

Smart Match with python

docker faiss huggingface numpy ptyhon pythorch

Last synced: 04 Apr 2026

https://github.com/dulajkavinda/diabetes-detection

💊detect diabetes using machine learing

added machinelearning numpy pandas python scikit

Last synced: 14 Sep 2025

https://github.com/sandeepannandi/slicecam

SliceCam is a fun, camera-controlled game powered by OpenCV and Mediapipe that lets you slash flying fruits with real-time hand tracking.

collision-detection handtracking mediapipe mediapipe-hands motiondetection numpy opencv pygame python

Last synced: 06 May 2026

https://github.com/abynxv/automatedexamsheetgrader-ocr

This Python project automates grading of handwritten or printed answer sheets using OCR (Optical Character Recognition). It extracts student answers from scanned images, compares them to an answer key, and generates a detailed report with class statistics.

numpy ocr-recognition opencv pandas python3 tesseract

Last synced: 27 Jun 2025

https://github.com/baschin1103/principal_component_analysis

In this repository you find a python program and the prints and 3D-visualization of it. After the KNN-Classification I wanted to know which variables have the most relevance for the results. One approach for this is the Principal-Component-Analysis (PCA). More details in the python program as comments.

3d-printing labelencoder matplotlib numpy pandas principal-component-analysis sklearn-library sqlalchemy standardization variable-importance

Last synced: 13 May 2026

https://github.com/joeylemon/python-kmeans

an implementation of the K-means clustering algorithm in Python

k-means-clustering matplotlib ml numpy python

Last synced: 21 May 2026

https://github.com/kumpatlapavankumar/recognizing-flowers-with-convolutional-neural-networks

This project uses a Convolutional Neural Network (CNN) to classify flower images into five categories: Daisy, Dandelion, Rose, Sunflower, and Tulip. The model is trained on the Flowers Recognition dataset with image augmentation techniques to enhance generalization.

cnn-keras coding data-science deep-learning enjoying flower-classification machine-learning numpy opencv2 pandas pathlib pillow python tensorflow testing training

Last synced: 08 Apr 2026

https://github.com/vyjayanthipolapragada/human_detection_hog

Detect humans in real-time, image or video and count the detected people using OpenCV and HOG Descriptor

computer-vision easygui hog-descriptor human-detection image-processing imutils numpy opencv os python support-vector-machine

Last synced: 06 May 2026

https://github.com/suryavamsi-p/youtube-trends-and-tracks-engagement

This project aims to leverage Python’s robust visualization libraries to derive deep insights from YouTube video trends data. Using the 'youtube.csv' dataset, this analysis explores the dynamics of YouTube video attributes such as views, likes, dislikes, and comment counts to understand user engagement and their emotional insights.

dash datavisualization hypothesis-testing matplotlib normality-test numpy pandas python3 scipy seaborn visual-studio-code

Last synced: 03 Nov 2025

https://github.com/praju-1/data_science_projects

It contains the necessary code, datasets, and documentation to understand, replicate, and build upon the project's findings and methodologies.

data-science datasets deep-learning exploratory-data-analysis machine-learning matplotlib numpy pandas predictive-analytics python seaborn sklearn statistics visualization

Last synced: 08 Apr 2026

https://github.com/abhishekbagdiya01/movies-recommendation-system

This repository contains the code for a movie recommendation system built using Jupyter Notebook.

aiml jupyter-notebook numpy pandas python scikit-learn

Last synced: 08 Apr 2026

https://github.com/marknature/opencv-bootcamp

If you are looking to take your first steps towards learning Computer Vision and AI using OpenCV, this is the best OpenCV course to jumpstart your career.

anaconda computer-vision cv2-library ffmpeg jupyter-notebook matplotlib numpy opencv os urllib zipfile

Last synced: 02 Mar 2025

https://github.com/abhishek-k-git/feature-engineering-using-stock-exchange-data

Creating quantitative trading strategies requires historical and real-time data and involves comparing models against future returns post-training.

lightgbm matplotlib numpy pandas plotly seaborn

Last synced: 12 Apr 2026

https://github.com/omarsaad21/credit-train-data-science-project

This a full web application to predict the credit score of clients plus I did many visulizations to express many insights in chart

eda matplotlib ml numpy pandas python sklearn streamlit-webapp

Last synced: 09 Apr 2026

https://github.com/caioandrian/data-analyst-procon

Análise dos dados do Procon, período de 2013 à 2016.

dataset numpy pandas python

Last synced: 11 Apr 2026

https://github.com/emredemirbas/google-playstore-eda

Exploratory Data Analysis (EDA) of the Google Play Store dataset — examining trends in app ratings, categories, pricing, and user engagement using Python and LaTeX.

exploratory-data-analysis matplotlib numpy pandas python seaborn

Last synced: 03 May 2026

https://github.com/utkarsh251106/tracking-with-yolo

This project uses YOLOv8 and DeepSORT to detect and track children and adults in video streams. It assigns unique IDs, handles re-tracking after occlusions, and outputs an annotated video with labeled bounding boxes.

computer-vision deep-learning deepsort machine-learning numpy opencv python torch ultralytics yolov8

Last synced: 30 Jan 2026

https://github.com/nishthasharma-22/binary-black-hole-merger-gravitational-waves-simulation

Repository of all astronomy related projects, including: Gravitational Waves graph from binary black hole merger

astrophysics blackhole-merger gravity-simulator matplotlib numpy python scipy

Last synced: 05 May 2026

https://github.com/priyasingh26/financial_document-data_extraction

This project extracts key information from financial documents like invoices and receipts using text recognition. It processes images, classifies documents, and extracts data, which is then stored in a CSV file. The aim is to automate data collection from scanned documents, reducing manual work and increasing accuracy.

data-extraction numpy ocr pandas pillow preprocessing pytesseract-ocr python sklearn torch transformers

Last synced: 08 Apr 2026

https://github.com/chawthinn/car-price-prediction-regression-ml

This project predicts used car prices using regression models including Linear, Ridge, Random Forest, XGBoost, and LightGBM. It covers preprocessing, EDA, model evaluation, hyperparameter tuning, and model persistence using Scikit-learn and related libraries.

car-price-prediction lgbmregressor linearregression numpy pandas python scikit-learn xgbregressor

Last synced: 08 Apr 2026

https://github.com/haydencordeiro/terafeed

Terafeed - Addressing Zero Hunger in Africa (Sustainability Goal SDG 2)

javscript numpy pandas powerbi python scikit-learn tableau vuejs

Last synced: 08 Apr 2026

https://github.com/5hraddha/zyfra-gold-recovery-prediction

Zyfra is a pioneering developer of efficiency solutions for heavy industries & is aiming to take help of machine learning to optimize the efficiency in Gold Ore processing

decisiontreeregressor dummyregressor linearregression numpy pandas randomforestregressor scipy seaborn smape supervised-learning

Last synced: 08 Apr 2026

https://github.com/aysh2603/twitter-sentiment-analysis

The Twitter Sentiment Analysis project employs Natural Language Processing (NLP) techniques to classify tweets into positive or negative sentiments. By analyzing the tone of tweets, this project provides insights into public sentiment on various topics.

hyperparameter-tuning nlp-machine-learning numpy pandas python3 scikit-learn

Last synced: 13 Jul 2025

https://github.com/rdvdev2/tf-test

Mostres de ML i IA amb TensorFlow basades en els tutorials oficials per al PR

numpy python research-project tensorflow tensorflow-tutorials

Last synced: 22 Mar 2025

https://github.com/anmamun0/data-analysis-home-cleaning-services

This repository contains the analysis and visualization of data from a home cleaning services dataset. The project provides valuable insights into revenue generation, customer trends, and regional performance, helping businesses make data-driven decisions.

matplotlib numpy pandas

Last synced: 05 Mar 2025

https://github.com/singhvishal003/stock-price-predict

Stock Price Prediction model using yfinance data .

keras matplotlib numpy pandas yfinance

Last synced: 08 Apr 2026

https://github.com/beolawork-art/novabank-churn-analysis

NovaBank has noticed that customers are closing accounts or going inactive, and they want to understand why.

data-analysis data-science-projects data-visualization eda machine-learning numpy pandas python scikit-learn sql

Last synced: 08 Apr 2026

https://github.com/shahbazshaddy/explainable-multimodal-ai-for-breast-cancer-and-pneumonia-prediction

A deep learning-based framework integrating explainable multimodal AI for accurate prediction and transparent diagnosis of breast cancer and pneumonia.

deep-learning explainable-ai grad-cam groq-api llm machine-learning matplotlib multimodal numpy pandas python pytorch scikit-learn seaborn streamlit

Last synced: 08 Apr 2026

https://github.com/lintangwisesa/flask_sklearn_californiahouse

Simple Flask App with Linear Regression model to predict California house pricing 🏠

california-house-pricing flask linear-regression machine-learning numpy pandas sklearn

Last synced: 12 Apr 2026

https://github.com/shreyavats23/sign_language_recognition

Sign Language Recognition using Deep Learning : ~ Built a model to recognize American Sign Language (ASL) for letters A–Z and digits 1–9 ~ Achieved ~89% validation accuracy using a Convolutional Neural Network (CNN) ~ Used data augmentation techniques to improve generalization ~ Visualized predictions to test the model on real samples

artificial-intelligence keras machine-learning matplotlib numpy python pytorch seaborn sklearn tensorflow

Last synced: 08 Apr 2026

https://github.com/ashithapallath/comparison-of-ssd-sad-in-motion-estimation-of-video-coding

Course assignment for Algorithm and Massive Datasets comparing SAD and SSD for motion estimation. Includes analysis of accuracy, speed (Python/NumPy), and error sensitivity. Highlights trade-offs for speed-critical vs. precision-focused applications.

l1-normalization l2-norm numpy pytho video-cod

Last synced: 04 Apr 2025

https://github.com/hariprasath-v/machinehack_analytics_olympiad_2023

Create a machine learning model to determine the likelihood of a customer defaulting on a loan based on credit history, payment behavior, and account details.

binaryclassification catboost exploratory-data-analysis machine-learning numpy pandas python scikit-learn shap

Last synced: 08 Apr 2026

https://github.com/priyanshulathi/cancer-diagnosis-prediction-model

A Machine Learning project to predict cancer malignancy using K-Nearest Neighbor, Support Vector Machine, and Decision Tree algorithms.

machine-learning numpy pandas python scikit-learn

Last synced: 03 Jan 2026

https://github.com/alphacrypto246/insurance-charges-prediction

The Predicting Insurance Charges project uses Decision Tree Regression to predict insurance charges based on features like age, sex, BMI, and smoking habits. It involves data preprocessing, feature scaling, and model evaluation with metrics like MAE and R².

machine-learning numpy pandas scikit-learn scikitlearn-machine-learning

Last synced: 03 May 2026

https://github.com/dmarks84/ind_project_superstore-sales-time-series-analysis--kaggle

Independent Project - Kaggle Dataset-- I worked on the Superstore Sales Dataset, performing (as Part 1) data cleaning and preparation and exploratory data analysis. The main task was to make predictions for future sales based on time-series analysis, which is found in Part 2.

chloropleth data-modeling data-visualization eda linear-regression matplotlib numpy pandas python seaborn sklearn statistics statsmodels supervised-ml time-series-analysis

Last synced: 09 Apr 2026

https://github.com/eljandoubi/disasterresponsepipeline

Project aim is to build a Natural Language Processing (NLP) model to categorize messages on a real time basis.

flask nltk numpy pandas plotly scikit-learn scipy sqlalchemy

Last synced: 09 Apr 2026

https://github.com/armahdavi/data_pipeline_analytics_statistics_ML_PM_PSD_residential_QFF

Sharing all the data pipelines and processing codes, statistical modellings, descriptive statistics, plot visualizations, and machine learning from Mahdavi & Siegel (2021) (Indoor Air) Project Miestone: 2017 - 2020 Full-length article: https://onlinelibrary.wiley.com/doi/abs/10.1111/ina.12782

data-science data-visualization dust hvac indoor-air-quality jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats spyder spyder-python-ide statistics

Last synced: 17 Sep 2025

https://github.com/harmanveer-2546/campus-placement-prediction

Student Campus-Placement Prediction ML Project using Python

decision-trees machine-learning matplotlib numpy python random-forest scipy seaborn

Last synced: 08 Mar 2026

https://github.com/kushalagarwalla/netflix-movie-data-analysis

🚀 Netflix Data Analytics Project 🎬📊 | Analyzed 9K+ movies to uncover insights on genres, popularity, votes & release trends. Includes EDA, KPIs & visualizations using Python (Pandas, NumPy, Matplotlib, Seaborn). Supports data-driven content & engagement strategy.

data-analysis data-visualization jupyter-notebook numpy pandas python seaborn

Last synced: 06 May 2026

https://github.com/kandil2001/lid-cavity-evolution

Lid Cavity Evolution is an open-source CFD suite for the lid-driven cavity problem, featuring MATLAB, Python, and parallel solvers with benchmark comparisons.

benchmarking cfd cfd-simulation lid-driven-cavity matlab mpi numpy open-source openfoam openmp parallel-computing python scientific-computing simple-algorithm star-ccm

Last synced: 06 May 2026

https://github.com/jaketherealone/machinelearningtoolset

Collection of primary tools used in the field of machine learning - Written for numpy projects

machine-learning numpy python toolset

Last synced: 19 Jun 2026

https://github.com/rohanbanerjee1234567-cell/prediction-of-expected-salary-using-machine-learning

Here is my first Project Repository where I have made a Machine Learning Project using Python. The Problem statement was to train a model based on the given Dataset and from there we need to Predict the Expected Salary of an Employee who will have similar profiles.

exploratory-data-analysis linearregression matplotlib-pyplot numpy pandas randomforest randomforestregressor scikit-learn scikitlearn-machine-learning searborn visualization

Last synced: 27 Apr 2026

https://github.com/radithsandeepa/robot-path-visualizer

Welcome to Robot Path Visualizer! This tiny project uses Python, Pygame, and NumPy to replay a precomputed trajectory (x, y, θ). It rotates a robot sprite frame-by-frame and draws a trail, making it handy for diff-drive path playback, simulation demos, and quick visual debugging of robotics experiments.

animation numpy pygame python robot-simulation robotics trajectory-visualization

Last synced: 18 Sep 2025

https://github.com/crvargasm/metnumun2021i

Universidad Nacional de Colombia - Métodos Numéricos ~ Grupo 1 2021-I

matplotlib numerical-analysis numpy pandas python quantopian

Last synced: 12 Apr 2026

https://github.com/akash-207/ecommerce-monetization-analysis

“Revenue and monetization analysis of Brazilian e-commerce data (Olist dataset). Includes KPIs, trends, and business insights.”

business-analytics matplotlib mysql numpy pandas python seaborn

Last synced: 09 Apr 2026

https://github.com/spacebakery/crunchie-munchies-project

Analyze Data with Pyhton

numpy python statistics

Last synced: 27 Jul 2025

https://github.com/jain1shh/solar-flare-prediction

This repository contains code and data for predicting solar flare energy ranges using machine learning, based on NASA's RHESSI mission data. It includes preprocessing of FITS files into a unified CSV dataset and implements models like Gradient Boosting, Random Forest, and Decision Tree classifiers, achieving accuracies up to 87%.

data-visualization machine-learning numpy pandas python scikit-learn solar-flare-prediction

Last synced: 09 Apr 2026

https://github.com/mayankmittal29/tensortinker_statistical_methods_in_ai

This repository contains implementations of various machine learning algorithms from scratch, including Multi-Layer Perceptron (MLP), Gaussian Mixture Models (GMM), Principal Component Analysis (PCA), Autoencoders, and Variational Autoencoders.

autoencoder-mnist cupy gmm-clustering image-segmentation matplotlib-pyplot mlp-classifier mlp-regressor mnist-dataset numpy pandas pca python3 pytorch roc-auc seaborn torch variational-autoencoder

Last synced: 09 Apr 2026

https://github.com/v-mayya/resale-revolution

Empower Hacks Hackathon Submission

html matplotlib numpy pandas python sqlite streamlit

Last synced: 09 Apr 2026

https://github.com/shafaq-aslam/predicting-heart-disease-risk-with-logistic-regression-techniques

Develop a predictive model using logistic regression techniques to assess heart disease risk based on patient health metrics and data analysis.

data-analysis heart-disease logistic-regression machine-learning machine-learning-models matplotlib numpy pandas python scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/haseebulhassan437/lstm-next-word-predictor

A deep learning-based LSTM model for predicting the next word in a sequence using natural language processing techniques.

keras-tensorflow lstm numpy tensorflow

Last synced: 18 Apr 2026

https://github.com/rajireddy15/employee-attrition-prediction-hr-analytics-

Employee Attrition Prediction (HR Analytics) helps organizations analyze employee data, identify factors driving turnover, and predict attrition using machine learning and visual dashboards, enabling data-driven HR decisions and retention strategies.

data-cleaning data-collection data-manipulation data-preprocessing data-science data-visualization eda feature-engineering imbalanced-data machine-learning mysql-database numpy pandas

Last synced: 04 May 2026

https://github.com/ajxxxs/spotify-music-analysis

spotify Music (web scraped playlists ) analysis (over 3 states) , trends, features and a music recommendation system.

matplotlib numpy panda scikit-learn seaborn

Last synced: 28 Jul 2025

https://github.com/kikoveiga/feup-ia1

Artificial Intelligence (IA) First Project (2023/2024): BSc in Informatics and Computing Engineering @ FEUP

feup feup-ia feup-leic genetic-algorithms hill-climbing numpy pandas python simulated-annealing tabu-search

Last synced: 06 May 2026

https://github.com/arjunravi26/glucose-monitoring

Glucose Monitoring system to monitor patients and notify whenever glucose goes high.

machine-learning numpy pandas plotly python3 streamlit xgbboostregressor

Last synced: 30 Jul 2025

https://github.com/valmir-unicap/deteccao-de-mascaras-faciais

Experimento baseado no artigo: A Face Mask Detection Algorithm Based on YOLO

kaggle-dataset mobilenetv2 numpy opencv python torch yolo

Last synced: 12 Apr 2026

https://github.com/werctfourth/python-border-autocrop2

Python script that crops borders from images v2

border crop image-processing libvips numba numpy python python3

Last synced: 17 May 2026

https://github.com/nagipragalathan/python_tutorial_for_data-science

This repository is a comprehensive guide for learning data science using Python. It covers various essential libraries and tools commonly used in the field of data science, including Jupyter Notebook, Matplotlib, NumPy, Pandas, Scikit-learn, and PyTorch.

datascience datavisualization deeplearning jupyter jupyter-notebook learning-by-doing learningresources machinelearning matplotlib numpy opensource pandas python python-script python3 pytorch pytorch-implementation scikitlearn tutorial

Last synced: 09 Apr 2026

https://github.com/muavia1/roman-urdu-poetry-generation-using-lstm

Here’s a short description you can add to your GitHub project: Roman Urdu Poetry Generator A deep learning project using LSTM and TensorFlow to generate Roman Urdu poetry. Trained on a poetry dataset and deployed with a Gradio interface for interactive text generation.

deep-learning gradio gradio-interface lstm model-training numpy pandas poetry-generator python tensorflow

Last synced: 09 Apr 2026

https://github.com/sridharyadav07/machine-learning-project-bankruptcy-prevention-

The project explores multiple machine learning algorithms and evaluates their performance using various metrics, such as accuracy and confusion matrices. The models tested include Logistic Regression, K-Nearest Neighbors (KNN), Naive Bayes, and Support Vector Machine (SVM). In addition, regularization techniques (L1, L2) are used to avoid overfit.

data-preprocessing evaluation machine-learning-models matplotlib-pyplot modelbuilding modeldeployment numpy pandas python scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/dmarks84/ind_project_european-soccer-top-points-contributors--kaggle

Independent Project - Kaggle Dataset-- I worked on the European Soccer Dataset, using SQL (SQLite) to read in the data and then data wrangling before running statistical analysis and hypothesis testing on questions of who helps earn the most points for their team.

data-wrangling hypothesis-testing numpy p-values pandas python scipy-stats statistics t-test

Last synced: 09 Apr 2026

https://github.com/blinksta1ker/co2-prediction

EDA and Supervised ML model using Regression to predict Co2 Emissions from Vehicles

jupyter-notebook machine-learning numpy pandas python regression seaborn sklearn supervised-learning

Last synced: 09 Apr 2026

https://github.com/dpgitaccount/project-customer-segmentation-analysis-with-power-bi

This project segments customers based on their purchasing behavior to identify different target groups. It demonstrates skills in data analysis, clustering, and visualization using Power BI.

kmeans-clustering-algorithm linear-regression numpy pandas powerbi python seaborn segmentation

Last synced: 09 Apr 2026

https://github.com/0xunkn0wn4m1r/data_engineering_banking_project

🏦 Build a complete data engineering workflow for a banking system, showcasing ETL processes, data transformations, and an interactive financial dashboard.

automation data-analysis data-cleaning data-science feature-engineering fintech-bank flask-api loan-default-prediction machine-learning mlops model-explainability numpy postgresql scikit-learn segmentation shap sql unsupervised-learning

Last synced: 09 Apr 2026