An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/shubhamsoni98/prediction-with-binomial-logistic-regression

To predict client subscription to term deposits and optimize marketing strategies by identifying potential subscribers.

binomial data data-science eda machine-learning matplotlib pipeline python scikit-learn seaborn sklearn sql visualization

Last synced: 06 Feb 2026

https://github.com/jofaval/sonar

Binary Classification of Sonar Signals of Rocks and Metal cylinders in 1987

data-analysis data-science data-visualization machine-learning python scikit-learn sonar uci

Last synced: 09 Apr 2026

https://github.com/elprofesoriqo/chrome-extension-gmail-spam-filter

Chrome extension that automatically identifies and moves emails marked as spam to the spam folder in Gmail.

api-client chrome-extension firebase-database javascript machine-learning python scikit-learn

Last synced: 09 Apr 2026

https://github.com/rahul-120/crop_recom

This project is a Machine Learning based Crop Recommendation System built using Flask. It helps farmers or users decide the most suitable crop to grow based on soil nutrients and environmental conditions.

crop-recommendation-system flask flask-application machine-learning python3 scikit-learn

Last synced: 02 May 2026

https://github.com/anuranjanjain/video-upscaler

A web app designed for upscaling video to HD resolution using a custom denoise filter and OpenCV

artificial-intelligence opencv python scikit-learn tailwindcss

Last synced: 24 Jan 2026

https://github.com/pyzit/recommandation-engine-in-drf-sk-learn

Full Stack Movie Recommendation System Project made in Django REST Framework and React JS

api django django-rest-framework movies reactjs recommender-system scikit-learn

Last synced: 28 Jan 2026

https://github.com/shahaba83/airplane-ticket-cancellation

In this project, we try to predict the probability that a buyer will cancel a plane ticket

datatime numpy pandas python scikit-learn seaborn

Last synced: 25 Feb 2026

https://github.com/tamk-kol/project_orbital_data_analysis

The goal of this project is to develop an automatic method to detect orbital maneuvers using machine learning.

matplotlib numpy pandas scikit-learn

Last synced: 30 Jan 2026

https://github.com/diiblo/la-poste-predictive-flux

Daily forecasting of parcel flows in La Poste sorting centers. Complete pipeline: data generation, LightGBM modeling, orchestration via Airflow (Docker), PostgreSQL storage, and an interactive Streamlit dashboard. Project completed as part of the Mastère 2 in Data Engineering at ECE Paris.

airflow docker postgresql scikit-learn streamlit

Last synced: 31 Jan 2026

https://github.com/emv271828/diabetes_cdc_uci_machine_learning

Second assessment for the Artificial Intelligence course at Universidade Federal Fluminense.

jupyter-notebook machine-learning pandas python scikit-learn

Last synced: 15 Apr 2026

https://github.com/jofaval/titanic-disaster

Data Analysis of the famous Titanic Disaster in 1912 with Machine Learning

classification data-analysis data-science data-visualization google-colab kaggle machine-learning python scikit-learn

Last synced: 15 Apr 2026

https://github.com/sarmad426/ai

AI basic to advanced featuring Machine Learning, Deep Learning and Data Science.

ai data-science deep-learning hugging-face machine-learning numpy pandas python scikit-learn

Last synced: 15 Apr 2026

https://github.com/samiyaalizaidi/nn-ml-homeworks

Homework solutions for CPE-4903: Neural Networks & Machine Learning at Kennesaw State University.

machine-learning machine-learning-workflow neural-networks numpy scikit-learn

Last synced: 15 Apr 2026

https://github.com/as1467/canada-per-capita-income-prediction

This project is a simple machine learning exercise to predict Canada's per capita income based on historical data. The dataset used in this project was sourced from the CodeBasics GitHub repository and is used here to practice linear regression as part of my machine learning studies.

machine-learning matplotlib-pyplot pandas python scikit-learn

Last synced: 15 Apr 2026

https://github.com/nikitalpopov/evotor_champ

Solution for the Evotor data challenge

data-analysis data-science python scikit-learn

Last synced: 15 Apr 2026

https://github.com/nits2612/data-science-projects

Portfolio of data science projects I completed during the PGP AI/ML program, through self-study, and as a hobby.

data data-science dataanalysis deep deep-learning keras machine-learning matplotlib numpy opencv pandas python scikit-learn seaborn surprise-python tensorflow transfer-learning

Last synced: 01 Feb 2026

https://github.com/sarowarahmed/advertising-sales-app

📈 Advertising Sales Predictor: A web app powered by a Machine Learning model, built with Numpy, Pandas, Scikit-learn, and Streamlit, to forecast sales based on TV, Newspaper, and Online Advertising. Deployed on Streamlit Cloud for real-time, easy-to-use predictions.

advertising app machine-learning multiple-linear-regression numpy pandas sales scikit-learn streamlit

Last synced: 07 Feb 2026

https://github.com/aerojam95/math70076-data-science-cw2

This repository presents the second coursework for the MATH70076 Data Science module at Imperial College London, where the project showcases different machine and deep learning models for image classification

data-science deep-learning machine-learning python3 pytorch scikit-learn

Last synced: 15 Apr 2026

https://github.com/danicaalana/wine-dataset-decision-tree

This project was developed as part of Digital Skill Fair (DSF) 35.0 - Data Science by Dibimbing. I am using the Wine Recognition Dataset from scikit-learn, which contains the results of a chemical analysis of wines grown in the same region of Italy by three different cultivators.

data data-analysis-python data-science decision-tree-classification machine-learning python scikit-learn wine-dataset

Last synced: 18 Apr 2026

https://github.com/sarowarahmed/predicting-kolkata-house-price

🏠 Predicting Kolkata House Price: A web app powered by a Machine Learning model, built with Numpy, Pandas, Scikit-learn, and Streamlit, to predict house prices in Kolkata. Deployed on Streamlit Cloud for easy access and real-time predictions.

app kolkata linear-regression machine-learning numpy pandas scikit-learn streamlit

Last synced: 26 Feb 2026

https://github.com/brossend/automl_bank_project

Automated ML pipeline for the UCI Bank Marketing dataset: ETL, Optuna-based AutoML, model evaluation, MLflow logging, pytest tests, Docker, and CI/CD.

automl bank-marketing binary-classification ci-cd classification data-science docker docker-compose etl github-actions gitlab-ci machine-learning ml-pipeline mlflow model-monitoring optuna pytest python scikit-learn uci-dataset

Last synced: 02 Jun 2026

https://github.com/0eix/ibm-ds-spacex-falcon9

IBM Professional data science certificate Final Project Notebooks

data-science data-visualization exploratory-data-analysis ibm poetry scikit-learn shap

Last synced: 11 Feb 2026

https://github.com/nurulashraf/predictive-maintenance-analysis-for-machine-failure-prevention

Predictive maintenance analysis for machine failure prevention using sensor data and ML. Built a Random Forest model and Gradio dashboard to identify high-risk machines for proactive maintenance.

data-science failure-prediction gradio industrial-iot machine-learning power-bi predictive-maintenance python scikit-learn

Last synced: 16 Apr 2026

https://github.com/cego669/dirtycategoriesencoding

Repository containing two classes (StringAgglomerativeEncoder and StringDistanceEncoder) useful for grouping or visualizing the distance between dirty categorical variables. They are compatible with the scikit-learn API.

category clustering dimensionality-reduction dirty hierarchical-clustering machine-learning scikit-learn singular-value-decomposition svd

Last synced: 11 Feb 2026

https://github.com/arshc0der/n.o.v.a-geospatial-ozone-predictor

An AI-powered geospatial intelligence dashboard for predicting atmospheric ozone levels using 27 years of NASA data. Features 3D climate mapping and live satellite tracking.

atmospheric-science climate-tech dashboard-ui data-visualization desktop-app geospatial-analysis gis machine-learning matplotlib ozone-prediction pandas python random-forest-regressor satellite-tracking scikit-learn tkinter windows-executable

Last synced: 01 Mar 2026

https://github.com/smuralee/machine-learning-samples

Machine learning samples

pytorch scikit-learn

Last synced: 15 Feb 2026

https://github.com/quran-yeamen/serverlifecycleml

Predictive modeling of server lifecycle stages using synthetic data and machine learning.

data-science machine-learning predictive-modeling python scikit-learn synthetic-data

Last synced: 15 Feb 2026

https://github.com/paultheal1en/dsc-fact-checking

Fact-checking project classifying claims as SUPPORTED, REFUTED, or NEI. Uses ANN, DNN, RNN, CNN, Random Forest, PhoBERT, and Sentence Transformers.

deep-learning fact-checking keras machine-learning nlp phobert random-forest scikit-learn sentence-transformers tensorflow transformers

Last synced: 16 Apr 2026

https://github.com/pramodyasahan/health-insurance-cost-prediction

This project focuses on predicting health insurance costs using a polynomial regression model. By employing machine learning techniques in Python, the project aims to accurately estimate insurance costs based on various personal attributes. The model takes into account several features, including age, sex, BMI, number of children, smoking status, etc.

machine-learning matplotlib numpy pandas python3 scikit-learn

Last synced: 16 Apr 2026

https://github.com/piotrwnuczek/cloudprediction

Predicting cloud task execution time using AI/ML

matplotlib pandas python scikit-learn

Last synced: 16 Apr 2026

https://github.com/meiyor/abatech_ai_test

This repository contains the files for an Exploratory Data Analysis (EDA) of participant demographic and company-based data collected by the outsourcing service of ABATech, a company located in Colombia. It also includes an evaluation of three different classifiers used to decode the users' level of satisfaction.

keras python scikit-learn scikitlearn-machine-learning tensorflow

Last synced: 16 Apr 2026

https://github.com/archish27/pythontutorial

Python programming tutorial for new geeks who want to learn Python from scratch and apply it to various applications

matplotlib numpy pandas pygame python python-2 python-3 scikit-learn soup

Last synced: 01 Apr 2026

https://github.com/dan-niles/iris-ml

Machine learning on the Iris dataset

iris-dataset machine-learning scikit-learn

Last synced: 16 Apr 2026

https://github.com/sahiltiwariiii/dssp

Predicting student math scores! This project uses advanced machine learning techniques and MLOps tools like DVC and MLflow to predict a student's math score based on various factors such as gender, race/ethnicity, parental level of education, lunch type, test preparation course, writing, etc.

docker dotenv dvc flask machine-learning mlflow mlops mysql mysql-connector-python numpy pandas pymysql python python-dotenv scikit-learn seaborn sklearn-library statistics streamlit

Last synced: 27 Mar 2026

https://github.com/sanikamal/deep-learning-atoz

A collection of deep learning architectures, models, code snippets, tips, and mini projects.

computer-vision deep-learning nlp scikit-learn skimage tensorflow

Last synced: 16 Apr 2026

https://github.com/supershivam5/python_projects

💻 Python programming with NumPy, Pandas, Matplotlib. 🌟 Love exploring new technologies. Check out my projects!

matplotlib-pyplot numpy pandas scikit-learn seaborn

Last synced: 17 Apr 2026

https://github.com/ejw-data/proj-food-inspections

Analyzing Chicago Food Inspection data for interesting insights by combining multiple data resources and performing feature engineering.

decision-trees pandas preprocessing python scikit-learn

Last synced: 17 Apr 2026

https://github.com/vaishnavis03/finlatics_ml_program

This repository contains the .ipynb files for 3 datasets, along with a PPT for each. The datasets included are Facebook Marketplace Data, Sales Prediction Data, and Wine Quality data.

correlation data-analysis data-science data-visualization knn linear-regression machine-learning matplotlib numpy pandas random-forest-classifier scikit-learn

Last synced: 17 Apr 2026

https://github.com/iamwatchdogs/cardiovascular-risk-prediction

This mini-project uses machine learning algorithms to predict possible risks of heart disease by analyzing given data.

jupyter-notebook machine-learning-algorithms matplotlib numpy pandas python scikit-learn seaborn

Last synced: 02 Apr 2026

https://github.com/akshitvats026/heart_disease_prediction

An ML-based Heart Disease Prediction System that predicts the likelihood of heart disease based on user health parameters. Built using Python, Pandas, and Scikit-learn, the system performs data preprocessing, trains a predictive model, and provides real-time predictions with high accuracy.

accuracy-score logistic-regression machine-learning matplotlib-pyplot numpy pandas python scikit-learn

Last synced: 02 Apr 2026

https://github.com/raphael-ufrj/analise_algodao

Historical analysis of cotton planting; analysis of planting based on climate and historical data.

analysis data-science data-visualization dataset docker pandas provenance python python3 scikit-learn seaborn streamlit

Last synced: 02 Apr 2026

https://github.com/isshiki/machine-learning-with-python

Repository distributing the notebooks used in the @IT article series "Introduction to Machine Learning with Python" (Pythonで学ぶ「機械学習」入門).

data-science machine-learning machinelearning-python python scikit-learn

Last synced: 17 Apr 2026

https://github.com/orliluq/inmersion-datos-python

Develop machine learning models to predict the probability of customer credit default, using different classification algorithms (Logistic Regression, Decision Trees, Random Forest, Naive Bayes).

colab-notebook numpy pandas python scikit-learn

Last synced: 02 Apr 2026

https://github.com/felixamaladhas/amazon-reviews-sentiment-analysis

This is a sentiment analysis project that classifies Amazon product reviews as positive or negative using machine learning techniques.

matplotlib numpy pandas python scikit-learn

Last synced: 02 Apr 2026

https://github.com/a-poor/sample-model-serve

Demo for using Flask to serve a scikit-learn model as an API

api data-science docker flask machine-learning scikit-learn

Last synced: 30 Apr 2026

https://github.com/ngangawairimu/linear-regression-

This project builds a linear regression model in Python to predict outcomes and derive insights from feature data. It covers data cleaning, feature analysis, and model evaluation, showcasing predictive modeling techniques using scikit-learn, pandas, and visualization libraries.

data-analysis linear-regression machine-learning predictive-modeling python scikit-learn

Last synced: 17 Apr 2026

https://github.com/mangesh-balkawade/pythonautomationsscripts

This repository contains the Python automation scripts, machine learning case studies, and Python projects that I have written to learn automation and ML using Python.

automation data-science machine-learning-algorithms matplotlib mongodb pandas python3 scikit-learn seaborn webscraping

Last synced: 13 Apr 2026

https://github.com/rosieoh/emergency_dataanalysis

Open data analysis: data analysis for policy proposals on the emergency medical system

ipython matplotlib numpy pandas python scikit-learn scipy

Last synced: 04 Apr 2026

https://github.com/belzebu013/prever_nivel_colesterol

AI project using a multiple linear regression algorithm to predict an individual's cholesterol level.

ia jupiter-notebook pandas python regressao-linear-multipla scikit-learn

Last synced: 17 Apr 2026

https://github.com/mryutaro/spla3clip

spla3clip: a tool for Splatoon 3 that automatically analyzes the times at which kills and deaths occurred

fastapi python react scikit-learn typescript

Last synced: 04 Apr 2026

https://github.com/rickyarians/ai-ml-nlp

Directory of Machine Learning, Deep Learning, Artificial Intelligence, and Natural Language Processing projects

deep-learning machine-learning modeling python scikit-learn tensorflow

Last synced: 04 Apr 2026

https://github.com/bjpcjp/scikit-learn

Updates in progress. Jupyter workbooks will be added as time allows.

python python3 scikit-learn

Last synced: 18 Apr 2026

https://github.com/justsecret123/nba-players-stats-analysis

A quick interactive Notebook to visualize some NBA players stats (points, assists, steals, blocks...) and totals, rankings and comparisons. Feel free to add any player in the .csv data files. 🏀

csv ipython-notebook ipywidgets jupyter-notebook jupyterlab matplotlib pandas python scikit-learn seaborn

Last synced: 18 Apr 2026

https://github.com/gattsu001/telecom-churn-predictor

Predicts which telecom customers are likely to churn with 95% accuracy using engineered features from usage, billing, and support data. Implements Sturges-based binning, one-hot encoding, stratified 80/20 train-test split, and a two-level ensemble pipeline with soft voting. Achieves 94.60% accuracy, 0.8968 AUC, 0.8675 precision, 0.7423 recall.

churn-prediction classification classification-algorithm customer-retention data-science data-visualization feature-engineering joblib jupyter-notebook machine-learning pandas scikit-learn supervised-learning svm

Last synced: 18 Apr 2026

https://github.com/pedroteixeiraw/variational_quantum_circuit_binary_classification

This project focuses on developing a Variational Quantum Circuit capable of performing Binary Classification between two classes: red wine and white wine, based on their characteristics using machine learning.

binary-classification cost-function json machine-learning matplotlib numpy pandas qiskit qiskit-machine-learning quantum-machine-learning scikit-learn training-data variational-circuit

Last synced: 04 Apr 2026

https://github.com/alainlebret/python-et-ia-1

Personal resources for the "Python & IA" (Python and AI) course, second-year GPSE at ENSICAEN

artificial-intelligence image-processing machine-learning matplotlib numpy python scikit-image scikit-learn

Last synced: 04 Apr 2026

https://github.com/yashsonaar/machine-learning-tasks

This repository contains machine learning tasks, including classification, a recommendation system, and a fraud detection system

classification jupyter-notebook machine-learning numpy pandas prediction python scikit-learn testing

Last synced: 04 Apr 2026

https://github.com/chengetanaim/high-school-alcoholism-and-academic-performance

Student Alcoholism and Academic Performance Data Analysis

jupyter-notebook scikit-learn

Last synced: 18 Apr 2026

https://github.com/eugen-goebel/predictive-analytics-agent

Automated ML pipeline — data profiling, preprocessing, model training, and evaluation report generation

automation data-science docker machine-learning predictive-analytics python scikit-learn streamlit

Last synced: 05 Apr 2026

https://github.com/akhundmuzzammil/energyconsumptionprediction

This repository contains code and resources for training a linear regression model to predict energy consumption based on various building parameters.

data-analysis energy-consumption linear-regression machine-learning python scikit-learn streamlit visualization

Last synced: 18 Apr 2026

https://github.com/alezoon/movie-revenue-prediction

Sk-learn practice using Linear Regression, ML workflow practice.

jupyter machine-learning matplotlib-pyplot numpy pandas python scikit-learn

Last synced: 05 Apr 2026

https://github.com/simrandalal/semantic-book-recommender

A semantic content-based book recommender using sentence-transformer embeddings, cosine similarity, and a Streamlit interface.

dotenv huggingface-transformers nlp-machine-learning pandas python scikit-learn similarity-search streamlit

Last synced: 05 Apr 2026

https://github.com/deliprofesor/game-search-volume-prediction-machine-learning-models-and-forecasting

This repository uses machine learning models like Random Forest, XGBoost, LightGBM, and time-series forecasting with Prophet to predict game search volumes. Additionally, Grid Search is applied for hyperparameter tuning of the LightGBM model.

data-cleaning data-science data-visualization feature-selection forecasting-models game-search grid-search hyperparameter-tuning lightgbm machine-learning pandas prophet python random-forest scikit-learn time-series-analysis time-series-forecasting xgboost

Last synced: 18 Apr 2026

https://github.com/malick08012/heart-disease-prediction

A machine learning project that predicts the risk of heart disease based on patient health data. Includes data cleaning, EDA, visualization, model training, evaluation and feature importance analysis

artificial-intelligence heartdisease-prediction logistic-regression machine-learning python scikit-learn

Last synced: 18 Apr 2026

https://github.com/jeffandyalltogether/mlrecommendationsystem

Project code for a recommendation system for Amazon using collaborative filtering, ranking, and matrix factorization to enhance customer satisfaction and product discovery.

eda matplotlib pandas python scikit-learn seaborn tensorflow

Last synced: 05 Apr 2026

https://github.com/barek2k2/ml_ruby

A Ruby gem that uses machine learning (ML) techniques to make predictions and classifications, powered by Python 3 under the hood.

artificial-intelligence data-science machine-learning pandas prediction python3 ruby ruby-on-rails scikit-learn

Last synced: 05 Apr 2026

https://github.com/oadultradeepfield/galaxy10-anomaly-detection

A public API and experimental PyTorch pipeline for anomaly detection in the Galaxy10 DECals dataset using ResNet50, autoencoders, and clustering techniques

flask google-cloud-run kaggle pytorch scikit-learn

Last synced: 05 Apr 2026

https://github.com/perpendicooler/elementary-research-for-steamboat-willie-s-store-in-poland

An elementary study for a company opening a store in a city, using Gurobi and PuLP optimization.

christofides-algorithm gurobipy numpy pandas pulp python3 scikit-learn travelling-salesman-problem

Last synced: 05 Apr 2026

https://github.com/vijaykumarr1452/black_friday_sales_analysis

Black Friday Sales Analysis: a Python machine learning project using pandas and scikit-learn for data preprocessing, model training, and performance evaluation.

confusion-matrix jupyter-notebook machine-learning pandas python random-forest-classifier sales-analysis scikit-learn

Last synced: 19 Apr 2026

https://github.com/kaladabrio2020/machine-learning-with-pytorch-and-scikit-learn

Progress on the book Machine Learning with PyTorch and Scikit-Learn

deep-learning implementation machine-learning python3 pytorch scikit-learn

Last synced: 20 Apr 2026

https://github.com/vyjayanthipolapragada/car_mileage_prediction

Predicting the mileage of a car using a linear regression model with scikit-learn

kaggle-titanic linear-regression machine-learning numpy pandas predictive-modeling python scikit-learn

Last synced: 20 Apr 2026

https://github.com/bruceunx/ai-simulator

aiplayground: an artificial intelligence learning playground

ai maching-learning scikit-learn

Last synced: 20 Apr 2026

https://github.com/grandechowhiskey/harvard-cs50-ai-projects

This project contains a collection of programming assignments from CS50’s Introduction to Artificial Intelligence with Python course.

html python scikit-learn tensorflow

Last synced: 20 Apr 2026

https://github.com/himasnhu-at/freecodecamp--ml

ML models I built for my freeCodeCamp Machine Learning with Python certification

freecodecamp freecodecamp-project machine-learning machine-learning-algorithms matplotlib pandas python scikit-learn

Last synced: 20 Apr 2026

https://github.com/tr-3n/-ai-powered-resume-analyzer-multi-source-job-matcher

AI-Powered Resume Analyzer & Multi-Source Job Matcher is a web application built using Python and Streamlit that helps job seekers find the best job opportunities based on their resume. The app extracts text from uploaded resumes, matches it with job listings from multiple sources, and displays the most relevant jobs.

ai api html-css job job-recommendation job-search jobmatching natural-language-processing pandas pypdf2 python resume-analyzer scikit-learn streamlit web-development

Last synced: 20 Apr 2026