An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/g-eoj/kaggle-rotten-tomatoes

Movie review sentiment analysis with the Stanford parsed Rotten Tomatoes dataset.

cross-validation nlp nltk rotten-tomatoes scikit-learn

Last synced: 12 May 2026

https://github.com/quran-yeamen/serverlifecycleml

Predictive modeling of server lifecycle stages using synthetic data and machine learning.

data-science machine-learning predictive-modeling python scikit-learn synthetic-data

Last synced: 15 Feb 2026

https://github.com/srosalino/prediction_of_seoul_bikes_demand

The objective of this project is to predict the number of bicycles needed to be made available each hour in order to make the service as efficient as possible

cross-validation data-exploration-and-preprocessing hyperparameter-tuning machine-learning regularization-methods scikit-learn

Last synced: 13 May 2026

https://github.com/eshrathaziz/heart-disease-risk-assessment

Predicting heart disease risk using machine learning for Healthcare Insights.

data-science jupyter-notebook learning machine python scikit-learn

Last synced: 07 May 2026

https://github.com/ayaarbi/prediction_des_maladies_cardiovasculaires_avec_ml

Ce projet, développé au sein de cours de Machine Learning, utilise des algorithmes de classification supervisée pour prédire la présence de maladies cardio-vasculaires à partir de données médicales publiées sur Kaggle.

cardiovascular-diseases jupyter-notebook machine-learning matplotlib pandas python scikit-learn

Last synced: 07 May 2026

https://github.com/n1k1f0rm/car-price-predict

By car characteristics you can predict it prise

fastapi ml scikit-learn streamlit

Last synced: 07 May 2026

https://github.com/garimarao24/customer-churn-project

This repository contains a Customer Churn Prediction project that leverages Machine Learning techniques to predict customer churn and segment customers using clustering.

customer-churn kmeans-clustering logistic-regression machine-learning pca scikit-learn

Last synced: 07 May 2026

https://github.com/rishi035/advanced-house-price-predictions

This is my First Project and also participated in kaggle competition

linear-regression machine-learning python random random-forest regressor-models scikit-learn

Last synced: 07 May 2026

https://github.com/tony123105/comp4423_garbage_classification

Garbage classification using traditional machine learning approaches (HOG, LBP, SIFT features with SVM, KNN, Random Forest classifiers) and an ensemble method to categorize waste into 10 types.

computer-vision feature-extraction garbage-classification hog image-classification knn lbp machine-learning opencv python random-forest scikit-learn sift svm

Last synced: 07 May 2026

https://github.com/pspanoudakis/machine-learning-nlp

NLP 🤖 📖 projects on Vaccine Sentiment Classification 💉 and Question Answering 💬

bert-fine-tuning glove-embeddings neural-networks pytorch question-answering rnn scikit-learn sentiment-classification softmax-regression squad

Last synced: 07 May 2026

https://github.com/nicovandenhooff/wids-datathon-2022

This repository contains solution for the 2022 Women in Data Science Kaggle competition that I participated in, which obtained a top 10% leaderboard standing.

catboost data-visualization datascience energy-consumption ensemble-learning exploratory-data-analysis kaggle lightgbm machine-learning scikit-learn women-in-data-science xgboost

Last synced: 07 May 2026

https://github.com/royxlead/production-drift-detection

Production ML monitoring library - KL, PSI, MMD, and ADWIN drift detectors with empirical benchmarks, confidence tracking, and a 6-page FastAPI dashboard.

data-drift drift-detection fastapi kl-divergence mlops mmd model-monitoring production-ml psi pytorch scikit-learn uncertainty-quantification

Last synced: 23 Jun 2026

https://github.com/msikorski93/heart-failure-prediction

The subject of this repository was to perform binary classification based on respondent's collected features (age, cholesterol level, fasting blood sugar, thallium stress test results, etc.).

classification knn-classifier logistic-regression random-forest-classifier roc-curves scikit-learn svm-classifier

Last synced: 13 May 2026

https://github.com/mateusoliveira30/house-prices

This project was developed for the Kaggle competition "House Prices - Advanced Regression Techniques." The goal is to predict house sale prices using advanced regression techniques, including feature engineering, Random Forests, and Gradient Boosting.

kaggle-competition machine-learning scikit-learn

Last synced: 13 May 2026

https://github.com/andrewsy1004/linear-regression-model-for-house-price-prediction

A linear regression model to predict house prices based on features like size, location, and number of rooms. This project demonstrates the application of machine learning in real estate price estimation

linear-regression python scikit-learn xgbregressor

Last synced: 07 May 2026

https://github.com/mwasifanwar/automl_framework

Comprehensive AutoML framework that automates data preprocessing, feature engineering, model selection, hyperparameter tuning, and deployment. Features neural architecture search and automated data cleaning pipelines.

automl automl-algorithms data-science data-science-projects feature-engineering feature-engineering-algorithm feature-engineering-ml hyperparameter-optimization machine-learning machine-learning-algorithms machine-learning-models mlops mlops-workflow python scikit-learn scikit-learn-python

Last synced: 07 May 2026

https://github.com/tedim52/discjockey

a content-based recommender system for your party playlist preferences

jupyter-notebook matplotlib pandas scikit-learn spotify-web-api

Last synced: 07 May 2026

https://github.com/nafis2508/maternal-neonatal-outcome-prediction

Predicting Maternal and Neonatal Birth Outcomes using Machine Learning on 61,018 Healthcare Records from Kenya and Uganda

data-science decision-tree eda healthcare-ai healthcare-analytics machine-learning maternal-health predictive-modeling python random-forest scikit-learn

Last synced: 24 Jun 2026

https://github.com/cnoret/hexa-watts

Interactive data visualization and machine learning app for energy consumption analysis and prediction in France, built with Streamlit. (Text in French)

data-visualization electricity-forecasting energy-analysis france machine-learning scikit-learn streamlit

Last synced: 07 May 2026

https://github.com/johanneswiesner/skplot

A python package for extracting, plotting and reporting information from one or multiple sklearn classification & prediction pipelines.

plotting python scikit-learn sklearn visualization

Last synced: 14 May 2026

https://github.com/henrytseng/example_docker_scikit-learn

A quick example of using Scikit-Learn from a Docker container

docker scikit-learn

Last synced: 08 May 2026

https://github.com/imosudi/model_training

Breast Cancer Diagnosis: Logistic Regression, Random Forest, k-NN and Decision Tree classifiers models with feature importance analysis - Includes data exploration, train/test splitting, feature scaling, cross-validation, and model evaluation metrics with confusion matrices and decision boundary visualisation

classification data-science decision-tree educational feature-importance k-nearest-neighbors linear-regression machine-learning model-evaluation python3 random-forest scikit-learn

Last synced: 25 Jun 2026

https://github.com/anusha-me/disease-x-detection-ml-project

A machine learning classification system for early detection of Disease X based on patient symptoms using Python, Scikit-learn, and Streamlit.

classification data-science disease-prediction healthcare-ai machine-learning medicaldata scikit-learn streamlit

Last synced: 08 May 2026

https://github.com/aravindnathan02/machine-learning-projects

Machine Learning and Deep Learning projects which mainly focuses on predictive modeling.

deep-learning machine-learning neural-networks predictive-modeling python scikit-learn tensorflow

Last synced: 08 May 2026

https://github.com/thekartikeyamishra/data-preprocessor

A Google Colab module for interactive data preprocessing. Handles missing values, categorical encoding (One-Hot, Label), and numerical scaling (Standard, MinMax). Outputs a cleaned dataset

ipywidgets numpy pandas python scikit-learn

Last synced: 08 May 2026

https://github.com/prajjwal6969/recommender-system-using-python

A collection of content-based recommendation systems for songs and movies using Python and machine learning.

content-based-filtering cosine-similarity machine-learning movie-recommendation python recommender-system scikit-learn song-recommendation

Last synced: 08 May 2026

https://github.com/khaymanii/calories-burnt-prediction-model

This model was built using Python and XGBoost Regression algorithm

matplotlib numpy pandas python scikit-learn

Last synced: 16 Apr 2026

https://github.com/belzebu013/prever_nivel_colesterol

Projeto de IA com algoritmo de Regressão Linear múltipla para prever o nível de colesterol de um individuo.

ia jupiter-notebook pandas python regressao-linear-multipla scikit-learn

Last synced: 17 Apr 2026

https://github.com/mryutaro/spla3clip

spla3clip: キル・デスした時刻を自動で解析するスプラトゥーン3用ツール

fastapi python react scikit-learn typescript

Last synced: 04 Apr 2026

https://github.com/rickyarians/ai-ml-nlp

Directory Machine Learning, Deep Learning, Artificial Int, Natural Language Processing Project

deep-learning machine-learning modeling python scikit-learn tensorflow

Last synced: 04 Apr 2026

https://github.com/deepanshkhurana/udacityproject-prediciting-boston-housing-prices

This is a Udacity Project for the Machine Learning Nanodegree. Here, we are trying to predict Boston Housing Prices using sklearn.

data-analysis data-science machine-learning python scikit-learn udacity

Last synced: 08 May 2026

https://github.com/27ahmad/movie-recommendation-system

Welcome to the Movie Recommendation System! This project uses Streamlit to provide personalized movie recommendations based on user preferences and similarity.

movie-recommendation numpy pandas python scikit-learn

Last synced: 04 Apr 2026

https://github.com/bjpcjp/scikit-learn

Updates in progress. Jupyter workbooks will be added as time allows.

python python3 scikit-learn

Last synced: 18 Apr 2026

https://github.com/justsecret123/nba-players-stats-analysis

A quick interactive Notebook to visualize some NBA players stats (points, assists, steals, blocks...) and totals, rankings and comparisons. Feel free to add any player in the .csv data files. 🏀

csv ipython-notebook ipywidgets jupyter-notebook jupyterlab matplotlib pandas python scikit-learn seaborn

Last synced: 18 Apr 2026

https://github.com/samkazan/fraud-detection-ml

Machine learning models for enhanced fraud detection in e-commerce transactions, exploring feature engineering, distance prediction, and clustering analysis.

clustering data-science data-visualization dataanalytics dbscan eda hierarchical-clustering kmeans-clustering knn-imputer matplotlib mlxtend python scikit-learn seaborn xgboost

Last synced: 08 May 2026

https://github.com/rescurib/random_forest_arduino_uno

Ejemplo de implementación de un clasificador de bosque aleatorio en un Arduino UNO usando scikit-learn y m2cgen.

arduino scikit-learn tinyml

Last synced: 18 Apr 2026

https://github.com/yelamankarassay/personal-health-wellness-dashboard

A Streamlit-based dashboard for visualizing and analyzing personal daily data—weight, mood, meals, sleep, and more. This project uses pandas, plotly, matplotlib, seaborn, scikit-learn, and wordcloud to present insights about your health and daily habits.

matplotlib pandas plotly scikit-learn seaborn wordcloud

Last synced: 17 Apr 2026

https://github.com/tanim-mishkat/data-science-prediction-model-pds-course-

Diabetes Progression Prediction Using Regression Analysis: This project uses regression analysis in Python to predict diabetes progression based on medical and physiological data. Includes data preprocessing, model training, evaluation, and visualizations.

data-science machine-learning python regression scikit-learn

Last synced: 19 Apr 2026

https://github.com/pedroteixeiraw/variational_quantum_circuit_binary_classification

This project focuses on developing a Variational Quantum Circuit capable of performing Binary Classification between two classes: red wine and white wine, based on their characteristics using machine learning.

binary-classification cost-function json machine-learning matplotlib numpy pandas qiskit qiskit-machine-learning quantum-machine-learning scikit-learn training-data variational-circuit

Last synced: 04 Apr 2026

https://github.com/sentinel-ml/sentinel_ai

Machine Learning Model to detect fraud in financial systems

ai python pytorch scikit-learn security security-tools tensorflow

Last synced: 04 Apr 2026

https://github.com/alainlebret/python-et-ia-1

Ressources personnelles du cours "Python & IA" en 2e année GPSE à l'ENSICAEN

artificial-intelligence image-processing machine-learning matplotlib numpy python scikit-image scikit-learn

Last synced: 04 Apr 2026

https://github.com/adhadse/hands-on-machine-learning-book-notes-and-practice

This repo holds the Jupyter notebooks and datasets containing notes/comments on things I learned from this book. Feel free to use and learned from them.

data-science deep-learning jupyter-notebooks keras machine-learning python scikit-learn tensorflow

Last synced: 04 Apr 2026

https://github.com/yashsonaar/machine-learning-tasks

This repository has machine learning tasks which include classification, recommendation system, fraud detection system

classification jupyter-notebook machine-learning numpy pandas prediction python scikit-learn testing

Last synced: 04 Apr 2026

https://github.com/anushrey10/fuel_efficiency_predictor

Welcome to the Fuel Efficiency Predictor! This advanced tool uses machine learning to predict your vehicle's fuel efficiency based on various characteristics.

decision-tree gradient-boosting-classifier html-css-javascript linear-regression machile-learning matplotlib python random-forest scikit-learn tailwindcss

Last synced: 18 Apr 2026

https://github.com/giacomolat/object-detection-sperimental-thesis-for-degree

In this repository is my experimental thesis work on the recognition of museum works through object detection techniques.

convolutional-neural-networks detectron2 jupyter-notebook machine-learning neural-networks object-detection python pytorch rcnn rcnn-model scikit-learn

Last synced: 18 Apr 2026

https://github.com/sundanc/weatherprediction

This project implements a weather prediction system that predicts the temperature based on real-time weather data, including features like humidity, wind speed, and day-related features (day of the week, month

machine-learning machinelearning numpy pandas programming python scikit-learn scikitlearn-machine-learning weather-prediction

Last synced: 18 Apr 2026

https://github.com/icejan/predicton-systems

Various systems that train on data and generate a prediction

lightfm machine-learning numpy python scikit-learn

Last synced: 08 May 2026

https://github.com/akhundmuzzammil/energyconsumptionprediction

This repository contains code and resources for training a linear regression model to predict energy consumption based on various building parameters.

data-analysis energy-consumption linear-regression machine-learning python scikit-learn streamlit visualization

Last synced: 18 Apr 2026

https://github.com/alezoon/movie-revenue-prediction

Sk-learn practice using Linear Regression, ML workflow practice.

jupyter machine-learning matplotlib-pyplot numpy pandas python scikit-learn

Last synced: 05 Apr 2026

https://github.com/oriolventur/assignment-2-model-creation

Assignment 2 from Artificial Intelligence 1 course: Model creation using synthetic data and scikit-learn.

jupyter-notebook model-creation python scikit-learn

Last synced: 08 May 2026

https://github.com/seyha1007/amazon-reviews-analysis

🧐 This project analyzes Amazon Fine Food Reviews to investigate whether negative reviews are more emotionally intense and lexically repetitive than positive ones. Using R, we apply sentiment analysis and lexical diversity metrics to uncover patterns in consumer review language.

acp amazon-reviews bert data-analytics glove jupyter-notebook lstm-sentiment-analysis machine-learning nltk random-forest scikit-learn sentiment-classification sentimental-analysis support-vector-machine

Last synced: 08 May 2026

https://github.com/murugavl/flower-prediction

Flower Prediction is a machine learning project that uses the Iris dataset to classify iris flowers into three species: Setosa, Versicolor, and Virginica. The project includes data analysis, model training with various algorithms, and deployment via a Flask web application for user-friendly predictions.

flask machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/taqsblaze/hush

Hush: A lightweight, context-aware text toxicity classifier. Leveraging NLP and Random Forest ensemble learning to detect and mitigate harmful language in real-time. Built for efficiency, safety, and cleaner digital communication.

content-moderation machine-learning nlp random-forest safety-tools scikit-learn text-classification toxicity-detection

Last synced: 05 Apr 2026

https://github.com/pablonunes/houseprediction

This a simple model to predict housing price in King County in Washingthon. Uses Scikit Learn, Numpy. Seaborn, Pandas, Scipy.

housing-data housing-prices scikit-learn scikitlearn-machine-learning seaborn

Last synced: 17 Apr 2026

https://github.com/rosieoh/emergency_dataanalysis

오픈데이터분석-응급의료체계 방안 정책 제안 데이터 분석

ipython matplotlib numpy pandas python scikit-learn scipy

Last synced: 04 Apr 2026

https://github.com/manalisbhavsar/mall-customers-clustering

K-Means clustering to mall customer data, segmenting customers based on their annual income and spending score. To identify patterns and group customers for targeted marketing.

data-analysis data-visualization matplotlib numpy pandas python scikit-learn

Last synced: 18 Apr 2026

https://github.com/jeffandyalltogether/mlrecommendationsystem

project code for a recommendation system for Amazon using collaborative filtering, ranking, and matrix factorization to enhance customer satisfaction and product discovery.

eda matplotlib pandas python scikit-learn seaborn tensorflow

Last synced: 05 Apr 2026

https://github.com/yashrajgithub/crop-recommendation

KrishiGyaan is a web app designed to help farmers make informed decisions on crop selection. By analyzing soil and environmental factors, the app provides personalized crop recommendations, enhancing agricultural productivity and promoting sustainable farming practices.

api artificial-intelligence crop-recommendation-system data-preprocessing data-visualization json machine-learning-algorithms pickle python random-forest-classifier scikit-learn streamlit supervised-learning train-test-split user-interface

Last synced: 05 Apr 2026

https://github.com/barek2k2/ml_ruby

Ruby gem uses Machine Learning(ML) techniques to make predictions and classifications, and it's powered by Python3 under the hood.

artificial-intelligence data-science machine-learning pandas prediction python3 ruby ruby-on-rails scikit-learn

Last synced: 05 Apr 2026

https://github.com/emilyfelker/ieee_cis_fraud_detection

Which online transactions are fraudulent? Program that uses various machine learning algorithms to detect fraud.

decision-trees kaggle logistic-regression machine-learning neural-network pandas poetry pytest python scikit-learn sklearn tensorflow xgboost

Last synced: 05 Apr 2026

https://github.com/oadultradeepfield/galaxy10-anomaly-detection

A public API and experimental PyTorch pipeline for anomaly detection in the Galaxy10 DECals dataset using ResNet50, autoencoders, and clustering techniques

flask google-cloud-run kaggle pytorch scikit-learn

Last synced: 05 Apr 2026

https://github.com/lexxai/goit_python_ds_hw_04

Модуль 4. Класифікація та оцінка роботи моделі. Лінійна регресія: перенавчання та регуляризація

lasso-regression linear-regression numpy pandas python red regression ridge-regression scikit-learn

Last synced: 05 Apr 2026

https://github.com/perpendicooler/elementary-research-for-steamboat-willie-s-store-in-poland

An elementary research for a company to opening store in a city using gurobi and pulp optimization.

christofides-algorithm gurobipy numpy pandas pulp python3 scikit-learn travelling-salesman-problem

Last synced: 05 Apr 2026

https://github.com/labex-labs/supervised-learning-regression

Supervised Learning: Regression | This repo collects 7 of programming labs exercises for Supervised Learning: Regression. Supervised learning. If you are hearing or reading this term for the first time, then it may be completely unclear what it means. Don't worry. In this lab, you will get a comp...

challenges course exercises hands-on labex labs machine-learning playgroud programming scikit-learn

Last synced: 08 May 2026

https://github.com/rohansardar/speechflowguard

A machine learning web API that detects toxic language in user comments using classical ML

docker logistic-regression machine-learning python3 scikit-learn tf-idf tfidf-text-analysis tfidf-vectorizer

Last synced: 17 Apr 2026

https://github.com/lorenzorottigni/ml-movies

Machine Learning python bootcamp: Recommender Systems on movies dataset

ipynb machine-learning numpy pandas python recommender-system scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/shingiraibhengesa/house-price-predictor

A machine learning project that predicts house prices based on user input features such as square footage, number of bedrooms, and more.

machine-learning-models matplotlib numpy python scikit-learn seaborn

Last synced: 09 May 2026

https://github.com/thekartikeyamishra/ai-customer-feedback-summarizer

The AI Customer Feedback Summarizer is a Python-based application that processes customer feedback, extracts insights, and summarizes reviews. This basic version uses extractive summarization techniques, and the advanced version integrates advanced sentiment analysis, visualization, and industry-specific fine-tuning.

ai chatbot gpt machine-learning matplotlib nltk pandas python scikit-learn streamlit

Last synced: 18 Apr 2026

https://github.com/aasjunior/mlapp-api

Esta API fornece endpoints para aplicar algoritmos de aprendizado de máquina, como K-Nearest Neighbors (KNN), Árvore de Decisão e Algoritmo Genético. Realizado como tarefa da disciplina de Laboratório Mobile/Computação Natural no 5º Semestre de Desenvolvimento de Software Multiplataforma.

fastapi machine-learning python scikit-learn

Last synced: 09 May 2026

https://github.com/vijaykumarr1452/black_friday_sales_analysis

Black Friday Sales Analysis python machine learning project using pandas and scikit-learn for data preprocessing, model training, and performance evaluation.

confusion-matrix jupyter-notebook machine-learning pandas python random-forest-classifier sales-analysis scikit-learn

Last synced: 19 Apr 2026

https://github.com/kheriberto/linear_regression_ecommerce

Simple project showcasing crafting a linear regression model with SciKit Learn

data-analysis jupyter-notebook linear-regression pandas python scikit-learn seaborn

Last synced: 19 Apr 2026

https://github.com/yassin522/heartbeat-categorization

This project is aimed at developing a machine learning model that can accurately classify heartbeats as either normal or abnormal. The model is trained on a dataset of ECG (electrocardiogram) signals, which were collected from patients and labeled by medical professionals.

cnn deep-learning keras machine-learning scikit-learn tensorflow

Last synced: 20 Apr 2026

https://github.com/kaladabrio2020/machine-learning-with-pytorch-and-scikit-learn

Progress on the book machine learning with pytorch and scikit-learn

deep-learning implementation machine-learning python3 pytorch scikit-learn

Last synced: 20 Apr 2026

https://github.com/davidrpugh/kaust-cs-294w

Course materials for KAUST CS 294W

deep-learning machine-learning pytorch scikit-learn

Last synced: 09 May 2026

https://github.com/radoslawregula/iris-classification

Jupyter notebook implementing an efficient machine learning method to classify flowers from the Iris data set.

classification iris-dataset jupyter-notebook machine-learning python scikit-learn softmax-classifier

Last synced: 09 May 2026

https://github.com/prahaladhchandrahasan/housingprices_adavanced_regression

A machine learning model for "House Prices: Advanced Regression Techniques" kaggle competition.

machine-learning-algorithms matplotlib-pyplot numpy pandas python3 scikit-learn

Last synced: 20 Apr 2026

https://github.com/namratha2301/carprice_analysisandprediction

This project analyzes factors influencing vehicle prices using a dataset of various attributes, including Engine capacity, Power, Mileage, and Seating capacity.

data-analysis data-visualization exploratory-data-analysis machine-learning pandas predictive-modeling random-forest-classifier regression scikit-learn seaborn

Last synced: 20 Apr 2026

https://github.com/bruceunx/ai-simulator

aiplayground 人工智能学习乐园

ai maching-learning scikit-learn

Last synced: 20 Apr 2026

https://github.com/dahsie/spam_classification

Ce fut mon prémier projet NLP où j'ai réalisé la détection de spam en utilisant les algorithmes d'embedding pour encorder mes textes. J'ai utilisé Random Forest et Milti-Layres Perceptrons pour la phase de classification. Ce qui a pemit l'obtension des précisions respective de 97% et 98%. J'ai aussi appris à documenter mes codes via sphinx

doc2vec fasttext-embeddings gensim glove-embeddings python scikit-learn sphinx-doc word2vec-algorithm

Last synced: 20 Apr 2026