An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/ejw-data/ml-myopia

A variety of machine learning techniques used to identify nearsighted patients

cross-validation gridsearchcv imbalanced-classification kmeans knn machine-learning pca pipeline python random-forest scikit-learn svc tensorflow tsne

Last synced: 11 Jul 2025

https://github.com/sf-tec/openmodels

OpenModels is a flexible and extensible library for serializing and deserializing machine learning models. It's designed to support any serialization format through a plugin-based architecture, providing a safe and transparent solution for exporting and sharing predictive models.

json python scikit-learn serialization sklearn

Last synced: 15 Apr 2025

https://github.com/ansh-info/incognito

Inspired by Google's Foobar challenge. It automates secretive candidate selection based on online activity, provides timed coding challenges, and allows hiring managers to evaluate submissions

api docker docker-compose javascript jupyter-notebook kmeans leetcode logistic-regression machine-learning machinelearning mysql python3 react reactjs scikit-learn selection-algorithms supervised-learning webdevelopment

Last synced: 11 Feb 2026

https://github.com/solanovisitor/foreml

A Python package for time series forecasting using TensorFlow and Keras

deep-learning keras lstm machine-learning pandas poetry python scikit-learn tensorflow

Last synced: 02 Apr 2026

https://github.com/kookmin-sw/capstone-2023-29

자리있어? ("Any seats left?") - Seat availability prediction system for Gyeonggi-do metropolitan buses

fastapi lstm postgresql python3 pytorch react scikit-learn sqlalchemy

Last synced: 22 Aug 2025

https://github.com/alexfrancow/cms_version_detector_poc

A Machine Learning application that detects versions of WordPress with Multi-Class classification algorithms.

cms cybersecurity data-science footprinting infosec machine-learning pandas python3 random-forest scikit-learn tool wordpress

Last synced: 12 Oct 2025

https://github.com/chanmeng666/water-quality-testing-data-analysis

Statistical analysis and predictive modeling of water quality parameters using Python, pandas, scikit-learn, and statsmodels

data-analysis data-science data-visualization environmental-monitoring jupyter-notebook machine-learning pandas python scikit-learn seaborn statistics water-quality

Last synced: 18 Apr 2026

https://github.com/oneapi-src/purchase-prediction

AI Starter Kit for Purchase Prediction model using Intel® Extension for Scikit-learn*

machine-learning scikit-learn

Last synced: 04 Apr 2025

https://github.com/lakshay-a/alzheimer-diagnosis-using-cnn

An advanced deep learning tool for Alzheimer's disease diagnosis using a CNN with transfer learning from DenseNet-121 pre-trained on RadImageNet, achieving a test accuracy of 95.13%. It features a user-friendly interface for uploading MRI scans and provides immediate classification into AD, MCI, or CN stages.

kfold-cross-validation matplotlib numpy python3 scikit-learn tensorflow transfer-learning

Last synced: 23 Feb 2026

https://github.com/stewartpark/scikit-small-ensemble

scikit-small-ensemble is a library that gives your ensemble models (Random Forest Classifier, etc.) a small memory footprint.

compression ensemble-learning lz4 mmap random-forest-classifier scikit-learn

Last synced: 05 Feb 2026

https://github.com/janasunrise/ml-guide-and-implementation

This repository contains predictions and plots for the datasets bundled with the scikit-learn library, as well as some other datasets from Kaggle and other sources.

machine-learning ml python3 scikit scikit-learn scikitlearn-machine-learning sklearn

Last synced: 10 Mar 2026

https://github.com/benzlokzik/spam-detector

Training code and models for spam message detection in Russian

bert chromadb fasttext knn ml python rag scikit-learn spam-detection transformers

Last synced: 25 May 2026

https://github.com/labrijisaad/monthly-daily-energy-forecasting-docker-api

This repository houses an Energy Forecasting API that uses Machine Learning to predict daily and monthly energy consumption from historical data. It's designed as a practical demonstration of a Machine Learning Engineering workflow, from initial analysis to a deployable API packaged with Docker.

api docker docker-volume jupyter-notebooks machine-learning makefile python random-forest scikit-learn xgboost

Last synced: 11 Apr 2026

https://github.com/tikam02/wine-shop

Wine Reviews and Recommendation Engine - Web-Application [Django]

django machine-learning numpy pandas python recommender-system scikit-learn wine

Last synced: 22 Apr 2025

https://github.com/mendez-luisjose/breast-cancer-predictor-with-scikit-learn-streamlit-and-deployed-with-flask-and-aws

Breast cancer predictor built with scikit-learn and Streamlit, deployed with Flask and AWS.

aws-ec2 flask logistic-regression scikit-learn streamlit

Last synced: 19 Apr 2026

https://github.com/lucsorel/watts-app

A sample application demonstrating how big data and connected objects can help energy monitoring.

energy-monitor factory modeling rxjs scikit-learn smart-energy socket-io zeromq

Last synced: 09 May 2026

https://github.com/matsunagalab/lecture_ml

Google Colab notebooks used in a lecture on machine learning

lecture notebooks pymc scikit-learn torch

Last synced: 14 May 2025

https://github.com/christoph/robics

Automatic detection of robust parametrizations for LDA and NMF. Compatible with scikit-learn and gensim.

gensim lda natural-language-processing nmf robust-parametrizations scikit-learn topic-modeling topic-models

Last synced: 13 Apr 2025

https://github.com/vidhi1290/robust-yield-prediction-

"Predicting a Greener Future 🌾📊 Delve into the world of agriculture and data science with our Yield Prediction project. We harness machine learning and weather data to forecast crop yields accurately. Join us in cultivating smarter farming practices for a sustainable tomorrow."

artificial-intelligence data-analysis data-cleaning-and-preprocessing data-science data-visualization dataexploration devops docker machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot pandas python scikit-learn scikitlearn-machine-learning streamlit yield-prediction-for-food-processing

Last synced: 15 Apr 2026

https://github.com/sayakpaul/bentoml-explorations

Contains my experiments made with the mighty library BentoML

bentoml python rest-api scikit-learn tensorflow zomato

Last synced: 16 Sep 2025

https://github.com/udityamerit/complete-machine-learning-for-beginners

This repository is structured as a complete ML roadmap combining theory (PDFs) with hands-on coding (Jupyter Notebooks) to help you build a solid foundation in data science and machine learning. Ideal for students, self-learners, and professionals looking to revise or upgrade.

artificial-intelligence classification clustering clustering-algorithm machine machine-learning machinelearning matplotlib matplotlib-figures numpy pandas regression regression-models regressionalgorithms regressionanalysis scikit-learn scikitlearn-machine-learning scipy seaborn tensorflow

Last synced: 15 May 2026

https://github.com/chicolucio/churn-prediction

Predicting customer churn with machine learning

churn-prediction machine-learning python scikit-learn

Last synced: 01 May 2026

https://github.com/khaymanii/rock-or-mine-detection_model

A machine learning model built in Python to distinguish between rock and mine

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 13 Apr 2026

https://github.com/anzai004/mechassist

AI-powered desktop engineering decision support for material selection, stress assessment and machinability advisory.

decision-support engineering-tools kmeans machine-learning mechanical-engineering python random-forest scikit-learn tkinter

Last synced: 31 May 2026

https://github.com/aianytime/recommendation_system_implementation

Complete concepts behind implementing a Recommendation System using Association Rules, Collaborative Filtering, and Matrix Factorization.

collaborative-filtering data-science machine-learning matrix-factorization python recommender-system scikit-learn

Last synced: 28 Apr 2026

https://github.com/shaadclt/data-preprocessing-pipeline

This project contains a data preprocessing pipeline implemented in Python using the pandas and numpy libraries. The pipeline handles missing values, outliers, and normalizes numeric features in a dataset.

numpy pandas scikit-learn

Last synced: 18 Apr 2026

https://github.com/azrdev/sklearn-seco

Implementation of the Separate-and-Conquer / Covering algorithm for scikit-learn

covering machine-learning scikit-learn sklearn

Last synced: 21 Jan 2026

https://github.com/khaymanii/multiple-disease-prediction-system

This system predicts whether a patient has heart disease, Parkinson's disease, or diabetes

matplotlib numpy pandas python scikit-learn

Last synced: 17 Apr 2026

https://github.com/sayakpaul/patients-conversation-detector

Contains my experiments for ZS's hiring hackathon (II).

data-science keras machine-learning nlp python scikit-learn text-classification

Last synced: 23 Apr 2025

https://github.com/analitico-771/machine_learning_index_prediction

This application compares the performance of unsupervised and supervised machine learning models. It downloads three years of daily closing prices for all S&P 500 companies and groups them by sector to use as training features, in order to predict whether the index will be a Buy or a Sell the next day. The results are evaluated to determine each model's performance, and the metrics are presented along with the analysis.

alpaca-trading-api data-science deep-learning fintech machine-learning neural-network pandas-dataframe python quantitative-finance random-forest-classifier scikit-learn sp500-data-analysis

Last synced: 26 Feb 2026

https://github.com/csinva/trees-to-networks

Bridging random forests and deep neural networks. Partial implementation of "Neural Random Forests" https://arxiv.org/abs/1604.07143

artificial-intelligence classification decision-tree decision-tree-classifier deep-learning machine-learning machinelearning neural-network neural-networks paper-implementations python pytorch random-forest scikit-learn statistics

Last synced: 12 Apr 2026

https://github.com/ammaryasirnaich/deeplearning_playland

This repository contains Docker Image files, which support the common frameworks required for Deep learning implementation. The images support both the latest GPU (Nvidia CUDA) and CPU processors.

cuda cuda11 cudnn cudnn8 deep-learning docker docker-image dockerfile gpu kersa opencv pytorch pytorch-cnn scikit-learn tensorflow2

Last synced: 12 Apr 2026

https://github.com/shervinnd/blood-donor-availability-predictor

A deep learning model to predict blood donor availability using TensorFlow and sklearn. Features data preprocessing, neural network training, and ROC curve visualization. Achieve high accuracy in predicting donor status! 🩺💉

binary-classification blood-donation blood-donor-prediction data-preprocessing deep-learning healthcare-ai machine-learning medical-data-analysis neural-network predictive-modeling python roc-curve scikit-learn tensorflow

Last synced: 11 Oct 2025

https://github.com/dmlls/hwr-explained

Simple Handwriting Recognition (HWR) with scikit-image and scikit-learn.

handwriting-recognition hwr scikit-learn

Last synced: 30 Mar 2025

https://github.com/vartikaraj2512/dsml-internship-devtown-notebooks-

🌟 Data Science & Machine Learning Internship Projects 📊 Explore a curated collection of DS & ML notebooks covering topics like regression models, clustering, NLP, and deep learning. Dive into real-world projects such as price prediction, sentiment analysis, and customer segmentation. This repository reflects modern data-driven industry solutions

data-science filehandling googlecolab json kaggle keras machine-learning matplotlib numpy pandas python scikit-learn seaborn sql tensorflow

Last synced: 29 Jan 2026

https://github.com/pateash/kisanmitra-python

Python Machine learning Utility for Kisanmitra Web App

jupyter-notebook machine-learning python scikit-learn

Last synced: 31 Jul 2025

https://github.com/sayakpaul/floydhub-k_means-blog

Contains the Jupyter Notebook made for a FloydHub article on K-Means

numpy pandas scikit-learn yellowbricks

Last synced: 20 Sep 2025

https://github.com/ptyadana/iris-flower-ml-app

Iris flower prediction machine learning app using TensorFlow, Keras, scikit-learn, and Flask, deployed on Heroku

flask heroku keras machine-learning scikit-learn tensorflow2

Last synced: 18 Oct 2025

https://github.com/rickiepark/ml-with-python-cookbook-2nd

<실무로 통하는 ML 문제 해결 with 파이썬> ("ML Problem Solving for Real-World Practice with Python")

deep-learning machie-learning pytorch scikit-learn

Last synced: 29 Oct 2025

https://github.com/34j/sklearn-utilities

Utilities for scikit-learn. Append prediction to x, append prediction to x single, append x prediction to x, compose var estimator, data frame wrapper, drop by noise prediction, drop missing rows y, dummy regressor var, estimator wrapper base, excluded column transformer pandas, feature union pandas, id transformer, included column transformer pand

catboost feature-engine feature-engineering multioutput pandas pca python pytorch regression scikit-learn sklearn sklearn-compatible skorch torch tqdm

Last synced: 13 Apr 2025

https://github.com/manena/sp-sentiment-analysis

Sentiment Analysis in Python trained with Amazon Spain reviews in Spanish

jupyter-notebook machine-learning nltk nltk-library python-3-5 pyton scikit-learn sentiment-analysis

Last synced: 09 Oct 2025

https://github.com/udityamerit/python-librearies-for-data-science

Python libraries for data science enable efficient data manipulation, analysis, and modeling. Key libraries include NumPy for numerical computing, pandas for data handling, Matplotlib for visualization, Scikit-learn for machine learning, TensorFlow for deep learning, and BeautifulSoup/requests for web scraping. These libraries simplify complex data

beautifulsoup data data-science data-science-libraries machine-learning matplotlib numpy pandas requests scikit-learn scikitlearn-machine-learning tensorflow

Last synced: 06 Feb 2026

https://github.com/michaelsdavid/charcnn

A convolutional neural network that recognizes handwritten English characters (letters).

character-recognition cnn-model convolutional-neural-network keras opencv2-python python3 scikit-learn tensorflow

Last synced: 08 Sep 2025

https://github.com/g0r0kh/clustering

k-means & hierarchical clustering

conda matplot numpy pandas scikit-learn scipy sklearn

Last synced: 12 Apr 2025

https://github.com/thekartikeyamishra/ai-news-aggregator

This project will create an AI-powered News Aggregator that collects news from selected sources, categorizes it using NLP-based techniques, and displays the results in a user-friendly Tkinter-based GUI.

ai machine-learning nltk python python3 requests scikit-learn

Last synced: 23 Aug 2025

https://github.com/tschechlovdev/kmeans_mnist

Demonstration of using k-Means to cluster images of handwritten digits (MNIST dataset). Source Code for corresponding article on Medium.

clustering image-dataset python scikit-learn

Last synced: 07 May 2026

https://github.com/kohlerhector/tree-mbpo

A study of Model-Based Policy Optimization that varies the model estimator class (e.g., decision trees vs. MLPs)

decision-tree mbpo mbrl mlp rl sac scikit-learn stable-baselines3

Last synced: 05 May 2026

https://github.com/omanshu209/diagnosify-knn

This is a Python-based application that predicts diseases from symptoms entered by the user, using machine learning (a KNN classifier).

jupyter-notebook k-nearest-neighbor-classifier kivy kivymd machine-learning python python3 scikit-learn

Last synced: 16 Apr 2026

https://github.com/abhiramdodda/rainfall_prediction

Machine learning model built on a Telangana subset of an Indian weather dataset, merged with an average temperature dataset

numpy pandas python3 scikit-learn scikitlearn-machine-learning

Last synced: 12 Apr 2026

https://github.com/pooranjoyb/health-bridge

Predicts a patient's disease from a video (or text) of the patient using ML algorithms. The techniques used in this project are Natural Language Processing and Random Forest. The project was built for the Intel oneAPI Hackathon 2023

hackathon intel machine-learning nlp oneapi pandas pickle python scikit-learn

Last synced: 04 Sep 2025

https://github.com/namratha2301/intrusiondetection

Intrusion detection models based on Internet traffic data from the NSL-KDD dataset

decisiontree gradient-boosting intrusion-detection mlp-classifier naive-bayes-classifier nsl-kdd randomforest scikit-learn

Last synced: 22 Feb 2026

https://github.com/akash-peace/face-recognition

The AJ Face Recognizer project aims to build a face recognition model from a custom dataset of two faces.

facenet keras matplotlib mtcnn npz numpy opencv pickle pillow python3 sav scikit-learn

Last synced: 10 Apr 2026

https://github.com/victormotogna/irislogisticregression

Iris dataset logistic regression: a scikit-learn version and a from-scratch version

data-science iris-dataset logistic-regression python scikit-learn

Last synced: 30 Apr 2026

https://github.com/kamomille/titanic

Would you have survived the sinking of the Titanic?

data-science jupiter-notebook machine-learning scikit-learn titanic

Last synced: 19 May 2026

https://github.com/tschechlovdev/automl4clust

Implementation of "AutoML4Clust: Efficient AutoML for Clustering Analyses", published at EDBT 2021.

automl clustering paper python scikit-learn

Last synced: 15 May 2026

https://github.com/arnoldgaius/text_classifier

Text classifier based on sklearn

pypi scikit-learn text-classifier

Last synced: 19 May 2026

https://github.com/marcusosterberg/triage-at-home

ML test project that uses NLP techniques to classify, provide decision support, and offer remote self-triage

artificial-intelligence machine-learning nlp nlp-machine-learning nltk scikit-learn

Last synced: 18 May 2026

https://github.com/ascender1729/salarypredictionlinearreg

SalaryPredictionLinearReg is a Python-based project utilizing linear regression to predict salaries from years of experience. It covers data loading, model training, detailed statistical analysis, and visualization of results.

data-science linear-regression machine-learning python salary-prediction scikit-learn seaborn statsmodels

Last synced: 07 May 2025

https://github.com/omar-ahmed314/s-he-detector

Gender detection project based on handwriting.

machine-learning python scikit-learn

Last synced: 13 Apr 2025

https://github.com/klane/springboard

Springboard Data Science Career Track assignments

data-science jupyter-notebook pyspark python scikit-learn springboard sql

Last synced: 13 Apr 2025

https://github.com/garcane/ethereum-prediction-ml

A machine learning project that predicts the future price of Ethereum (ETH) using the price data gathered from coincodex.com.

crypto cryptocurrency ethereum jupyternotebook lstm lstm-neural-networks machine-learning machine-learning-algorithms python scikit-learn scikitlearn-machine-learning sklean svm tensorflow

Last synced: 10 Apr 2026

https://github.com/avannaldas/emailsclassification

Classification of emails received on a mass distribution group

countvectorizer email-classifier scikit-learn sklearn text-classification tfidf

Last synced: 01 Jul 2025

https://github.com/crispengari/keras-api

💎 Introduction to Keras API and TensorFlow for Researchers

ai deep-learning jupyter-notebook keras machine-learning matplotlib numpy python scikit-learn tensorflow

Last synced: 07 Apr 2026

https://github.com/chathumiamarasinghe/spam-mail-prediction-using-ml

A Python-based machine learning project to classify emails as spam or not spam. This system uses algorithms like Naive Bayes or Logistic Regression to detect spam with high accuracy, providing a reliable solution for email filtering.

data-science ipynb jupiter-notebook matplotlib numpy pandas python scikit-learn spammailprediction

Last synced: 30 Oct 2025

https://github.com/madeyoga/machine-learning

TensorFlow & scikit-learn examples

numpy pandas scikit-learn tensorflow

Last synced: 12 Apr 2025

https://github.com/oneapi-src/customer-segmentation

AI Starter Kit for Customer Segmentation for Online Retail using Intel® Extension for Scikit-learn*

machine-learning scikit-learn

Last synced: 04 Apr 2025

https://github.com/lechemi/machine-learning-vademecum

A notebook containing basic concepts and practical Python examples on machine learning.

machine-learning python scikit-learn

Last synced: 20 Jun 2025

https://github.com/zehuichen123/ml_algorithm

Code for the book <Machine Learning Algorithm>

machine-learning scikit-learn

Last synced: 14 May 2026

https://github.com/shervinnd/bitcoin-price-prediction-ml-dl

Predict Bitcoin prices with ML & DL models! 📈 Uses Ridge, Lasso, Random Forest, MLP, RNN & LSTM with hyperparameter tuning. 📊 Visualizes predictions & ROC curves. 🚀 Fetch data via yfinance, evaluate with MSE/R2. Perfect for crypto enthusiasts! 💸

cryptocurrency data-science data-visualization decision-tree deep-learning financial-modeling machine-learning neural-networks predictive-analytics price-prediction python random-forest regression rnn scikit-learn simple-rnn tensorflow time-series yfinance

Last synced: 15 Oct 2025

https://github.com/greed2411/asf

Anti Spam Filter, a spam filter 🗃️ which uses a model made out of MultinomialNB algorithm 👈 from scikit-learn 🐍 to classify spam and complaints.

algorithm asf dataset joblib maintenance passes-complaints scikit-learn spam spam-filter vit-university

Last synced: 18 May 2026

https://github.com/dllllb/ml-pipelines-tutorial

scikit-learn vs. Apache Spark pipelines

machine-learning scikit-learn spark

Last synced: 10 Apr 2026

https://github.com/juliandavidmr/machinelearningscikit

Flower classifier using supervised learning

neural-network python scikit-learn sklearn

Last synced: 24 Feb 2025

https://github.com/machinelearningprodigy/whastapp-chat-analyzer

WhatsApp Chat Analyzer is a powerful tool that helps you analyze your WhatsApp chat history with detailed statistics and visualizations. From message trends to most active users, this tool provides deep insights into your conversations! 🚀

matplotlib plotly scikit-learn seaborn streamlit wordcloud

Last synced: 13 Apr 2025

https://github.com/amirreza81/applied-data-science-course

Comprehensive notes, practical exercises, and problem-solving solutions from the Applied Data Science course, covering data preprocessing, machine learning algorithms, statistical analysis, data visualization, and real-world applications.

accuracy-measure boosting classification data-cleaning data-preprocessing data-science data-visualisation deep-learning dimensionality-reduction eda feature-engineering image-classification imbalanced-data kaggle-dataset machine-learning multiclass-classification pandas regression scikit-learn stroke-prediction

Last synced: 22 Mar 2025

https://github.com/caiocarneloz/scyred

Automatic sklearn parameter tuning with bio-inspired algorithms

bio-inspired library parameter-tuning scikit-learn

Last synced: 17 Feb 2026

https://github.com/oneapi-src/powerline-fault-detection

AI Starter Kit for detecting faulty signals in power line voltage using Intel® Extension for Scikit-learn*

machine-learning scikit-learn

Last synced: 04 Apr 2025

https://github.com/pythonicshariful/insurance-charge-predictor

This project predicts medical insurance charges based on personal details such as age, gender, BMI, number of children, smoking habits, and region. It uses a Machine Learning model trained on the insurance.csv dataset and provides a Flask web app interface for user input

flask insura machine-learning mlapp python regression scikit-learn

Last synced: 09 May 2026

https://github.com/thenorthkun/twitter-gender-text-analysis

This project analyzes the texts and words (using NLP) typed by male and female users active on Twitter. 🐦👨📝

analysis data-visualization natural-language-processing scikit-learn

Last synced: 06 May 2026