An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/khaymanii/rock-or-mine-detection_model

A machine learning model built in Python to distinguish rocks from mines.

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 13 Apr 2026

https://github.com/anzai004/mechassist

AI-powered desktop engineering decision support for material selection, stress assessment and machinability advisory.

decision-support engineering-tools kmeans machine-learning mechanical-engineering python random-forest scikit-learn tkinter

Last synced: 31 May 2026

https://github.com/mendez-luisjose/breast-cancer-predictor-with-scikit-learn-streamlit-and-deployed-with-flask-and-aws

Breast cancer predictor built with scikit-learn and Streamlit, deployed with Flask and AWS.

aws-ec2 flask logistic-regression scikit-learn streamlit

Last synced: 19 Apr 2026

https://github.com/labrijisaad/monthly-daily-energy-forecasting-docker-api

This repository houses an Energy Forecasting API that uses Machine Learning to predict daily and monthly energy consumption from historical data. It's designed as a practical demonstration of a Machine Learning Engineering workflow, from initial analysis to a deployable API packaged with Docker.

api docker docker-volume jupyter-notebooks machine-learning makefile python random-forest scikit-learn xgboost

Last synced: 11 Apr 2026

https://github.com/tikam02/wine-shop

Wine Reviews and Recommendation Engine - Web-Application [Django]

django machine-learning numpy pandas python recommender-system scikit-learn wine

Last synced: 22 Apr 2025

https://github.com/benzlokzik/spam-detector

Training code and models for spam message detection in Russian

bert chromadb fasttext knn ml python rag scikit-learn spam-detection transformers

Last synced: 25 May 2026

https://github.com/chanmeng666/water-quality-testing-data-analysis

Statistical analysis and predictive modeling of water quality parameters using Python, pandas, scikit-learn, and statsmodels

data-analysis data-science data-visualization environmental-monitoring jupyter-notebook machine-learning pandas python scikit-learn seaborn statistics water-quality

Last synced: 18 Apr 2026

https://github.com/erp12/sklearn-symbolic-regression

Stack based symbolic regression using Scikit-learn estimator base classes.

genetic-programming python scikit-learn sklearn symbolic-regression

Last synced: 28 Apr 2026

https://github.com/christoph/robics

Automatic detection of robust parametrizations for LDA and NMF. Compatible with scikit-learn and gensim.

gensim lda natural-language-processing nmf robust-parametrizations scikit-learn topic-modeling topic-models

Last synced: 13 Apr 2025

https://github.com/chathumiamarasinghe/spam-mail-prediction-using-ml

A Python-based machine learning project to classify emails as spam or not spam. This system uses algorithms like Naive Bayes or Logistic Regression to detect spam with high accuracy, providing a reliable solution for email filtering.

data-science ipynb jupiter-notebook matplotlib numpy pandas python scikit-learn spammailprediction

Last synced: 30 Oct 2025

https://github.com/lugq1990/automl-engine

Automated machine learning for classification and regression in 3 lines of code.

auto-ml automl machine-learning random-forest scikit-learn xgboost

Last synced: 01 Apr 2026

https://github.com/janasunrise/ml-guide-and-implementation

This repository contains predictions and plots for the datasets bundled with the scikit-learn library, along with some other datasets from Kaggle and other sources.

machine-learning ml python3 scikit scikit-learn scikitlearn-machine-learning sklearn

Last synced: 10 Mar 2026

https://github.com/stewartpark/scikit-small-ensemble

scikit-small-ensemble is a library that reduces the memory footprint of your ensemble models (Random Forest Classifier, etc.).

compression ensemble-learning lz4 mmap random-forest-classifier scikit-learn

Last synced: 05 Feb 2026

https://github.com/mafrs47/lung_cancer_prediction

This project predicts lung cancer risks using machine learning models like Random Forest, Logistic Regression, and SVM. It analyzes patient data with features such as age, smoking habits, and symptoms. Data preprocessing, visualization, and performance evaluation ensure accurate predictions for early diagnosis.

cnn computational-pathology convolutional-neural-networks decision-tree-classifier decision-trees deep-learning gradientboosting histopathology jupyter-notebook lung-cancer multiple-instance-learning scikit-learn svm xgboost

Last synced: 02 Mar 2026

https://github.com/ansh-info/incognito

Inspired by Google's Foobar challenge. It automates secretive candidate selection based on online activity, provides timed coding challenges, and allows hiring managers to evaluate submissions

api docker docker-compose javascript jupyter-notebook kmeans leetcode logistic-regression machine-learning machinelearning mysql python3 react reactjs scikit-learn selection-algorithms supervised-learning webdevelopment

Last synced: 11 Feb 2026

https://github.com/matsunagalab/lecture_ml

Google colab notebooks used in a lecture on machine learning

lecture notebooks pymc scikit-learn torch

Last synced: 14 May 2025

https://github.com/tschechlovdev/kmeans_mnist

Demonstration of using k-Means to cluster images of handwritten digits (MNIST dataset). Source Code for corresponding article on Medium.

clustering image-dataset python scikit-learn

Last synced: 07 May 2026

https://github.com/thekartikeyamishra/ai-news-aggregator

This project will create an AI-powered News Aggregator that collects news from selected sources, categorizes it using NLP-based techniques, and displays the results in a user-friendly Tkinter-based GUI.

ai machine-learning nltk python python3 requests scikit-learn

Last synced: 23 Aug 2025

https://github.com/aianytime/recommendation_system_implementation

Complete concepts behind implementing a Recommendation System using Association Rules, Collaborative Filtering, and Matrix Factorization.

collaborative-filtering data-science machine-learning matrix-factorization python recommender-system scikit-learn

Last synced: 28 Apr 2026

https://github.com/g0r0kh/clustering

k-means & hierarchical clustering

conda matplot numpy pandas scikit-learn scipy sklearn

Last synced: 12 Apr 2025

https://github.com/vartikaraj2512/dsml-internship-devtown-notebooks-

🌟 Data Science & Machine Learning Internship Projects 📊 Explore a curated collection of DS & ML notebooks covering topics like regression models, clustering, NLP, and deep learning. Dive into real-world projects such as price prediction, sentiment analysis, and customer segmentation. This repository reflects modern data-driven industry solutions

data-science filehandling googlecolab json kaggle keras machine-learning matplotlib numpy pandas python scikit-learn seaborn sql tensorflow

Last synced: 29 Jan 2026

https://github.com/dmlls/hwr-explained

Simple Handwriting Recognition (HWR) with scikit-image and scikit-learn.

handwriting-recognition hwr scikit-learn

Last synced: 30 Mar 2025

https://github.com/kohlerhector/tree-mbpo

A study of Model-Based Policy Optimization that varies the model estimator class (e.g., Decision Trees vs. MLP).

decision-tree mbpo mbrl mlp rl sac scikit-learn stable-baselines3

Last synced: 05 May 2026

https://github.com/michaelsdavid/charcnn

A convolutional neural network that detects handwritten English characters (letters).

character-recognition cnn-model convolutional-neural-network keras opencv2-python python3 scikit-learn tensorflow

Last synced: 08 Sep 2025

https://github.com/vidhi1290/robust-yield-prediction-

"Predicting a Greener Future 🌾📊 Delve into the world of agriculture and data science with our Yield Prediction project. We harness machine learning and weather data to forecast crop yields accurately. Join us in cultivating smarter farming practices for a sustainable tomorrow."

artificial-intelligence data-analysis data-cleaning-and-preprocessing data-science data-visualization dataexploration devops docker machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot pandas python scikit-learn scikitlearn-machine-learning streamlit yield-prediction-for-food-processing

Last synced: 15 Apr 2026

https://github.com/dllllb/ml-pipelines-tutorial

SciKit-Learn vs Apache Spark pipelines

machine-learning scikit-learn spark

Last synced: 10 Apr 2026

https://github.com/analitico-771/machine_learning_index_prediction

This application compares the performance of unsupervised and supervised machine learning models. It downloads 3 years of daily closing market data for all S&P 500 companies and groups them into sectors to be used as training features, in order to predict whether the index will be a Buy or a Sell the next day. The results are evaluated to determine each model's performance, and the metrics are presented along with the analysis.

alpaca-trading-api data-science deep-learning fintech machine-learning neural-network pandas-dataframe python quantitative-finance random-forest-classifier scikit-learn sp500-data-analysis

Last synced: 26 Feb 2026

https://github.com/sf-tec/openmodels

OpenModels is a flexible and extensible library for serializing and deserializing machine learning models. It's designed to support any serialization format through a plugin-based architecture, providing a safe and transparent solution for exporting and sharing predictive models.

json python scikit-learn serialization sklearn

Last synced: 15 Apr 2025

https://github.com/oneapi-src/purchase-prediction

AI Starter Kit for Purchase Prediction model using Intel® Extension for Scikit-learn*

machine-learning scikit-learn

Last synced: 04 Apr 2025

https://github.com/khaymanii/multiple-disease-prediction-system

This system predicts whether a patient has heart disease, Parkinson's disease, or diabetes.

matplotlib numpy pandas python scikit-learn

Last synced: 17 Apr 2026

https://github.com/jbeno/datawaza

Data science tools for exploration, visualization, and model iteration.

data-science dataviz machine-learning matplotlib pandas scikit-learn seaborn

Last synced: 24 Jul 2025

https://github.com/namratha2301/intrusiondetection

Intrusion detection models based on Internet traffic data obtained from the NSL-KDD dataset.

decisiontree gradient-boosting intrusion-detection mlp-classifier naive-bayes-classifier nsl-kdd randomforest scikit-learn

Last synced: 22 Feb 2026

https://github.com/amirreza81/applied-data-science-course

Comprehensive notes, practical exercises, and problem-solving solutions from the Applied Data Science course, covering data preprocessing, machine learning algorithms, statistical analysis, data visualization, and real-world applications.

accuracy-measure boosting classification data-cleaning data-preprocessing data-science data-visualisation deep-learning dimensionality-reduction eda feature-engineering image-classification imbalanced-data kaggle-dataset machine-learning multiclass-classification pandas regression scikit-learn stroke-prediction

Last synced: 22 Mar 2025

https://github.com/omanshu209/diagnosify-knn

This is a Python-based application that predicts diseases from the symptoms entered by the user, using machine learning (the KNN classifier algorithm).

jupyter-notebook k-nearest-neighbor-classifier kivy kivymd machine-learning python python3 scikit-learn

Last synced: 16 Apr 2026

https://github.com/victormotogna/irislogisticregression

Iris Dataset Logistic Regression - scikit learn version & from scratch

data-science iris-dataset logistic-regression python scikit-learn

Last synced: 30 Apr 2026

https://github.com/udityamerit/python-librearies-for-data-science

Python libraries for data science enable efficient data manipulation, analysis, and modeling. Key libraries include NumPy for numerical computing, pandas for data handling, Matplotlib for visualization, Scikit-learn for machine learning, TensorFlow for deep learning, and BeautifulSoup/requests for web scraping. These libraries simplify complex data

beautifulsoup data data-science data-science-libraries machine-learning matplotlib numpy pandas requests scikit-learn scikitlearn-machine-learning tensorflow

Last synced: 06 Feb 2026

https://github.com/deepentropy/french-realestate-price-prediction

Machine learning applied to French real-estate transaction data (Valeurs Foncières).

google-colab machine-learning opendata scikit-learn

Last synced: 25 Dec 2025

https://github.com/kamomille/titanic

Would you have survived the sinking of the Titanic?

data-science jupiter-notebook machine-learning scikit-learn titanic

Last synced: 19 May 2026

https://github.com/tschechlovdev/automl4clust

Implementation of "AutoML4Clust: Efficient AutoML for Clustering Analyses", published at EDBT 2021.

automl clustering paper python scikit-learn

Last synced: 15 May 2026

https://github.com/azrdev/sklearn-seco

Implementation of the *Separate and Conquer* / *Covering*-Algorithm for scikit-learn

covering machine-learning scikit-learn sklearn

Last synced: 21 Jan 2026

https://github.com/kookmin-sw/capstone-2023-29

Any Seats? (자리있어?): a seat-availability prediction system for Gyeonggi Province intercity buses.

fastapi lstm postgresql python3 pytorch react scikit-learn sqlalchemy

Last synced: 22 Aug 2025

https://github.com/arnoldgaius/text_classifier

Text classifier based on sklearn

pypi scikit-learn text-classifier

Last synced: 19 May 2026

https://github.com/ascender1729/salarypredictionlinearreg

SalaryPredictionLinearReg is a Python-based project utilizing linear regression to predict salaries from years of experience. It covers data loading, model training, detailed statistical analysis, and visualization of results.

data-science linear-regression machine-learning python salary-prediction scikit-learn seaborn statsmodels

Last synced: 07 May 2025

https://github.com/pooranjoyb/health-bridge

Predicting a patient's disease from the patient's video (or text) using ML algorithms. The algorithms used in this project are Natural Language Processing and Random Forest. This project is part of the Intel oneAPI Hackathon 2023.

hackathon intel machine-learning nlp oneapi pandas pickle python scikit-learn

Last synced: 04 Sep 2025

https://github.com/omar-ahmed314/s-he-detector

Gender detection project based on handwriting.

machine-learning python scikit-learn

Last synced: 13 Apr 2025

https://github.com/marcusosterberg/triage-at-home

ML test project that uses NLP techniques to classify, provide decision support, and offer remote self-triage.

artificial-intelligence machine-learning nlp nlp-machine-learning nltk scikit-learn

Last synced: 18 May 2026

https://github.com/klane/springboard

Springboard Data Science Career Track assignments

data-science jupyter-notebook pyspark python scikit-learn springboard sql

Last synced: 13 Apr 2025

https://github.com/shaadclt/data-preprocessing-pipeline

This project contains a data preprocessing pipeline implemented in Python using the pandas and numpy libraries. The pipeline handles missing values, outliers, and normalizes numeric features in a dataset.

numpy pandas scikit-learn

Last synced: 18 Apr 2026

https://github.com/garcane/ethereum-prediction-ml

A machine learning project that predicts the future price of Ethereum (ETH) using the price data gathered from coincodex.com.

crypto cryptocurrency ethereum jupyternotebook lstm lstm-neural-networks machine-learning machine-learning-algorithms python scikit-learn scikitlearn-machine-learning sklean svm tensorflow

Last synced: 10 Apr 2026

https://github.com/sayakpaul/patients-conversation-detector

Contains my experiments for ZS's hiring hackathon (II).

data-science keras machine-learning nlp python scikit-learn text-classification

Last synced: 23 Apr 2025

https://github.com/ptyadana/iris-flower-ml-app

Iris flower prediction machine learning app built with TensorFlow, Keras, scikit-learn, and Flask, deployed on Heroku.

flask heroku keras machine-learning scikit-learn tensorflow2

Last synced: 18 Oct 2025

https://github.com/csinva/trees-to-networks

Bridging random forests and deep neural networks. Partial implementation of "Neural Random Forests" https://arxiv.org/abs/1604.07143

artificial-intelligence classification decision-tree decision-tree-classifier deep-learning machine-learning machinelearning neural-network neural-networks paper-implementations python pytorch random-forest scikit-learn statistics

Last synced: 12 Apr 2026

https://github.com/abhiramdodda/rainfall_prediction

Machine Learning model built on Telangana dataset cropped from Indian weather dataset merged with average temperature dataset

numpy pandas python3 scikit-learn scikitlearn-machine-learning

Last synced: 12 Apr 2026

https://github.com/crispengari/keras-api

💎 Introduction to Keras API and TensorFlow for Researchers

ai deep-learning jupyter-notebook keras machine-learning matplotlib numpy python scikit-learn tensorflow

Last synced: 07 Apr 2026

https://github.com/zehuichen123/ml_algorithm

Code for the book <Machine Learning Algorithm>

machine-learning scikit-learn

Last synced: 14 May 2026

https://github.com/alfredfrancis/jarvis2.0

An intelligent Home automation system using Internet of Things and Machine learning

flask internet-of-things machine-learning php python raspberry-pi scikit-learn

Last synced: 14 Aug 2025

https://github.com/ejw-data/ml-myopia

A variety of machine learning techniques used to identify nearsighted patients

cross-validation gridsearchcv imbalanced-classification kmeans knn machine-learning pca pipeline python random-forest scikit-learn svc tensorflow tsne

Last synced: 11 Jul 2025

https://github.com/akash-peace/face-recognition

The objective of the AJ Face Recognizer project is to build a face recognition model from a custom dataset of two faces.

facenet keras matplotlib mtcnn npz numpy opencv pickle pillow python3 sav scikit-learn

Last synced: 10 Apr 2026

https://github.com/madeyoga/machine-learning

Tensorflow & scikit-learn examples

numpy pandas scikit-learn tensorflow

Last synced: 12 Apr 2025

https://github.com/oneapi-src/customer-segmentation

AI Starter Kit for Customer Segmentation for Online Retail using Intel® Extension for Scikit-learn*

machine-learning scikit-learn

Last synced: 04 Apr 2025

https://github.com/lechemi/machine-learning-vademecum

A notebook containing basic concepts and practical Python examples on machine learning.

machine-learning python scikit-learn

Last synced: 20 Jun 2025

https://github.com/greed2411/asf

Anti Spam Filter, a spam filter 🗃️ that uses a model built with the MultinomialNB algorithm 👈 from scikit-learn to classify spam and complaints.

algorithm asf dataset joblib maintenance passes-complaints scikit-learn spam spam-filter vit-university

Last synced: 18 May 2026

https://github.com/ammaryasirnaich/deeplearning_playland

This repository contains Docker Image files, which support the common frameworks required for Deep learning implementation. The images support both the latest GPU (Nvidia CUDA) and CPU processors.

cuda cuda11 cudnn cudnn8 deep-learning docker docker-image dockerfile gpu kersa opencv pytorch pytorch-cnn scikit-learn tensorflow2

Last synced: 12 Apr 2026

https://github.com/juliandavidmr/machinelearningscikit

Flower classifier using supervised learning

neural-network python scikit-learn sklearn

Last synced: 24 Feb 2025

https://github.com/pateash/kisanmitra-python

Python Machine learning Utility for Kisanmitra Web App

jupyter-notebook machine-learning python scikit-learn

Last synced: 31 Jul 2025

https://github.com/machinelearningprodigy/whastapp-chat-analyzer

WhatsApp Chat Analyzer is a powerful tool that helps you analyze your WhatsApp chat history with detailed statistics and visualizations. From message trends to most active users, this tool provides deep insights into your conversations! 🚀

matplotlib plotly scikit-learn seaborn streamlit wordcloud

Last synced: 13 Apr 2025

https://github.com/chanioxaris/german-credit-data

Experiments with classification algorithms on German credit data, implemented using the scikit-learn library.

classification classifier cross-validation dataset information-entropy information-gain naive-bayes prediction random-forest scikit-learn support-vector-machines

Last synced: 30 Apr 2025

https://github.com/udityamerit/complete-machine-learning-for-beginners

This repository is structured as a complete ML roadmap combining theory (PDFs) with hands-on coding (Jupyter Notebooks) to help you build a solid foundation in data science and machine learning. Ideal for students, self-learners, and professionals looking to revise or upgrade.

artificial-intelligence classification clustering clustering-algorithm machine machine-learning machinelearning matplotlib matplotlib-figures numpy pandas regression regression-models regressionalgorithms regressionanalysis scikit-learn scikitlearn-machine-learning scipy seaborn tensorflow

Last synced: 15 May 2026

https://github.com/lakshay-a/alzheimer-diagnosis-using-cnn

An advanced deep learning tool for Alzheimer's disease diagnosis using a CNN with transfer learning from DenseNet-121 pre-trained on RadImageNet, achieving a test accuracy of 95.13%. It features a user-friendly interface for uploading MRI scans and provides immediate classification into AD, MCI, or CN stages.

kfold-cross-validation matplotlib numpy python3 scikit-learn tensorflow transfer-learning

Last synced: 23 Feb 2026

https://github.com/caiocarneloz/scyred

Automatic sklearn parameter tuning with bio-inspired algorithms

bio-inspired library parameter-tuning scikit-learn

Last synced: 17 Feb 2026

https://github.com/shervinnd/bitcoin-price-prediction-ml-dl

Predict Bitcoin prices with ML & DL models! 📈 Uses Ridge, Lasso, Random Forest, MLP, RNN & LSTM with hyperparameter tuning. 📊 Visualizes predictions & ROC curves. 🚀 Fetch data via yfinance, evaluate with MSE/R2. Perfect for crypto enthusiasts! 💸

cryptocurrency data-science data-visualization decision-tree deep-learning financial-modeling machine-learning neural-networks predictive-analytics price-prediction python random-forest regression rnn scikit-learn simple-rnn tensorflow time-series yfinance

Last synced: 15 Oct 2025

https://github.com/shervinnd/blood-donor-availability-predictor

A deep learning model to predict blood donor availability using TensorFlow and sklearn. Features data preprocessing, neural network training, and ROC curve visualization. Achieve high accuracy in predicting donor status! 🩺💉

binary-classification blood-donation blood-donor-prediction data-preprocessing deep-learning healthcare-ai machine-learning medical-data-analysis neural-network predictive-modeling python roc-curve scikit-learn tensorflow

Last synced: 11 Oct 2025

https://github.com/alexfrancow/cms_version_detector_poc

A Machine Learning application that detects versions of WordPress with Multi-Class classification algorithms.

cms cybersecurity data-science footprinting infosec machine-learning pandas python3 random-forest scikit-learn tool wordpress

Last synced: 12 Oct 2025

https://github.com/oneapi-src/powerline-fault-detection

AI Starter Kit for detecting faulty signals in power line voltage using Intel® Extension for Scikit-learn*

machine-learning scikit-learn

Last synced: 04 Apr 2025

https://github.com/sayakpaul/floydhub-k_means-blog

Contains the Jupyter Notebook made for a FloydHub article on K-Means

numpy pandas scikit-learn yellowbricks

Last synced: 20 Sep 2025

https://github.com/manena/sp-sentiment-analysis

Sentiment Analysis in Python trained with Amazon Spain reviews in Spanish

jupyter-notebook machine-learning nltk nltk-library python-3-5 pyton scikit-learn sentiment-analysis

Last synced: 09 Oct 2025