An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/neelanjan-chakraborty/custoclarity

CUSTO CLARITY is a customer segmentation model built in Python. Using clustering on real retail datasets, it identifies 5 customer segments that unlocked strategic retail partnerships. Powered by scikit-learn, pandas, seaborn, and Matplotlib.

clustering-algorithm clustering-algorithms customer-analytics customer-segmentation data-visualization kmeans kmeans-clustering pandas python scikit-learn

Last synced: 11 May 2026

https://github.com/pngo1997/astrophysical-objects-classification

Project applies machine learning techniques to classify astrophysical objects using observational data from the Large Synoptic Survey Telescope (LSST).

adaptive-boosting-algorithm classification down-sampling gradient-boosting keras machine-learning neural-network python random-forest scikit-learn supervised-learning tensorflow time-series

Last synced: 10 May 2026

https://github.com/khaymanii/titanic_survival_prediction_-model

This Model was built using Python and Logistic Regression algorithm

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 02 May 2026

https://github.com/vaibhavs10/learn-ml

Modified notebooks (single) from kaggle.com/learn with added nuances

decision-trees machine-learning pandas random-forest scikit-learn

Last synced: 11 May 2026

https://github.com/rvats20/income-classification-using-ml

Model Training, Implementing various machine learning algorithms such as Logistic Regression, Decision Trees, Random Forests, and Gradient Boosting. Model Evaluation: Assessing model performance using metrics like accuracy, precision, recall, and F1-score. Hyperparameter Tuning

classification machine-learning machine-learning-algorithms ml pandas-dataframe python scikit-learn

Last synced: 11 May 2026

https://github.com/rickiepark/ml-ko

머신러닝, 딥러닝 한글 번역 저장소

deep-learning keras machine-learning python scikit-learn tensorflow

Last synced: 17 Apr 2026

https://github.com/royxlead/multi-objective-feature-selection

NSGA-II multi-objective feature selection on medical tabular data. 9 of 30 features at 94.74% accuracy - matching full-feature baselines with 70% feature reduction.

deap evolutionary-algorithms feature-selection interpretable-ml medical-ml multi-objective-optimization nsga2 pareto-front random-forest scikit-learn

Last synced: 23 Jun 2026

https://github.com/francescopaolol/decisiontree

About classify iris plants into three species in this classic dataset

decision-tree-classifier jupyter-notebook kaggle machine-learning ml pandas scikit-learn

Last synced: 16 Apr 2026

https://github.com/hasanulmukit/spam-email-classifier

This is a Spam Email Classifier built using Python and Streamlit. It uses a pre-trained model to predict whether an email is Spam or Not Spam. The app also provides the probability scores for both categories, enhancing transparency and reliability of the prediction.

email-classifier machine-learning nlp python scikit-learn spam-detection streamlit text-classification

Last synced: 11 May 2026

https://github.com/umar-saadat/car-price-prediction-ml

🚗 A Machine Learning project that predicts the price of used cars using Linear Regression. Built with Python, Scikit-learn, and Streamlit, this app takes inputs like car brand, year, mileage, engine size, and more to estimate the selling price in real-time

ai-project car-price-prediction data-science linear-regression machine-learning ml-project python scikit-learn streamlit

Last synced: 02 May 2026

https://github.com/aditya-ranjan1234/interactive-salary-prediction-with-machine-learning

A Streamlit web application for exploring the UCI Census Income dataset, training machine learning models, and predicting employee salaries.

data-science machine-learning prediction python scikit-learn streamlit xgboost

Last synced: 29 Apr 2026

https://github.com/aravindnathan02/credit-card-fraud-detection

This repository contains a Machine Learning project aimed at detecting fraudulent credit card transactions. The goal is to build a reliable and efficient model that minimizes false positives and false negatives, ensuring financial safety and improving fraud detection capabilities.

classification-model fraud-detection logistic-regression machine-learning python random-forest scikit-learn

Last synced: 11 May 2026

https://github.com/bistcuite/plainml

Painless Machine Learning Library for python based on scikit-learn

machine-learning ml plainml python scikit-learn

Last synced: 02 May 2026

https://github.com/sapsan14/water-quality-ee

Estonian water quality ML — binary classification of Terviseamet open data, Jupyter + scikit-learn.

classification estonia jupyter ml open-data scikit-learn

Last synced: 02 May 2026

https://github.com/assamirzafar/learning

My Roadmaps and challenges are in this repo...I will add my colab and kaggle notebook links along with py script files in here.

calculus convolutional-neural-networks deep-learning deep-neural-networks keras linear-algebra machine-learning numpy opencv probability python3 pytorch scikit-learn scipy statistics

Last synced: 05 Apr 2026

https://github.com/rhazra-003/fake_news_detector

A Machine Learning model to detect fake news with more than 95% accuracy

fake-news numpy pandas scikit-learn

Last synced: 18 Apr 2026

https://github.com/jordandeklerk/pygridge

A scikit-learn compatible Python package for data-driven group regularized ridge regression

python regression regularized-regression scikit-learn

Last synced: 05 May 2026

https://github.com/siam29/hybrid-feature-engineering-and-ensemble-learning

In this ML project, I proposed a methodology that provided an outperformed performance compared to another existing paper. For the comparison here focused mainly on F1, accuracy, AUC, and ROC score. This methodology provides a 99.96% accuracy score and 90.05% F1 score. 

feature-selection keras-tensorflow machine-learning matplotlib python scikit-learn

Last synced: 18 Apr 2026

https://github.com/venky-1710/stress-level-predection

Stress Level Prediction is a web app using machine learning to estimate user stress levels. It takes inputs like anxiety, sleep quality, and academic performance, then predicts stress using a Decision Tree Classifier. Built with Python, Flask, and scikit-learn, it's useful for students, researchers, and those interested in stress management.

css flask html machine-learning numpy pandas python python-sklearn scikit-learn

Last synced: 05 Apr 2026

https://github.com/sarthak-1408/rain-fall-prediction

This repository represents the End to End Machine Learning Project (Rain Fall Prediction in Australia).

heroku heroku-deployment machine-learning numpy pandas rain-fall rain-fall-prediction scikit-learn xgboost-algorithm

Last synced: 05 May 2026

https://github.com/somjit101/nlp-casestudy-quora-question-similarity

An application of NLP and classical ML algorithms to an interesting real-world use case of predicting similarity between two questions on Quora. This allows the platform to combine similar questions into one and combine their answers to avoid duplication and unnecessary confusion.

cross-validation feature-engineering feature-extraction gradient-boosting kaggle logistic-regression machine-learning model-calibration natural-language-processing nlp quora-question-pairs scikit-learn svm text-mining xgboost

Last synced: 05 Apr 2026

https://github.com/joaoassalim/class-by-description-classifier-with-nlp

Enhancing Item Classification through Natural Language Processing: Leveraging Text Descriptions for Precise Categorization

bert fine-tuning nlp nlp-machine-learning scikit-learn sklearn tensorflow

Last synced: 06 May 2026

https://github.com/emmanuelezenwere/aind-aiprojects

Portfolio of AI projects developed during my Udacity AI Nanodegree, covering Planning AI, Constraint Satisfaction, Hidden Markov Models, and Search algorithms.

alpha-beta-pruning astar-algorithm bellman-equation breadth-first-search constraint-satisfaction-problem depth-first-search hidden-markov-model kalman-filter minmax-algorithm networkx nltk numpy pandas scikit-learn scipy sympy

Last synced: 29 Apr 2026

https://github.com/ayushsaksena30/cosmic-classifier

This notebook implements a structured machine learning pipeline to classify cosmic data using the CatBoost Classifier, known for its efficiency with categorical features and minimal preprocessing requirements.

catboost-classifier label-encoder machine-learning matplotlib numpy pandas robust-scaler scikit-learn seaborn simple-imputer

Last synced: 15 Apr 2026

https://github.com/khaymanii/diabetes_prediction_model

This is a Machine learning model built using Python

matplotlib numpy pandas python scikit-learn

Last synced: 19 Apr 2026

https://github.com/drcbeatz/machine-learning-tool

Machine Learning Tool - Train and test supervised ML algorithms (incl. binary classification and regression) on custom data sets and visualize your results without knowing how to code.

data-science data-visualization django machine-learning python scikit-learn

Last synced: 06 May 2026

https://github.com/himendersharma0712/life_expectancy_pred

This repository is for a hackathon project.

jupyter-notebook machine-learning python scikit-learn

Last synced: 06 May 2026

https://github.com/shubhranpara/heart-disease-predictor

I have created this project as my Python term assignment. In this project I have trained a ML model to predict the heart disease using Scikit-learn library in python.

google-colab jupyter-notebook machine-learning medical prediction-model python scikit-learn

Last synced: 06 May 2026

https://github.com/tomwassing/brane-project

Brane example project using the Scikit-learn and Matplotlib packages

brane branescript matplotlib scikit-learn

Last synced: 17 Oct 2025

https://github.com/intscription/python-programs

Python basics-advance

numpy pandas scikit-learn

Last synced: 05 May 2026

https://github.com/sralter/classifire

Wildfire Prediction Model: Samuel Alter's BrainStation 2023 Data Science Capstone Project

qgis scikit-learn tensorflow

Last synced: 02 May 2026

https://github.com/sandeepbalachandran/predictor

A collection of prediction algorithms for different purposes

collection jupyter-notebook machine-learning notebook predictor regression-models scikit-learn

Last synced: 06 May 2026

https://github.com/deaneeth/telco-churn-prediction-mlops

Production-ready ML pipeline for telco customer churn prediction using advanced ensemble methods (XGBoost, CatBoost, Random Forest). Handles class imbalance, provides business insights, and includes modular MLOps architecture. Built with scikit-learn, featuring comprehensive EDA, feature engineering, and business impact analysis.

catboost data-preprocessing ensemble-methods feature-engineering machine-learning mlops pipeline-development python random-forest scikit-learn telco-analytics xgboost

Last synced: 15 Apr 2026

https://github.com/varun-khorgade/cvinsight-ai-resume-analyzer

AI tool that analyzes resumes, extracts keywords, and matches them with job descriptions.

css django html5 nlp python scikit-learn textparse

Last synced: 06 May 2026

https://github.com/kieranlitschel/kerassearchcv

Built for the implementation of Keras in Tensorflow. Behaves similarly to GridSearchCV and RandomizedSearchCV in Sci-Kit learn, but allows for progress to be saved between folds and for fitting and scoring folds in parallel.

classification grid-search keras keras-tensorflow multithreading randomized-search scikit-learn

Last synced: 20 Apr 2026

https://github.com/nurulashraf/ann-cancer-prediction

An Artificial Neural Network built with TensorFlow and Keras to predict breast cancer based on the Wisconsin Breast Cancer dataset.

artificial-neural-network breast-cancer-prediction deep-learning keras machine-learning python scikit-learn tensorflow

Last synced: 06 May 2026

https://github.com/k-ashik/genescout-ai-genetic-disease-pathologist

GeneScout: An interpretable AI Pathologist that predicts 5 genetic diseases with 93.5% accuracy using an Ensemble Voting Classifier and SHAP for clinical explainability.

data-science explainable-ai healthcare-ai machine-learning precision-medicine python scikit-learn shap streamlit

Last synced: 20 Apr 2026

https://github.com/khaymanii/house-price-prediction-model

This model was built using Python and XGBoost Regression algorithm

matplotlib numpy pandas python scikit-learn

Last synced: 06 May 2026

https://github.com/dipa09/riot_imgclf

Multi-class image classifier for RIOT-OS

arduino-mega-2560 emlearn esp32-cam m2cgen micromlgen riot-os scikit-learn tinyml

Last synced: 30 Apr 2026

https://github.com/texnoforge/texnomagic

TexnoMagic library for digital Magic

gmm magic numpy python recognition scikit-learn scipy

Last synced: 03 Mar 2026

https://github.com/jagadishdas21/brain-tumor-detection

This repository contains the implementation of a deep learning model to detect brain tumors from MRI images using Convolutional Neural Networks (CNN). The goal of this project is to classify MRI images as either having a brain tumor (Positive) or not having one (Negative).

computer-vision convolutional-neural-networks matplotlib scikit-learn tensorflow

Last synced: 26 Feb 2026

https://github.com/prajwalsinha/unveiling-climate-change-dynamics-through-earth-surface-temperature-analysis

Climate change analysis through global surface temperature data. Includes data preprocessing, statistical analysis, visualizations, and forecasting. Python-based project using Pandas, Matplotlib, and Scikit-learn.

data dataanalysis dynamic-mapping pyplot python scikit-learn seaborn

Last synced: 10 Feb 2026

https://github.com/pankajarm/tabular_ml_toolkit

A helper library to jumpstart your machine learning project based on tabular or structured data.

data-science feature-engineering hyperparameter-tuning machine-learning parallelism python scikit-learn structured-data tabular xgboost

Last synced: 19 Jan 2026

https://github.com/akhil888binoy/intelligent-supplychain-management-system

Blockchain-powered supply chain management system with ML-driven sales prediction. Streamlines supplier-employee transactions and inventory management. Built with MERN stack, Solidity, and Flask.

blockchain decentralized-payments ethereum express flask foundry hackathon-project inventory-management machine-learning mern-stack mongodb nodejs python react sales-prediction scikit-learn smart-contracts solidity supply-chain-management wagmi

Last synced: 09 Oct 2025

https://github.com/ghufranbarcha/codsoft-machine-learning-internship

This repository contain all Machine Learning & NLP task during my internship at Codsoft.

jupyter-notebook machinelearning nlp nltk python scikit-learn

Last synced: 17 Apr 2026

https://github.com/elcorto/gp_playground

Explore selected topics related to Gaussian processes

gaussian-processes gpy gpytorch kernel-ridge-regression machine-learning scikit-learn tinygp

Last synced: 06 May 2026

https://github.com/grachale/predict_life_expect

Predicting life expectancy (regression) with usage of custom random forest, linear regression and decision tree regressor from scikit-learn.

decision-tree-regression jupyter-notebook linear-regression pandas python random-forest regression scikit-learn

Last synced: 05 May 2026

https://github.com/rixiiz/using-knn-to-predict-the-obp-of-mlb-players

Using KNN to predict the On Base Percentage (OBP) of Major League Baseball (MLB) players at the end of the season

artificial-intelligence dataset f1-score jupyter-notebook knn-regression machine-learning matplotlib mse numpy pandas python scikit-learn supervised-learning

Last synced: 05 Apr 2026

https://github.com/magnuss0/movie-rec-system

The project extracts movie data using TheMovieDB API, processes it using TF-IDF and cosine similarity for generating recommendations, and stores the data in a DuckDB database. The system is encapsulated within a FastAPI web application and can be deployed using Docker. It provides movie recommendations in JSON format.

cosine-similarity docker duckdb movies-recommendation moviesdb-api ploomber poetry-python scikit-learn streamlit tf-idf

Last synced: 14 Apr 2026

https://github.com/mohammadvhossein/ml-gym

The ML-GYM repository showcases machine learning projects using **scikit-learn**, covering classification, regression, and clustering. It offers educational resources for beginners and practical examples for experienced users, complete with detailed instructions.

classification-algorithms clustering-methods cross-validation data-preprocessing data-science decision-trees feature-engineering machine-learning model-evaluation neural-networks python-programming random-forests regression-techniques scikit-learn supervised-learning unsupervised-learning

Last synced: 06 May 2026

https://github.com/myounus-codes/saleprice-prediction-dataset-analysis-and-cleaning-advance-regression

In this project I have cleaned the data for the model. Project Google Colab Link: https://colab.research.google.com/drive/1vQY-XEFJSdEkW2PQOSf1j13Yk8L-XXNw?usp=sharing

algorithms data-analysis data-science eda google-colab machine-learning numpy pandas python scikit-learn scikit-learn-python

Last synced: 05 May 2026

https://github.com/wesslen/dsba6211-summer2024

DSBA6211 Adv Business Analytics Lab Notebooks

scikit-learn teaching

Last synced: 17 Apr 2026

https://github.com/gigdevelopment10/neuralfunk

A Machine learning resource library for funky ML-Learners

algorithm keras machine-learning optimization-algorithms py-torch python scikit-learn tensorflow

Last synced: 29 Apr 2026

https://github.com/pngo1997/yelp-business-recommender-system

Building an item-based collaborative recommendation system using embeddings for establishments from the Yelp dataset.

content-based-recommendation embeddings geo-mapping geospatial information-retrieval python recommender-system scikit-learn spacy

Last synced: 05 May 2026

https://github.com/elifftosunn/bert-bank-model

It is a Turkish BERT-based model that will analyze people's bank complaints and classify them according to one of eight categories.

countvectorizer doc2vec f1-score huggingface huggingface-transformer huggingface-transformers nlp nltk python3 scikit-learn stopwords tagged tfidf-transformer train-test-split word-tokenizer wordnetlemmatizer

Last synced: 12 May 2026

https://github.com/george-gca/ai_papers_search_tool

Automatic paper clustering and search tool by fastext from Facebook Research

fasttext fasttext-embeddings fasttext-python nlp python scikit-learn

Last synced: 02 May 2026

https://github.com/tromesh/sinhala-parser

Sinhala parser project is based on Natural Language Processing (NLP)

flux-architecture natural-language-processing nlp python react scikit-learn sinhala

Last synced: 05 May 2026

https://github.com/rakshit-vasava/predictive-analytics-for-insurance-purchase

Predicting customer insurance purchases using stacking models and SMOTE for the Homesite Quote Conversion Problem on Kaggle.

k-nearest-neighbours kaggle-competition multilayer-perceptron python random-forest scikit-learn smote support-vector-machines

Last synced: 05 May 2026

https://github.com/omanshu209/ml-basics-2022

Machine Learnings(AI) models developed using the scikit-learn library in Python.

jupyter-notebook machine-learning python python3 scikit-learn

Last synced: 06 May 2026

https://github.com/codenexa/nairobi

Quantifying Integrity in the Digital Age Misinformation spreads rapidly, accountability often falters, and the lines between transparency and manipulation blur

csv ipynb-jupyter-notebook matpotlib pkl-model python scikit-learn

Last synced: 05 May 2026

https://github.com/sorna-fast/breast-cancer-diagnosis-neural-network

ANN-based breast cancer classifier using the Wisconsin Diagnostic Dataset. Implements advanced feature engineering and achieves 98.25% test accuracy. Includes comprehensive EDA, model training, and clinical impact analysis

keras-classification-models keras-neural-networks keras-tensorflow matplotlib-pyplot pandas-dataframe scikit-learn seaborn-plots sklearn-library tensorflow

Last synced: 20 Apr 2026

https://github.com/glencrawford/matchmaker

A k-nearest neighbors machine learning project to perform similarity matching using a dataset of OkCupid dating profiles.

django machine-learning python scikit-learn scipy

Last synced: 06 May 2026

https://github.com/kartikdixit2468/advanced-jarvis-ai-using-python

An A.I voice assistant in python using simple machine learning algorithms and BardAPI.

bard bardapi jarvis machine-learning python scikit-learn voice-assistant voice-recognition

Last synced: 16 Apr 2026

https://github.com/sralter/happy_customers

Predicting whether a customer is happy based on the results from a survey.

eda ensemble-classifier hyperopt lazypredict ml scikit-learn

Last synced: 21 Apr 2026

https://github.com/nirmalyabag20/diabetes-prediction-using-machine-learning

This project focuses on predicting diabetes using machine learning algorithms based on health metrics like glucose levels, blood pressure, and BMI. By comparing different models, the goal is to identify the most accurate approach for early diabetes detection, showcasing the potential of machine learning in healthcare.

decision-tree-classifier jupyter-notebook kneighborsclassifier logistic-regression matplotlib numpy pandas python random-forest-classifier scikit-learn seaborn svc

Last synced: 18 Jan 2026

https://github.com/aryansk/fake-news-detection

A sophisticated machine learning solution to detect fake news using multiple classification algorithms. Identify the credibility of news articles with advanced text analysis techniques!

fake-news-detection machine-learning machine-learning-algorithms matplotlib numpy pandas python random-forest-classifier scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/sivatsk26/university-admit-eligibility-predictor

This project is created using Machine Learning and Regression methods- a statistical technique to predict the outcome of event which is to verify the users’ admission eligibility level, considering the universities they have chosen. This is achieved based on the algorithms implemented, when is user feed the application with the required information

html-css-javascript ibm-cloud ibm-watson linear-regression machine-learning matplotlib numpy pandas python python-flask random-forest scikit-learn

Last synced: 13 Apr 2026

https://github.com/rohitpawar001/bone_marrow_surival_prediction

Bone marrow transplants can be life-saving, but predicting patient survival is complex. In this project, I used machine learning to analyze key medical factors and improve survival predictions. I also implemented CI/CD pipelines, used MLflow for model tracking, and deployed the model on an AWS EC2 instance.

aws docker ec2-instance flask machine-learning mlflow python scikit-learn

Last synced: 08 Apr 2026

https://github.com/strcoder4007/machine-learning-deep-learning-practice

Implementation of Linear/Logistic Reg, K-NN, SVM, Clustering, K-Means, ConvNet, ResNet, MobileNet, RNN, LSTM etc. using Pandas, SciKitLearn, NumPy & TensorFlow 2

convolutional-neural-networks matplotlib scikit-learn tensorflow2

Last synced: 15 May 2026

https://github.com/samarthmule/chatbot

This project implements a generic chatbot using Natural Language Processing (NLP) and Machine Learning techniques. The chatbot is designed to classify user input into predefined intents and provide context-aware responses. The solution is scalable, interactive, and suitable for various domains.

chatbot internship machine-learning machine-learning-algorithms nlp nltk project-repository python python3 scikit-learn streamlit

Last synced: 13 Apr 2026

https://github.com/mgckaled/ignite-devia-supervised_algorithms

Repositório que reuni os módulos 7 ao 13 da Formação Desenvolvimento IA 2023-2024, desenvolvido pela Rocketseat Education.

gradio joblib pandas python scikit-learn statsmodels uvicorn

Last synced: 12 Apr 2026

https://github.com/ksatrajit0/heart-disease-prediction-ml

Predicts the risk of heart attack in a patient using their medical record

heart-disease-prediction machine-learning matplotlib numpy pandas scikit-learn seaborn

Last synced: 19 Apr 2026

https://github.com/kaleharshavardhan07/spam_mail-_detector_ai_model

This project implements a spam detection system for SMS messages using machine learning techniques.

mathplotlib nltk numpy panda python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/benman1/python-time-series

Time-Series analysis, statistical and machine learning models for forecasting, regression, and classification

darts deep-learning forecasting mlforecast nixtla scikit-learn statsforecast time-series time-series-analysis

Last synced: 22 Feb 2026

https://github.com/imswappy/brain-tumor-detection

🧠 Deep learning project for brain tumor classification using MRI images. Built with transfer learning (VGG16 + fine-tuning), TensorFlow/Keras, and deployed via Streamlit. Dataset & model loaded dynamically from KaggleHub. Includes training notebook, evaluation, and interactive web app.

kagglehub keras numpy pandas scikit-learn streamlit tensorflow vgg16-model

Last synced: 13 Apr 2026

https://github.com/supriya811106/healthcare-recommedation-system

A Flask-based web app that predicts diseases based on symptoms and recommends specialized doctors. It uses machine learning for accurate health predictions and location-based doctor searches.

css flask-application healthcare-application html javascript machine-learning numpy pandas recommendation-system scikit-learn

Last synced: 04 Mar 2026

https://github.com/abz4375/recommendersystem

A sophisticated recommender system that leverages web mining techniques to help users find hotels that match their preferences.

cosine-similarity css html javascript pandas python scikit-learn selenium selenium-webdriver

Last synced: 13 Apr 2026

https://github.com/thananjaya/admission_chance_prediction

Admission Chance Prediction using linear regression, wrapped up using Flask framework

flask linear-regression machine-learning python3 scikit-learn

Last synced: 17 Apr 2026

https://github.com/sizzlins/kalkulator-ai

A Simple Command Line Input Symbolic Regression Engine and Computer Algebra System (CAS) capable of discovering the laws of the universe, solving calculus, algebra, and trigonometrics.

calculator calculus cli computer-algebra-system curve-fitting machine-learning mathematics numpy physics python scientific-computing scikit-learn sparse-regression symbolic-regression sympy

Last synced: 13 Jan 2026

https://github.com/tasninanika/callifornia-housing-price-prediction-svr

Support Vector Regression (SVR) is a type of Support Vector Machine used for predicting continuous values.

matplotlib numpy pandas python3 scikit-learn seaborn svm-regression

Last synced: 11 Apr 2026