scikit-learn
scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.
- GitHub: https://github.com/topics/scikit-learn
- Wikipedia: https://en.wikipedia.org/wiki/Scikit-learn
- Repo: https://github.com/scikit-learn/scikit-learn
- Created by: David Cournapeau
- Released: January 05, 2010
- Related Topics: scikit, python
- Aliases: sklearn
- Last updated: 2026-06-25 00:23:58 UTC
https://github.com/apfirebolt/spam_email_classifier
An email classifier using CountVectorizer and a Naive Bayes strategy. PyQt5 is used for the GUI.
count-vectorizer naive-bayes-classifier pandas pyqt5 python scikit-learn
Last synced: 08 May 2026
https://github.com/muscaanmnmnm/breast-cancer-detector
A predictive model for breast cancer detection using K-Nearest Neighbors, demonstrating the impact of feature scaling on model performance and recall.
breast-cancer-wisconsin data-science feature-scaling jupyter-notebook knn-classification machine-learning pandas-dataframe python-3 scikit-learn
Last synced: 06 Sep 2025
https://github.com/1adore1/deadlock-match-tracker-bot
Telegram bot for tracking real-time Deadlock matches for the top 250 players on the leaderboard. Fetches match data and predicts winners using a machine learning model.
aiogram api deadlock optuna pandas python scikit-learn
Last synced: 13 Apr 2026
https://github.com/vatshayan/pokemon-analysis
Visualization, Analysis & Predicting the accuracy of finding Pokemon power, attack & speed through Machine Learning
artificial-intelligence data data-analysis data-science data-visualization dataset machine-learning machine-learning-algorithms pokemon scikit-learn
Last synced: 30 May 2026
https://github.com/vyjayanthipolapragada/kmeans_clustering_customer_analysis
Using the KMeans algorithm to analyse real customer datasets and draw valuable insights to boost business strategy
algorithms analysis customer-data jupyter-notebook kmeans-clustering machine-learning matplotlib pandas python scikit-learn
Last synced: 13 Apr 2026
https://github.com/ot-code/coca-cola-stock-prediction
This repo compares four predictive models—Linear Regression, ARIMA, XGBoost, and LSTM—to forecast Coca‑Cola FEMSA stock closing prices using Python and five years of historical data.
arima csv linear-regression lstm-neural-networks mae matplotlib mse numpy pandas python r2 scikit-learn seaborn tensorflow-keras xgboost
Last synced: 13 Apr 2026
https://github.com/nikhilakki/predicting-the-gender-of-the-riders-of-new-york-s-citi-bikes
Predicting the Gender of the riders of New York Citi Bikes (2015-2017)
data-science decision-trees feature-engineering machine-learning pandas python scikit-learn
Last synced: 13 Apr 2026
https://github.com/grandechowhiskey/fcc-data_analysis-projects
A collection of projects completed as part of the FreeCodeCamp "Data Analysis with Python" certification. These projects cover statistical calculations, data visualization, and trend analysis using real-world datasets.
data-analysis data-visualization matplotlib pandas python3 scikit-learn seaborn
Last synced: 01 May 2026
https://github.com/arindal1/breast-cancer-detection
A simple Neural Network model to detect Breast Cancer.
machine-learning neaural-network scikit-learn tensorflow
Last synced: 13 Apr 2026
https://github.com/NoName115/Bachelor-thesis
Bachelor thesis - Determination of Gun Type and Position in Image Scene
bachelor-thesis classification computer-vision fit gun keras machine-learning scikit-image scikit-learn vut
Last synced: 11 Mar 2025
https://github.com/dineshh912/analysis_stock_price_data
Experiment analysis of stock price data with python3
data-analysis data-visualization financial-data python3 scikit-learn stock-price-prediction
Last synced: 24 Apr 2026
https://github.com/veronsheva/global_food_wastage
Global Food Wastage Analysis
analysis data data-analitics pandas predictions python scikit-learn seaborn visualization
Last synced: 18 Apr 2026
https://github.com/nicolascoiado/nivel-mar
This project performs a detailed analysis of global mean sea level (GMSL), using a public dataset that covers historical measurements. The goal is to explore trends, calculate the average rate of rise, and visualize the data through charts.
google-colab jupyter-notebook matplotlib numpy pandas python python3 scikit-learn
Last synced: 11 Mar 2025
https://github.com/thinker84/real-time-stock-price-prediction-and-market-analysis-using-machine-learning
Real-time stock price prediction app using LSTM, Streamlit, and historical data (2010–2023). Forecasts next 10 days & visualizes trends.
data-science django lstm machine-learning numpy pandas pandas-datareader scikit-learn stock-market stock-price-prediction stooq streamlit yahoo-finance yahoo-finance-api
Last synced: 13 Jul 2025
https://github.com/hetuvpatel/brain-stroke-prediction
Machine Learning project for predicting stroke risk using healthcare data. Includes EDA, preprocessing, SMOTE, feature selection (RFE), evaluation of Logistic Regression, Decision Tree, Random Forest, KNN, SVM, and Stacked Ensemble models.
data-mining ensemble-learning healthcare machine-learning predictive-modeling python rfe scikit-learn smote
Last synced: 17 May 2026
https://github.com/omar7001-b/data-miner
DataMiner is an interactive web application for data mining and machine learning. It helps users upload, clean, transform, and analyze datasets while building predictive models — all through a simple and powerful Streamlit interface.
data-cleaning data-mining data-preprocessing data-science data-visualization interactive-dashboards pandas python scikit-learn streamlit
Last synced: 28 Apr 2025
https://github.com/pksvv/machinelearning_svm
Various implementations of the Support Vector Machine algorithm
machine-learning python scikit-learn support-vector-machine
Last synced: 04 May 2026
https://github.com/mastermindromii/car-price-prediction-model
A regression project that predicts car prices using Linear Regression.
linear-regression matplotlib numpy pandas python scikit-learn seaborn
Last synced: 13 Apr 2026
https://github.com/dharma-acha/imageclassification
This project is an interactive Streamlit web application using the VGG-13 model to classify images from the CIFAR-10 dataset. Users can upload images to receive real-time predictions and visual explanations of the model's decisions. The goal is to accurately classify images into one of the ten CIFAR-10 classes: airplanes, automobiles, birds, cats,
colab-notebook matplotlib numpy pandas python3 pytorch scikit-learn seaborn streamlit
Last synced: 13 Apr 2026
https://github.com/vishant007/annadataa
A Website For Farmers To Guide Them Regarding Crop Production In Their Native Language
django flask-application google-collab kaggle machine-learning-algorithms numpy pandas python3 scikit-learn
Last synced: 13 Apr 2026
https://github.com/aml-hassan-abd-el-hamid/finding-donors-for-charityml
Predicting salary of the people based on various data about them
machine-learning python scikit-learn supervised-learning udacity-machine-learning-nanodegree
Last synced: 08 May 2026
https://github.com/ozcankyo28/ds-ml-bootcamp
📊 Master data science and machine learning in one month with hands-on projects, covering the complete ML workflow from data collection to deployment.
data-science datascience jose-portilla lgbm lgbmregressor machine-learning matplotlib-pyplot python regression-models scikit-learn seaborn tensorflow udemy-course-project udemy-machine-learning
Last synced: 14 Apr 2026
https://github.com/codecraft-sanju/medvisionai-medical-image-ai-vision.
MedVisionAI is an AI-powered platform that analyzes ultrasound images to detect PCOS and provide actionable recommendations. Using CNN-based deep learning and generative AI, it ensures fast, accurate diagnosis, reduces errors, and supports clinicians with instant insights all while maintaining patient privacy and compliance.
deep-learning fastapi gemini-api genai keras-tensorflow machine-learning matplotlib python react scikit-learn seaborn tailwindcss tensorflow
Last synced: 07 Sep 2025
https://github.com/cesar312/python-data-science-toolbox
A collection of useful data science tools and techniques
data-science jupyter-notebook pandas python scikit-learn statistics visualization
Last synced: 13 Apr 2026
https://github.com/djb15/machine-learning-project
Machine learning project 2018 - Imperial College London
machine-learning project python3 scikit-learn scikitlearn-machine-learning university university-project
Last synced: 27 Apr 2026
https://github.com/wilfordaf/ml-sect-introduction-task
Test task for a students' association
classic-machine-learning keras machine-learning regression-models scikit-learn
Last synced: 28 Feb 2025
https://github.com/mmerlyn/analysis-of-tomato-prices
Forecasting tomato prices in Karnataka using machine learning to help farmers make better crop planning and selling decisions.
css flask html matplotlib numpy pandas python scikit-learn seaborn
Last synced: 06 Jul 2025
https://github.com/lukacerr/lovelytics
Lovelytics technical task for AI engineer position
ai-agents deepagents langchain ml python scikit-learn
Last synced: 31 May 2026
https://github.com/okerx/spotifymoods
A simple ML model to classify Spotify tracks using audio features.
machine-learning pandas python scikit-learn
Last synced: 09 May 2026
https://github.com/sasank-sasi/subtheme-sentiment-analysis-for-review
"Comprehensive Subtheme Sentiment Analysis of Customer Reviews Using Advanced NLP Techniques"
matplotlib natural-language-processing nltk plotly python scikit-learn spacy vader-sentiment-analysis
Last synced: 05 Feb 2026
https://github.com/anirudh-pulavarthy/car-evaluation-using-smote
machine-learning python scikit-learn smote-sampling
Last synced: 24 Apr 2026
https://github.com/subratamondal1/heart-attack-prediction
Heart Attack Prediction of patients based on the required data. Data Ingestion - Data Preparation - Exploratory Data Analysis (EDA) - Modelling - Evaluation.
data-analysis data-science data-visualization kaggle-dataset machine-learning matplotlib-pyplot numpy pandas python3 scikit-learn seaborn
Last synced: 09 Apr 2026
https://github.com/duruii/contest-dingtalkcup2-a
The 2nd DingTalk Cup College Student Big Data Challenge (2023): Smartphone User Monitoring Data Analysis
data-mining machine-learning pandas scikit-learn xgboost
Last synced: 12 Mar 2025
https://github.com/abhipatel35/svm-hyperparameter-optimization-for-breast-cancer
Utilizing SVM for breast cancer classification, this project compares model performance before and after hyperparameter tuning using GridSearchCV. Evaluation metrics like classification report showcase the effectiveness of the optimized model.
breast-cancer cancer-diagnosis classification data-analysis data-science gridsearchcv healthcare hyperparameter-tuning jupyter-notebook machine-learning medical-imaging pycharm python scikit-learn support-vector-machine svm
Last synced: 05 Feb 2026
https://github.com/hotequil/computer-vision
Study about computer vision.
jupyter-notebook matplotlib numpy python scikit-learn
Last synced: 13 Apr 2026
https://github.com/oceanuz/car-price-regression
A comprehensive ML evaluation and improvement notebook for a car price prediction model. It includes topics such as scoring with r2, cross-validation, overfitting/underfitting diagnosis, and polynomial regression. Ridge regression is applied to reduce overfitting, and GridSearchCV is used to find the best alpha hyperparameter.
cross-validation data-science grid-search hyperparameter-tuning machine-learning machine-learning-models model-evaluation overfitting python regression ridge-regression scikit-learn
Last synced: 11 Dec 2025
https://github.com/18mahi/digital_cave
An intermediate-level deep learning project that compares Convolutional Neural Networks (CNN) and Multi-Layer Perceptrons (MLP) on the MNIST handwritten digits dataset. This project demonstrates data augmentation, learning rate scheduling, and visual comparison of model performance
cnn confusion-matrix data-augmentation data-science deep-learning evaluation-metrics jupyter-notebook keras learning-rate-scheduler machine-learning matplotlib mlp numpy python3 scikit-learn seaborn tensorflow
Last synced: 13 Apr 2026
https://github.com/pranavsp108/time-series-forcasting
A time-series forecasting project to predict hourly energy consumption using Python, Pandas, and an XGBoost regression model.
data-analysis data-science energy-consumption forecasting matplotlib numpy pandas python scikit-learn sustainability time-series xgboost
Last synced: 10 Apr 2026
https://github.com/ahmadbuilds/fake-news-classifier
Classifies news articles as real or fake using an NLP pipeline with TF-IDF + n-grams and machine learning models. Includes text preprocessing, feature engineering, model training, and evaluation.
fastapi logistic-regression matplotlib n-grams nextjs nltk numpy pandas python3 random-forest-classifier react scikit-learn seaborn supervised-learning tf-idf typescript xgboost-classifier
Last synced: 11 Apr 2026
https://github.com/pranavsp108/financial-fraud-detection
A comprehensive machine learning project for detecting financial fraud using XGBoost and LightGBM, with a focus on advanced feature engineering, class imbalance handling, and hyperparameter tuning.
classification-model data-science feature-engineering fraud-detection hyperparameter-tuning lightgbm machine-learning pandas python scikit-learn xgboost
Last synced: 04 May 2026
https://github.com/imehranasgari/mlflow_starter
This project is a hands-on guide to the complete end-to-end MLflow workflow, designed as an educational resource. It demonstrates how MLflow is used in practice for experiment tracking, model versioning, and ensuring a reproducible MLOps lifecycle, focusing on the methodology and best practices rather than high model accuracy.
data-science experiment-tracking mlflow mlops model-registry python scikit-learn
Last synced: 11 May 2026
https://github.com/njorogepaul-moghul/house-price-predictions-kaggle-competition-
Built a predictive model for the Kaggle House Prices competition using feature engineering and LightGBM, achieving strong leaderboard performance.
data-science house-price-prediction-with-lightgbm kaggle-competition lightgbm machine-learning predicting-home-values-using-machine-learning random-forest scikit-learn
Last synced: 15 May 2026
https://github.com/kostadinlambov/time-series-forecasting
This project evaluates the predictive performance of a CNN-LSTM Hybrid deep learning model for Bitcoin price movement prediction.
keras-tensorflow matplotlib-pyplot mlflow numpy optuna pandas python scikit-learn seaborn statsmodels ta-lib tensorflow
Last synced: 07 Apr 2026
https://github.com/javedfazlulahf/customer-churn-prediction
📊 Predict customer churn in telecom using machine learning to enhance retention strategies and drive better business outcomes.
churn-prediction cross-validation data-science factorization-machines imbalanced-learn libsvm machine-learning model-evaluation pipelines plotly scikit-learn seaborn shap-values spark-ml survival-analysis tensorflow watson-studio xgboost4j
Last synced: 11 May 2026
https://github.com/kianaabrisham/stroke-prediction-ml-pipeline
Clinical ML pipeline with ROC/PR and interpretability
class-imbalance clinical-data healthcare interpretability machine-learning pandas pipeline precision-recall roc-auc scikit-learn
Last synced: 05 Oct 2025
https://github.com/nihanthbhargav/time-series-stock-market
This project combines computer vision and NLP by segmenting pet images with a U-Net model and generating captions using CNN-RNN/LSTM. Using the Oxford-IIIT Pets dataset, it demonstrates a unified pipeline that integrates pixel-level segmentation with automatic caption generation for meaningful image understanding.
matplotlib numpy pandas plotly python scikit-learn seaborn
Last synced: 11 Apr 2026
https://github.com/inesruizblach/data-science-project
A data science project exploring Portuguese "Vinho Verde" wine quality prediction. Features EDA, feature engineering, ML models, and evaluation using Python, pandas, scikit-learn, and visualization tools.
binary-classification classification data-science exploratory-data-analysis feature-engineering imbalanced-learn jupyter-notebook machine-learning model-evaluation pandas regression scikit-learn seaborn uci-dataset wine-quality
Last synced: 09 May 2026
https://github.com/scorchinghot/core-machine-learning-exploration
This repository provides a hands-on exploration of classical machine learning algorithms applied to the MovieLens 100k dataset, aiming to build intuition and understanding of core ML concepts.
core-ml data-science hands-on machine-learning ml-algorithms python scikit-learn tutorial
Last synced: 05 Oct 2025
https://github.com/josepablodmg/python--linear-regression---housing-exercise
A predictive analysis exploring the relationship between household characteristics and median income in California. Using linear regression, the project investigates whether blocks with fewer households correspond to higher median incomes.
california data-analysis data-science exploratory-data-analysis housing-data linear-regression machine-learning python regression scikit-learn statistics visualization
Last synced: 05 Oct 2025
https://github.com/robertfmath/mammal-taxonomy-gradient-boosting-classifier
Predicting mammalian taxonomic order based on ecological, geographic, and life-history traits
classification gradient-boosting gradient-boosting-classifier machine-learning mammals python scikit-learn taxonomic-classification taxonomy
Last synced: 06 Oct 2025
https://github.com/vivekky57/car-price-prediction
Now you can get Car Price with this wonderful end-to-end project.
flask machine-learning machine-learning-algorithms python python3 random-forest-classifier scikit-learn
Last synced: 13 Apr 2026
https://github.com/sora468/best-of-ml-python
🏆 Discover top-ranked Python libraries for machine learning, updated weekly to help you find the best tools for your projects.
airport airport-simulation chatgpt configuration data-analysis data-science data-visualization data-visualizations gpt keras machine-learning nlp python scikit-learn tensorflow transformer usg-ai-training-data usg-artificial-intelligence
Last synced: 09 May 2026
https://github.com/harris-giki/e-comdataanalysis_ml
E-commerce Customer Analysis with Linear Regression: analyzes customer behavior within an e-commerce setting and predicts yearly customer spending based on various features using a linear regression model.
development ecommerce linear-regression machine-learning model prediction-model python scikit-learn
Last synced: 14 Apr 2026
https://github.com/dukebw/ml-model-selection
Machine learning model selection using Dlib and scikit-learn.
dlib machine-learning ranking scikit-learn
Last synced: 07 Oct 2025
https://github.com/sducournau/ign_lidar_hd_dataset
🏗️ Comprehensive Python library for processing IGN LiDAR HD data into machine learning-ready datasets for Building Level of Detail (LOD) classification. Features GPU/CPU processing, smart data management, and complete ML pipeline integration.
building-classification data-processing dataset france geospatial gis ign lidar lidar-hd numpy point-cloud scikit-learn
Last synced: 20 Jan 2026
https://github.com/prarthana-singh/bangalore-house-price-predictor
🏡 Bangalore House Price Prediction – A Machine Learning model to predict house prices in Bangalore using real estate data. Built with Linear Regression, Python, Pandas, NumPy, and Scikit-Learn.
data-analysis eda house-price-prediction linear-regression machine-learning numpy pandas python real-estate regression scikit-learn
Last synced: 19 Apr 2026
https://github.com/arish-mhrjn/aimodelinspector
A fairly comprehensive Python library allowing for exploration, self-education and categorization of AI models
ai analysis coreml-models diffusers diffusion-models ggml hdf5-format jax model-discovery model-insights openvino-models pytorch scikit-learn scikitlearn-machine-learning
Last synced: 07 Oct 2025
https://github.com/ercodex/predicting-new-medicines-with-sgd-classifier
"Shittiest Model of the Competition" award winner!
classifier-model dask kaggle-competition machine-learning numpy python scikit-learn
Last synced: 14 Apr 2026
https://github.com/r-gg/ml-37
Amazon Reviews ~ Sentiment analysis evaluation: fine-tuned BERT vs LSTM. (+ Extensive Data Mining & Visualization)
bert deep-learning ipynb-jupyter-notebook lstm machine-learning python scikit-learn uni-project
Last synced: 05 Feb 2026
https://github.com/shubhamsoni98/classification-with-random-forest-1
To classify sales into categories (Low, Moderate, High) using Random Forests to inform strategic decisions and optimize marketing strategies.
algorithms anaconda data data-science datacleaning eda jupyter-notebook machine-learning pyhton random-forest scikit-learn visualization
Last synced: 18 Jan 2026
https://github.com/madsondeluna/mvp_pucrio_data_analytics_and_machine_learning
MVP for the Machine Learning & Analytics sprint (40530010056_20250_01) of the Postgraduate Program in Data Science and Analytics at PUC-Rio.
comparative-analysis data-analytics data-science machine-learning-algorithms postgraduate-course python pytorch scikit-learn
Last synced: 03 May 2026
https://github.com/hiftd22/wpattern
📈 Analyze and visualize stock patterns with wPattern for better trading insights. Simplify your investment decisions through data-driven analysis.
cli finance financial-analysis financial-data matplotlib numpy pandas pattern-recognition python scikit-learn stock-scanner technical-analysis yfinance
Last synced: 29 Apr 2026
https://github.com/pranavsp108/market_basket_analysis-instacart
Customer segmentation and market basket analysis using the Instacart dataset with Python, Pandas, and K-Means clustering.
customer-segmentation-and-buying-behavior data-analysis data-visualization instacart jupyter-notebook kmeans-clustering market-basket-analysis pandas python scikit-learn
Last synced: 05 May 2026
https://github.com/a-iftikhar02/telco-churn-pipeline
End-to-End Customer Churn Prediction Pipeline using Scikit-learn, GridSearchCV, and Gradio. Automatically preprocesses Telco data, tunes Logistic Regression & Random Forest models, and deploys an interactive web app for real-time churn predictions.
automation churn-prediction gradio gridsearchcv hyperparameter-tuning joblib machine-learning ml-pipeline python scikit-learn streamlit telco-churn
Last synced: 08 Oct 2025
https://github.com/fersihela/ml_project
Machine learning project developed for the Data Science course at Esprit School of Engineering. Focuses on predictive modeling using Python and scikit-learn.
deep-learning esprit-school-of-engineering machine-learning predective-modeling python scikit-learn
Last synced: 25 Feb 2026
https://github.com/enyaude/california_house_price_prediction
Developed a California house price prediction model utilizing linear regression and Random Forest, and applied machine learning techniques such as Ridge and Lasso for optimization in Python.
jupyter-notebook linear-regression python random-forest scikit-learn streamlit
Last synced: 23 Feb 2026
https://github.com/greatepee/book-recommendation-system
Book Recommendation System
collaborative-filtering cosine-similarity evaluation-metrics k-precision kaggle kaggledatasets mae numpy pandas python scikit-learn scipy svd svd-matrix-factorisation
Last synced: 12 Mar 2026
https://github.com/agrawalchaitany/teamelytra_solar_eff_prediction_model
A robust, modular machine learning pipeline for predicting solar panel efficiency, featuring domain-specific preprocessing, advanced feature engineering, model training, evaluation, selection, and easy batch prediction from CSV files.
hackerearth-solutions lightgbm-regressor machine-learning optuna python3 scikit-learn solar-panel-efficiency xgboost-regression zelestra
Last synced: 09 Oct 2025
https://github.com/hmotez/medclaimml
An AI-powered machine learning application designed to process healthcare reimbursement claims. It analyzes medical documents, client information, insurance policies, and legal regulations to predict accurate reimbursement amounts, ensuring efficiency, compliance, and fraud detection.
healthcare jest-tests mern-stack mongodb nodejs nosql numpy pytorch react scikit-learn tailwindcss
Last synced: 12 Apr 2026
https://github.com/lorenzorottigni/dl-houses
Deep Learning python bootcamp: exploratory data analysis & deep learning on houses price dataset
deep-learning ipynb machine-learning numpy pandas python scikit-learn seaborn tensorflow
Last synced: 11 Apr 2026
https://github.com/bhavnanahar/coral_project
This project analyzes and predicts the health of coral reefs in the Florida Keys by examining coral cover and species richness trends over time.
colab-notebook matplotlib numpy pandas plotly python random-forest scikit-learn seabo
Last synced: 14 Apr 2026
https://github.com/animesh-sh/selectanime
django django-rest-framework react redux scikit-learn
Last synced: 14 Apr 2026
https://github.com/sohitbennett/roadsafe
A deep learning computer vision system for real-time traffic safety monitoring.
computer-vision esrgan keras numpy pandas python scikit-learn tensorflow tesseract-ocr yolov5 yolov8
Last synced: 08 Apr 2026
https://github.com/shadmanshaikh/ml_algo_from_scratch
All standard machine learning algorithms from scratch in python 🐍
classification deep-learning machine-learning neural-nets python regression scikit-learn
Last synced: 09 May 2026
https://github.com/hirkojoba/fintrack
Full-stack financial tracking app with ML forecasting and AI insights. Built with Rails, PostgreSQL, Python/scikit-learn, and OpenAI API.
artificial-intelligence data-visualization fintech full-stack machine-learning openai postgresql python ruby-on-rails scikit-learn
Last synced: 14 Apr 2026
https://github.com/divyajnanakshi-cloud/phishing-detector
This project presents a Phishing Detection System implemented as a Chrome Extension designed to help users determine whether a website is legitimate or malicious
chrome-extension css hashing-algorithm html javascript python qr-code-processing random-forest scikit-learn steganography
Last synced: 14 Apr 2026
https://github.com/sharvesh1401/battsense
BattSense is a machine learning project focused on predicting the State of Health (SOH) of lithium-ion batteries using operational parameters such as voltage, current, temperature, and capacity. The model enables accurate, data-driven diagnostics for battery performance monitoring in electric vehicles and portable devices.
battery-diagnostics battery-health battery-health-prediction battery-soh data-analysis electric-vehicles energy-storage machine-learning predictive-maintenance python regression scikit-learn
Last synced: 07 May 2026
https://github.com/anjaliwork20/heart-failure-prediction
Heart Failure Prediction in Humans
failure-detection heart heart-disease machine-learning machine-learning-algorithms matplotlib numpy pandas python scikit-learn scikitlearn-machine-learning seaborn svc-model
Last synced: 21 Jan 2026
https://github.com/mylnikovdan/house_pricing
ml python scikit-learn visualization
Last synced: 19 Jan 2026
https://github.com/alisonmitchell/titanic
Exploration of a subset of the Titanic passenger manifest to create a predictive classification model to determine which passengers were more likely to survive.
deep-learning keras machine-learning matplotlib numpy pandas python scikit-learn scipy seaborn tensorflow
Last synced: 14 Apr 2026
https://github.com/priyanshulathi/air-quality-index-prediction
Machine learning based air quality index prediction using environmental and pollutant data to classify and forecast pollution levels.
machine-learning matplotlib numpy pandas python scikit-learn seaborn
Last synced: 19 Jan 2026
https://github.com/khushirajurkar/exoplanet-habitability-prediction-model
Predicts whether an exoplanet is habitable using ML. Handles class imbalance with ADASYN, tests multiple models, and saves the best one. Includes confusion matrices, ROC curves, and a clean Jupyter notebook
adasyn astroinformatics confusion-matrix exoplanets logistic-regression machine-learning multiclass-classification python roc-curve scikit-learn smote
Last synced: 06 May 2026
https://github.com/jeus0522/7-explore-different-classifier-ml-app
A project exploring various classification algorithms, showcasing their implementation, comparison, and evaluation using Python and scikit-learn.
k-nearest-neighbours knn random-forest scikit-learn streamlit support-vector-machine svm
Last synced: 21 Jan 2026
https://github.com/lemma-osu/sknnr
scikit-learn compatible estimators for various kNN imputation methods
classification gnn gradient-nearest-neighbor imputation k-nearest-neighbor knn most-similar-neighbor msn random-forest-nearest-neighbor regression rfnn scikit-learn sklearn-estimator
Last synced: 23 Feb 2026
https://github.com/katjaweb/king-county-house-price-prediction
This project aims to predict house prices based on various features such as square footage, number of rooms or location.
machine-learning python regression scikit-learn
Last synced: 19 Jan 2026
https://github.com/probokrishnacahya/machine_learning
IMT01306601-A Machine Learning
ipynb machine-learning matplotlib numpy pandas python scikit-learn seaborn
Last synced: 14 Apr 2026
https://github.com/lorenzorottigni/ml-breast-cancer
Machine Learning python bootcamp: Support Vector Machines using breast cancer dataset
ipynb machine-learning numpy pandas python scikit-learn seaborn support-vector-machines
Last synced: 14 Apr 2026
https://github.com/manome/python-supervised-learning
This project provides sample code for performing supervised learning.
conformal-prediction scikit-learn supervised-learning
Last synced: 19 Jan 2026
https://github.com/sudarshanc00/smishing
This project aims to classify text messages to detect potential smishing (SMS phishing) attacks. Using machine learning, the project provides a classifier that can differentiate between legitimate messages and smishing attempts, helping to prevent scams.
nltk numpy pandas python scikit-learn scipy
Last synced: 14 Apr 2026
https://github.com/afkewolczyk/data_science_bootcamp
A data science project to learn data science essentials such as pandas, Matplotlib, and scikit-learn
ai data-science machine-learning pandas scikit-learn
Last synced: 07 May 2026
https://github.com/ricardorobledo/paymentcardfrauddetection2025
Comparative analysis of probabilistic classification models for credit card fraud detection, focusing on model calibration and threshold optimization in highly imbalanced datasets.
imbalanced-learn matplotlib numpy pandas python3 scikit-learn search
Last synced: 14 Apr 2026
https://github.com/nikshithmenta/fake-news-detector
This repository contains a Streamlit web app designed for fake news detection. Users can input a news article, and the app will predict whether it's real or fake based on its content. It also allows users to choose between different vectorizers (TF-IDF or Bag of Words) and classifiers (Linear SVM or Naive Bayes) to customize the prediction model.
bag-of-words fake-news-detection linear-svc naive-bayes-classifier scikit-learn streamlit-application tf-idf
Last synced: 15 May 2026
https://github.com/1adityakadam/tweet-classification-using-nlp-techniques
This project classifies tweets as toxic or non-toxic using NLP and machine learning. It includes preprocessing, feature engineering, and models like Logistic Regression, Random Forest, and XGBoost on labeled tweet datasets. Technologies: Python, Pandas, NLTK, Scikit-learn, XGBoost.
nltk pandas python scikit-learn xgboost
Last synced: 05 May 2026
https://github.com/sudothearkknight/15-machinelearningprojects
A curation of 15 Machine Learning projects in various fields that are helping me gain a better understanding of the different machine learning tools, techniques, algorithms and methodologies.
classification-algorithm machine-learning machine-learning-algorithms natural-language-processing pycharm-ide python3 regression-models scikit-learn scikitlearn-machine-learning spam-detection
Last synced: 19 Jan 2026
https://github.com/vivekkdagar/naivebayesclassifier
Multinomial Naive Bayes Language Classification model
artificial-intelligence beautifulsoup4 college-project github joblib kaggle kaggle-dataset linux machine-learning multinomial-naive-bayes naive-bayes naive-bayes-classifier natural-language-processing popos pycharm python3 scikit-learn simple-project
Last synced: 19 Jan 2026
https://github.com/srijon57/disease-detector
A Python 3 practice project for detecting diseases based on symptoms
flask machine-learning pandas python3 scikit-learn typescript vite
Last synced: 11 Apr 2026
https://github.com/angelalim88/jakarta-air-quality-index-classification
This project classifies Jakarta's Air Quality Index (AQI) from 2010 to 2023 using machine learning models (Random Forest, MLP, SVM) based on pollutant concentrations.
data-analysis data-visua machine-learning scikit-learn tensorflow
Last synced: 13 Oct 2025
https://github.com/abideen-olawuwo/heart-diseases
A heart disease machine learning classification problem
decision-tree-classifier kneighborsclassifier logistic-regression matplotlib numpy pandas python random-forest-classifier scikit-learn seaborn
Last synced: 14 Apr 2026