scikit-learn
scikit-learn is a widely used Python module for classic machine learning. It is built on top of SciPy.
- GitHub: https://github.com/topics/scikit-learn
- Wikipedia: https://en.wikipedia.org/wiki/Scikit-learn
- Repo: https://github.com/scikit-learn/scikit-learn
- Created by: David Cournapeau
- Released: January 05, 2010
- Related Topics: scikit, python
- Aliases: sklearn
- Last updated: 2026-06-25 00:23:58 UTC
https://github.com/lorenzorottigni/ml-advertising
Machine Learning python bootcamp: logistic regression on advertising dataset
ipynb logistic-regression machine-learning numpy pandas python scikit-learn seaborn
Last synced: 16 Apr 2026
https://github.com/eljandoubi/predict-customer-churn-with-clean-code
Clean Code Principles
autopep8 logging numpy pandas pylint pytest scikit-learn
Last synced: 16 Apr 2026
https://github.com/sergeimakarovv/solar-panel-detection
Applying deep learning models to detect solar panel installations in satellite imagery and estimating their generation capacity
albumentations convolutional-neural-networks deep-learning geopandas pandas pvlib python pytorch rasterio scikit-learn wms-service
Last synced: 16 Apr 2026
https://github.com/drkbluescience/wids2024_challenge2_metastaticdiagnosisregression
This notebook presents an exploratory data analysis (EDA) and regression modeling approach for the WiDS Datathon 2024 Challenge #2.
catboost data-visualization ensemble-learning exploratory-data-analysis imputation-methods kfold-cross-validation machine-learning metastatic-breast-cancer regression-models scikit-learn tabular-data women-in-data-science
Last synced: 16 Apr 2026
https://github.com/pierluigi-failla/pipesnake
A pandas, sklearn-inspired pipeline data processor
feature-engineering feature-extraction pandas pipeline pipeline-data-processor python scikit-learn
Last synced: 03 Mar 2026
https://github.com/khaymanii/calories-burnt-prediction-model
This model was built using Python and the XGBoost regression algorithm.
matplotlib numpy pandas python scikit-learn
Last synced: 16 Apr 2026
https://github.com/thekartikeyamishra/customer-retention-predictor
The Customer Retention Predictor is a Python-based tool designed to help businesses predict customer churn using historical data. This project is particularly beneficial for small businesses and MSMEs in India, allowing them to identify customers at risk of leaving and take proactive measures to retain them.
joblib machine-learning numpy pandas python scikit-learn tinker
Last synced: 16 Apr 2026
https://github.com/meiyor/abatech_ai_test
This repository contains the files for an exploratory data analysis (EDA) of participant demographic and company data collected through the outsourcing service provided by ABATech, a company located in Colombia. It also evaluates three different classifiers for decoding the users' level of satisfaction.
keras python scikit-learn scikitlearn-machine-learning tensorflow
Last synced: 16 Apr 2026
https://github.com/ry4ngch/dsml
Data Science and Machine Learning Repository
data-science data-science-projects jupyter-notebook knn-classification machine-learning neural-network python regression scikit-learn supervised-learning unsupervised-learning
Last synced: 16 Apr 2026
https://github.com/archish27/pythontutorial
Python programming tutorial for new geeks who want to learn Python from scratch and apply it to various applications
matplotlib numpy pandas pygame python python-2 python-3 scikit-learn soup
Last synced: 01 Apr 2026
https://github.com/dan-niles/iris-ml
Machine learning on the Iris dataset
iris-dataset machine-learning scikit-learn
Last synced: 16 Apr 2026
https://github.com/shreeparab1890/indian-cricketer-classifier
This notebook builds a model that predicts an Indian cricketer from a given image. The project covers 8 Indian cricketers and classifies a given image among these 8 cricketers.
image-classification matplotlib numpy opencv pandas python random-forest-classifier scikit-learn sklearn streamlit
Last synced: 01 Apr 2026
https://github.com/leftcoastnerdgirl/supervised_learning
This project demonstrates supervised machine learning using scikit-learn.
classification-reports confusion-matrix jupyter-notebook numpy pandas-python pathlib scikit-learn sklearn
Last synced: 16 Apr 2026
https://github.com/sanikamal/deep-learning-atoz
A collection of deep learning architectures, models, code snippets, tips, and mini projects.
computer-vision deep-learning nlp scikit-learn skimage tensorflow
Last synced: 16 Apr 2026
https://github.com/capsuleismail/income-census-prediction
Predict whether an individual's annual income exceeds $50K based on census data. Also known as the "Census Income" dataset.
datascience jupyter-notebook machinelearning-python scikit-learn
Last synced: 16 Apr 2026
https://github.com/grupoguerreroherrera/ethical-ai-recruitment-audit
Bias audit toolkit reproducing the recruitment AI case from Activity 6 — Unidad 3, Electiva II Inteligencia Artificial Avanzada. Empirical analysis with reweighing mitigation, Model Card documentation, and APA 7 references.
academic-project algorithmic-auditing artificial-intelligence bias-mitigation disparate-impact ethical-ai fairness machine-learning model-card python random-forest recruitment-bias reweighing scikit-learn unesco-ai-ethics
Last synced: 03 Jun 2026
https://github.com/drorata/mnist-examples
ML examples for the MNIST dataset
machine-learning ml mnist python scikit-learn torch
Last synced: 19 Apr 2026
https://github.com/supershivam5/python_projects
💻 Python programming with NumPy, Pandas, Matplotlib. 🌟 Love exploring new technologies. Check out my projects!
matplotlib-pyplot numpy pandas scikit-learn seaborn
Last synced: 17 Apr 2026
https://github.com/priyanshul28/ml_eda_regression_energyconsumptionforecasting
An EDA and Machine Learning Time-Series Regression Forecasting exercise on the PMJE Energy Consumption dataset demonstrating time-series analysis and the use of Time-Series Split, XGBoost, etc. The model is optimized using hyperparameter tuning through GridSearchCV. A Rob Mulla guided exercise.
forcasting machine-learning numpy pandas scikit-learn time-series-analysis
Last synced: 17 Apr 2026
https://github.com/ejw-data/proj-food-inspections
Analyzing Chicago Food Inspection data for interesting insights by combining multiple data resources and performing feature engineering.
decision-trees pandas preprocessing python scikit-learn
Last synced: 17 Apr 2026
https://github.com/lingumd/credit_risk_analysis
Machine learning models for predicting credit risk in LendingClub dataset.
balancedrandomforestclassifier classification-report cluster-centroids-undersampling confusion-matrix easyensembleclassifier get-dummies google-colab imbalanced-learn machine-learning matplotlib-pyplot numpy pandas pathlib randomoversampler scikit-learn smote smoteenn
Last synced: 17 Apr 2026
https://github.com/archismwanchatterjee/football-match-prediction
Football Match Predictor for top leagues
ensemble-machine-learning machine-learning pandas random-forest-classifier rmse-score scikit-learn streamlit
Last synced: 17 Apr 2026
https://github.com/abhishek-k-git/soulmegle
expressjs flask html nodejs numpy python3 reactjs scikit-learn socket tailwindcss
Last synced: 01 Apr 2026
https://github.com/erikglz/coap-mtd
Repository for an IoT security project implementing Moving Target Defense (MTD) through CoAP protocol randomization to mitigate spoofing attacks and enhance adaptive security.
coap-protocol cybersecurity iot machine-learning python scikit-learn spoofing
Last synced: 17 Apr 2026
https://github.com/zenklinov/regression_logistic_-_sentiment_analysis_movie_data
This repository contains code for performing sentiment analysis using scikit-learn and logistic regression.
llm natural-language-processing nlp nltk scikit-learn sentiment-analysis
Last synced: 10 May 2026
https://github.com/dimdasci/car-price-prediction-demo
Demo project of EDA and regression task solution: Pandas, Jupyter Notebook, Scikit-learn, LightGBM
eda lightgbm-regressor regression scikit-learn
Last synced: 03 Jun 2026
https://github.com/danicc097/python-ml-app
Various [arguably useless] Machine Learning services with gRPC and OpenTelemetry for demo purposes
grpc-python opentelemetry scikit-learn
Last synced: 17 Apr 2026
https://github.com/kheriberto/knn_project
This is a simple project that uses dummy data to practice and demonstrate my knowledge of the KNN algorithm.
data-analysis knn-classifier numpy python scikit-learn seaborn
Last synced: 02 Apr 2026
https://github.com/amirmohammadgholampour/mall-customer-segmentation
Project for segmenting customers in a shopping mall using a clustering algorithm.
numpy pandas python scikit-learn
Last synced: 02 Apr 2026
https://github.com/akshitvats026/heart_disease_prediction
An ML-based Heart Disease Prediction System that predicts the likelihood of heart disease based on user health parameters. Built using Python, Pandas, and Scikit-learn, the system performs data preprocessing, trains a predictive model, and provides real-time predictions with high accuracy.
accuracy-score logistic-regression machine-learning matplotlib-pyplot numpy pandas python scikit-learn
Last synced: 02 Apr 2026
https://github.com/satyas567/weatherdataanalysis
Comprehensive Weather Data Analysis with Python: Explore trends, visualize patterns, detect outliers, and predict temperature using humidity and wind speed
jupyter-notebook linear-regression matplotlib numpy pandas python scikit-learn seaborn
Last synced: 02 Apr 2026
https://github.com/raphael-ufrj/analise_algodao
Historical analysis of cotton planting, examining planting based on climate and historical data.
analysis data-science data-visualization dataset docker pandas provenance python python3 scikit-learn seaborn streamlit
Last synced: 02 Apr 2026
https://github.com/anshvaid4/ml_practice
This is the new repository, where I have added all the notebooks demonstrating the usage of various transformers and models for supervised and unsupervised algorithms.
anaconda jupyter-notebook machine-learning machine-learning-algorithms python scikit-learn
Last synced: 17 Apr 2026
https://github.com/prashver/end-to-end-model-deployment-on-aws
Student Performance Analysis with Machine Learning analyzes factors impacting student outcomes using a robust machine learning pipeline. Achieving an impressive R2 score, it predicts student performance effectively. With extensive data preprocessing and deployment on AWS Elastic Beanstalk, it ensures scalability and high availability.
amazon-web-services aws-elastic-beanstalk end-to-end-deployment flask machine-learning-algorithms matplotlib numpy pandas scikit-learn seaborn
Last synced: 02 Apr 2026