scikit-learn
scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.
- GitHub: https://github.com/topics/scikit-learn
- Wikipedia: https://en.wikipedia.org/wiki/Scikit-learn
- Repo: https://github.com/scikit-learn/scikit-learn
- Created by: David Cournapeau
- Released: January 05, 2010
- Related Topics: scikit, python,
- Aliases: sklearn,
- Last updated: 2026-06-23 00:27:46 UTC
- JSON Representation
https://github.com/suvasish114/house-price-estimation
A machine learning model that estimate housing prices in California using the California census data
jupyter-notebook machine-learning python scikit-learn
Last synced: 09 May 2026
https://github.com/bhoomikaniranjan/pulmotrainer
A Deep Learning-based Lung Cancer Detection application using a 3D CNN model with TensorFlow and OpenCV, featuring an interactive Tkinter GUI for easy data processing and training.
matplotlib numpy-pandas opencv python scikit-learn seaborn tensorflow-keras
Last synced: 09 May 2026
https://github.com/mpolinowski/fisher-discriminant-analysis
LDA is a widely used dimensionality reduction technique built on Fisher’s linear discriminant.
linear-discriminant-analysis matplotlib-pyplot python scikit-learn
Last synced: 10 May 2026
https://github.com/rudrakhp/ir-project-blog-recommender
machine-learning python scikit-learn
Last synced: 10 May 2026
https://github.com/hassanislam463/nyc_airbnb_eda
This project is a comprehensive data analysis of Airbnb listings in New York City, exploring pricing trends, seasonality effects, host market dynamics, rental preferences, and revenue estimation. It provides valuable insights for hosts, investors, and policymakers to optimize Airbnb operations and understand the short-term rental landscape in NYC.
exploratory-data-analysis matplotlib python scikit-learn seaborn
Last synced: 10 May 2026
https://github.com/djdhairya/student-attendance-management
folium matplotlib pandas scikit-learn
Last synced: 10 May 2026
https://github.com/vijaykumarr1452/ipl-first-innings-score-prediction-deployment
Deployment of IPL Score Prediction Analyser Model. https://github.com/vijaykumarr1452/IPL-First-Innings-Score-Prediction)
css deployment gunicorn html machine-learning ml predictive-analytics python scikit-learn
Last synced: 11 May 2026
https://github.com/anras5/criteo-search-data
EDA and statistical tests on CriteoSearchData dataset
data-science pandas scikit-learn statistics
Last synced: 11 May 2026
https://github.com/ananyagubba/bike-sharing-demand-prediction
Using machine learning techniques, the model learns from features such as weather conditions, time of day, season, and holiday information to forecast hourly or daily demand.
machine-learning python scikit-learn seaborn
Last synced: 11 May 2026
https://github.com/shubhamkarampure/asl-streamlit-signlingo
streamlit based web-app for teaching sign language through real-time hand gesture recognition.
learning-exercise mediapipe opencv-python python scikit-learn sign-language streamlit-webapp
Last synced: 12 May 2026
https://github.com/xunchiasg/nyc_property_sales
Exploratory Data Analysis of rolling property sales data in NYC from March 2023-2025
matplotlib-pyplot plotly python scikit-learn
Last synced: 12 May 2026
https://github.com/arjunan-k/medical_insurance
Project to analyze and forecast medical insurance costs of patients using data science framework.
medical-insurance scikit-learn tableau
Last synced: 12 Jun 2026
https://github.com/neelimabonangi/defect-detection-hot-rolling
Defect Detection in Hot Rolling Using Machine Learning
classification data-analysis data-science defect-detection jupyter-notebook machine-learning manufacturing numpy pandas predictive-analytics python random-forest scikit-learn
Last synced: 12 Jun 2026
https://github.com/latiefdatavisionary/scikit-learn-with-indonesia-belajar
scikit-learn scikit-learn-api scikit-learn-benchmarks scikit-learn-download scikit-learn-exercises scikit-learn-installer scikit-learn-ml scikit-learn-pipelines scikit-learn-python scikit-learn-tutorial scikitlearn-machine-learning
Last synced: 20 Jun 2026
https://github.com/royxlead/production-drift-detection
Production ML monitoring library - KL, PSI, MMD, and ADWIN drift detectors with empirical benchmarks, confidence tracking, and a 6-page FastAPI dashboard.
data-drift drift-detection fastapi kl-divergence mlops mmd model-monitoring production-ml psi pytorch scikit-learn uncertainty-quantification
Last synced: 23 Jun 2026
https://github.com/adadalshabab/human-stress-analysis-greadsearch-classifier
The project leverages data from physiological signals, self-reported surveys, behavioral observations, or other relevant sources to infer and analyze stress levels.
classification knn-classification machine-learning machine-learning-algorithms matplotlib pandas scikit-learn
Last synced: 09 May 2026
https://github.com/michael95-m/packaging-insurance-claim-model
Packaging regression model from scikit-learn
feature-engineering machine-learning python python-package scikit-learn
Last synced: 07 May 2026
https://github.com/z-fran/walmart-store-sales-forecasting
Data analysis and machine learning solution in Python for the Kaggle competition Walmart Recruiting - Store Sales Forecasting.
machine-learning sales-analysis sales-forecasting sales-prediction scikit-learn walmart-sales-forecasting
Last synced: 07 May 2026
https://github.com/saswatamcode/datascienceapi
This is a RESTful API built using Flask and Scikit-Learn. It provides a host of Classification and Regression algorithms that can be used readily and returns results in the form of predictions, confusion matrices, accuracy scores and more.
api flask ml python3 scikit-learn
Last synced: 07 May 2026
https://github.com/nicovandenhooff/wids-datathon-2022
This repository contains solution for the 2022 Women in Data Science Kaggle competition that I participated in, which obtained a top 10% leaderboard standing.
catboost data-visualization datascience energy-consumption ensemble-learning exploratory-data-analysis kaggle lightgbm machine-learning scikit-learn women-in-data-science xgboost
Last synced: 07 May 2026
https://github.com/jonperk318/machine-learning-in-python
ML models built from scratch in Python 3.9.13
classification clustering feature-extraction jupyter-notebook linear-classification linear-regression machine-learning mnist pca principal-component-analysis python scikit-learn
Last synced: 07 May 2026
https://github.com/jennynzhuang/bootstrap_ml_model_evaluation
Enhancing ML Model Evaluation with Bootstrapping
bootstrapping computational-statistics jupyter-notebook machine-learning python scikit-learn
Last synced: 07 May 2026
https://github.com/andrewsy1004/linear-regression-model-for-house-price-prediction
A linear regression model to predict house prices based on features like size, location, and number of rooms. This project demonstrates the application of machine learning in real estate price estimation
linear-regression python scikit-learn xgbregressor
Last synced: 07 May 2026
https://github.com/henrytseng/example_docker_scikit-learn
A quick example of using Scikit-Learn from a Docker container
Last synced: 08 May 2026
https://github.com/anusha-me/disease-x-detection-ml-project
A machine learning classification system for early detection of Disease X based on patient symptoms using Python, Scikit-learn, and Streamlit.
classification data-science disease-prediction healthcare-ai machine-learning medicaldata scikit-learn streamlit
Last synced: 08 May 2026
https://github.com/samjoesilvano/password_strength_prediction_using_nlp
Developed a predictive model to categorize passwords as Strong, Good, or Weak, enhancing security and reducing breach risks. The project involves cleaning and analyzing data from an SQL database, using the TF-IDF technique for transformation, and implementing a Logistic Regression model to achieve accurate classifications.
data-analysis data-classification data-cleaning data-visualization logistic-regression machine-learning natural-language-processing pandas password-security password-strength python scikit-learn sql tf-idf
Last synced: 08 May 2026
https://github.com/jatin-mehra119/churn_modeling
This repository is dedicated to predicting customer churn using machine learning techniques. It includes comprehensive scripts for data preprocessing, model training, and evaluation, along with detailed visualizations and insights.
classification-model datavisualization pandas scikit-learn
Last synced: 08 May 2026
https://github.com/msikorski93/detecting-panic-disorder
Panic disorder detecting using machine learning techniques.
artificial-neural-networks classification knn logistic-regression machine-learning panic-disorder random-forest scikit-learn sgd svm tensorflow xgboost
Last synced: 08 May 2026