scikit-learn
scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.
- GitHub: https://github.com/topics/scikit-learn
- Wikipedia: https://en.wikipedia.org/wiki/Scikit-learn
- Repo: https://github.com/scikit-learn/scikit-learn
- Created by: David Cournapeau
- Released: January 05, 2010
- Related Topics: scikit, python,
- Aliases: sklearn,
- Last updated: 2026-06-25 00:23:58 UTC
- JSON Representation
https://github.com/ry4ngch/dsml
Data Science and Machine Learning Repository
data-science data-science-projects jupyter-notebook knn-classification machine-learning neural-network python regression scikit-learn supervised-learning unsupervised-learning
Last synced: 16 Apr 2026
https://github.com/sahiltiwariiii/dssp
Predicting student math scores ! This project utilizes advanced machine learning techniques and MLOps tools like DVC and MLflow to predict a student's math score based on various factors such as gender, race/ethnicity, parental level of education, lunch type, test preparation course, writing etc
docker dotenv dvc flask machine-learning mlflow mlops mysql mysql-connector-python numpy pandas pymysql python python-dotenv scikit-learn seaborn sklearn-library statistics streamlit
Last synced: 27 Mar 2026
https://github.com/pranav-nagpure/plant-disease-prediction-nb
IPython Notebook to build the model for Plant Disease Prediction
efficientnet image-augmentation ipython-notebook keras-tensorflow machine-learning scikit-learn
Last synced: 15 Feb 2026
https://github.com/msikorski93/detecting-panic-disorder
Panic disorder detecting using machine learning techniques.
artificial-neural-networks classification knn logistic-regression machine-learning panic-disorder random-forest scikit-learn sgd svm tensorflow xgboost
Last synced: 08 May 2026
https://github.com/bkamapantula/discover
Code search utility to assist developer workflows via code discovery. Currently uses TF-IDF estimator.
developer-tools python scikit-learn tf-idf
Last synced: 16 Apr 2026
https://github.com/smuralee/machine-learning-samples
Machine learning samples
Last synced: 15 Feb 2026
https://github.com/supershivam5/python_projects
π» Python programming with Numpy, Pandas, Matplotlib.π Love exploring new technologies. Check out my projects!
matplotlib-pyplot numpy pandas scikit-learn seaborn
Last synced: 17 Apr 2026
https://github.com/lingumd/credit_risk_analysis
Machine learning models for predicting credit risk in LendingClub dataset.
balancedrandomforestclassifier classification-report cluster-centroids-undersampling confusion-matrix easyensembleclassifier get-dummies google-colab imbalanced-learn machine-learning matplotlib-pyplot numpy pandas pathlib randomoversampler scikit-learn smote smoteenn
Last synced: 17 Apr 2026
https://github.com/abhishek-k-git/soulmegle
expressjs flask html nodejs numpy python3 reactjs scikit-learn socket tailwindcss
Last synced: 01 Apr 2026
https://github.com/zenklinov/regression_logistic_-_sentiment_analysis_movie_data
This repository contains code for performing sentiment analysis using scikit-learn and logistic regression
llm natural-language-processing nlp nltk scikit-learn sentiment-analysis
Last synced: 10 May 2026
https://github.com/danicc097/python-ml-app
Various [arguably useless] Machine Learning services with gRPC and OpenTelemetry for demo purposes
grpc-python opentelemetry scikit-learn
Last synced: 17 Apr 2026
https://github.com/iamwatchdogs/cardiovascular-risk-prediction
This mini-project uses machine learning algorithms to predict possible risks of heart disease by analyzing given data.
jupyter-notebook machine-learning-algorithms matplotlib numpy pandas python scikit-learn seaborn
Last synced: 02 Apr 2026
https://github.com/satyas567/weatherdataanalysis
Comprehensive Weather Data Analysis with Python: Explore trends, visualize patterns, detect outliers, and predict temperature using humidity and wind speed
jupyter-notebook linear-regression matplotlib numpy pandas python scikit-learn seaborn
Last synced: 02 Apr 2026
https://github.com/cinodon/outlier-detection-saas-tfg
Outlier Detection System in SaaS Access Data
anomaly-detection clustering outlier-detection saas scikit-learn unsupervised-learning
Last synced: 13 Feb 2026
https://github.com/isshiki/machine-learning-with-python
ι£θΌγPythonγ§ε¦γΆγζ©ζ’°ε¦ηΏγε ₯ιγοΌοΌ ITοΌγ§δ½Ώη¨γγγγΌγγγγ―γι εΈγγγ¦γγγͺγγΈγγͺγ§γγ
data-science machine-learning machinelearning-python python scikit-learn
Last synced: 17 Apr 2026
https://github.com/felixamaladhas/amazon-reviews-sentiment-analysis
This is a sentiment analysis project that classifies Amazon product reviews as positive or negative using machine learning techniques.
matplotlib numpy pandas python scikit-learn
Last synced: 02 Apr 2026
https://github.com/otuemre/obesity-classification
Machine learning project to classify obesity levels based on health metrics like age, sex, height, weight, and BMI.
classification data-science healthcare machine-learning obesity-classification scikit-learn
Last synced: 17 Apr 2026
https://github.com/gregoritsch3/dl_cv_e2e_potatodiseaseclassification
A guided CodeBasics Deep Learning Project where a Convolutional Model is deployed onto a Website (FastAPI) and Mobile App (React Native, Google Cloud). Its purpose is the classification of potato plant images into "healthy", "Early Blight" and "Late Blight" categories.
cnn-classification gcp model-deployment scikit-learn tensorflow
Last synced: 08 May 2026
https://github.com/sergeimakarovv/energy-data-analytics-ml
Analyzing global data on sustainable energy, predicting CO2 emissions per capita
machine-learning pandas plotly python scikit-learn streamlit
Last synced: 12 Feb 2026
https://github.com/selcia25/iris-dataset-classification
βThis repository contains a Python script for classifying the Iris dataset using the Random Forest algorithm.
data-processing iris-classification pandas random-forest-classifier scikit-learn
Last synced: 16 Apr 2026
https://github.com/shahzadmustafa15/dbscan-clustering
DBSCAN clustering algorithm applied on synthetic non-linear data (make_moons dataset).
data-science data-visualization dbscan-clustering density-based-clustering machine-learning ml-projects python scikit-learn unsupervised-learning
Last synced: 08 May 2026
https://github.com/vyjayanthipolapragada/logistic_regression_classifier
Built a classifier using Logistic Regression model to classify different species of flowers
cancer-classification classification data-science flower-classification jupyter-notebook logistic-regression pandas python scikit-learn
Last synced: 17 Apr 2026
https://github.com/c2ramel/autonomous-semantic-discovery
An unsupervised machine learning engine that utilizes Non-negative Matrix Factorization (NMF) to autonomously extract and visualize latent semantic topics from the 20 Newsgroups dataset.
data-visualization machine-learning nlp nmf python scikit-learn unsupervised-learning
Last synced: 16 Apr 2026
https://github.com/shaharband/calcofi-oceanographic-analysis
This repository contains an analysis of the CalCOFI (California Cooperative Oceanic Fisheries Investigations) dataset, which represents one of the longest and most complete time series of oceanographic and larval fish data in the world.
pandas regression scikit-learn
Last synced: 10 May 2026
https://github.com/27ahmad/movie-recommendation-system
Welcome to the Movie Recommendation System! This project uses Streamlit to provide personalized movie recommendations based on user preferences and similarity.
movie-recommendation numpy pandas python scikit-learn
Last synced: 04 Apr 2026
https://github.com/justsecret123/nba-players-stats-analysis
A quick interactive Notebook to visualize some NBA players stats (points, assists, steals, blocks...) and totals, rankings and comparisons. Feel free to add any player in the .csv data files. π
csv ipython-notebook ipywidgets jupyter-notebook jupyterlab matplotlib pandas python scikit-learn seaborn
Last synced: 18 Apr 2026
https://github.com/arseniistratiuk/decision-tree-classifier-lab
A student lab project focused on tree algorithms and a Decision Tree classifier.
bellman-ford decision-tree-classifier floyd-warshall graph-algorithms kruskal-algorithm machine-learning networkx prim-algorithm scikit-learn
Last synced: 27 Feb 2026
https://github.com/abdul-rafay19/california-housing-price-prediction
This project predicts California housing prices using machine learning regression models, including Random Forests and Decision Trees. It covers data preprocessing, exploratory analysis, model training, and hyperparameter tuning to optimize performance.
decision-trees gridsearchcv linear-regression matplotlib numpy pandas python random-forest randomsearch-cv scikit-learn scipy seaborn
Last synced: 04 Apr 2026
https://github.com/kaladabrio2020/livro-ml-with-pytorch-and-sk
Progresso em cada capitulo
jupyter-notebook matplotlib-pyplot nump pandas python3 pytorch scikit-learn
Last synced: 04 Apr 2026
https://github.com/anushrey10/fuel_efficiency_predictor
Welcome to the Fuel Efficiency Predictor! This advanced tool uses machine learning to predict your vehicle's fuel efficiency based on various characteristics.
decision-tree gradient-boosting-classifier html-css-javascript linear-regression machile-learning matplotlib python random-forest scikit-learn tailwindcss
Last synced: 18 Apr 2026
https://github.com/hariprasath-v/machinehack-analytics-olympiad-2022
Create a machine learning model to help an insurance company understand which claims are worth rejecting and the claims which should be accepted for reimbursement.
catboost-classifier exploratory-data-analysis logloss machinehack numpy optuna pandas python scikit-learn shap
Last synced: 18 Apr 2026
https://github.com/sundarmd/breast-cancer-detection
Breast-Cancer-Detection is a machine learning project that utilizes logistic regression to predict whether a tumor is benign or malignant based on the Breast Cancer Wisconsin (Diagnostic) dataset. The project demonstrates data preprocessing, model training, and evaluation using the `scikit-learn` library.
logistic-regression machine-learning python scikit-learn
Last synced: 09 May 2026
https://github.com/brossend/automl_bank_project
Automated ML pipeline for the UCI Bank Marketing dataset: ETL, Optuna-based AutoML, model evaluation, MLflow logging, pytest tests, Docker, and CI/CD.
automl bank-marketing binary-classification ci-cd classification data-science docker docker-compose etl github-actions gitlab-ci machine-learning ml-pipeline mlflow model-monitoring optuna pytest python scikit-learn uci-dataset
Last synced: 02 Jun 2026