scikit-learn
scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.
- GitHub: https://github.com/topics/scikit-learn
- Wikipedia: https://en.wikipedia.org/wiki/Scikit-learn
- Repo: https://github.com/scikit-learn/scikit-learn
- Created by: David Cournapeau
- Released: January 05, 2010
- Related Topics: scikit, python,
- Aliases: sklearn,
- Last updated: 2026-06-25 00:23:58 UTC
- JSON Representation
https://github.com/cserajdeep/elm-python-iris
Different Python implementations of Extreme Learning Machine (ELM) on Iris dataset
ann elm iris python scikit-learn
Last synced: 29 Apr 2026
https://github.com/jarif87/text-key-extractor
A Django web app that uses TF-IDF to extract keywords from text, featuring a modern, responsive UI with animated gradients and glassmorphism.
django-application keywords-extraction pandas python scikit-learn
Last synced: 29 Apr 2026
https://github.com/vaishnavijain25/pca-based-digit-classification
A machine learning project that uses Principal Component Analysis (PCA) for dimensionality reduction and Logistic Regression for classifying handwritten digit images from the scikit-learn digits dataset.
digit-recognition dimensionality-reduction image-classification logistic-regression machine-learning pca-analysis scikit-learn
Last synced: 29 Apr 2026
https://github.com/christopherkindl/spotify-artist-success
Predicting artists’ success by using machine learning approaches on features identified in spotify data
Last synced: 29 Apr 2026
https://github.com/m-muecke/text-normalizer
Text normalizer integration for sklearn.pipeline.Pipeline class
Last synced: 29 Apr 2026
https://github.com/adnanrahin/sentiment_classification_logistic_regeression
Sentiment Analysis extracts subjective information in the source material. It's widely used in modern business, to understand the business module, product quality and consumer point of view regarding the products or the business.
logistic-regression machine-learning natural-language-processing preprocessing python3 scikit-learn
Last synced: 29 Apr 2026
https://github.com/saikumar787/car_price_prediction_using_linear-regression
A machine learning project to predict the selling price of used cars using regression techniques. Includes data preprocessing, model training, evaluation, and testing on new data.
car-price-prediction-with-machine-learning data-analysis joblib jupiter-notebook linear-regression-models model-deployment python scikit-learn standardscaler
Last synced: 29 Apr 2026
https://github.com/inclinedadarsh/regression-metrics
A simple jupyter notebook demonstrating how to use different metrics from 'scikit-learn' library.
jupyter-notebook machine-learning notebook scikit-learn
Last synced: 29 Apr 2026
https://github.com/kizman-23/recommendations
For You Algorithm System
cosine-similarity scikit-learn vectorizers
Last synced: 29 Apr 2026
https://github.com/yousefalaaali/twitter-sentiment-analysis
Natural language processing - Twitter Sentiment Analysis
deep-learning logistic-regression matplotlib naive-bayes nltk numpy pandas scikit-learn seaborn tf-idf word-embeddings
Last synced: 29 Apr 2026
https://github.com/pdoup/ml-codes
Python source files and notebooks for the Machine Learning course weekly tasks
Last synced: 29 Apr 2026
https://github.com/karimosman89/energy-consumption-forecasting
Predict future energy consumption based on historical data.Create a model that predicts energy consumption in households or businesses to optimize energy distribution and reduce costs.Assist energy companies in planning and managing supply efficiently.
arima lstm matplotlib pandas python scikit-learn
Last synced: 29 Apr 2026
https://github.com/henriqueotogami/imersao-dados-3-alura
Terceira edição da Imersão Dados da Alura (03 a 07/05/21). O projeto dessa edição foi inspirado em um desafio do Laboratory Innovation Science at Harvard disponibilizado no Kaggle.
alura bioinformatics data-science drug-discovery google-collab harvard-university imersaodados jupyter-notebook kaggle-challenge laboratory-innovation-science matplotlib pandas python3 scikit-learn seaborn
Last synced: 29 Apr 2026
https://github.com/karmaniket/gtavcontrol
created dataset using different hand gestures and trained the ML model for in-game real time control for GTA V. Have fun!
gaming gta5 machine-learning mediapipe opencv python3 scikit-learn
Last synced: 29 Apr 2026
https://github.com/mertafacan/fertilizer-prediction-kaggle-playground-s05e06
Top 9% in Kaggle Playground Series - Predicting Optimal Fertilizers - Season 5, Episode 6
catboost kaggle kaggle-competition machine-learning optuna scikit-learn xgboost
Last synced: 29 Apr 2026
https://github.com/xbants/recommendation-api
🎬 Intelligent movie recommendation system with FastAPI backend, Streamlit frontend, and collaborative filtering ML. Rate movies, get personalized suggestions, and enjoy automatic model retraining.
fastapi machine-learning movie-recommedation python3 scikit-learn streamlit
Last synced: 29 Apr 2026
https://github.com/shahzadmustafa15/credit-card-fraud-detection
Credit card fraud detection using Random Forest with Stratified K-Fold cross-validation and F1-score evaluation.
classification confusion cross-validation f1-score fraud-detection imbalanced-data kaggle machine-learning python random-forest scikit-learn
Last synced: 29 Apr 2026
https://github.com/hexbyte-lab/resumatch
AI-powered resume-to-job matching tool with NLP analysis | Python + Flask + Machine Learning
cosine-similarity flask job-search machine-learning nltk portfolio-project python resume scikit-learn tfidf
Last synced: 29 Apr 2026
https://github.com/fatulm/house-price-prediction
House Price Prediction
data-science deep-learning keras machine-learning python regression scikit-learn
Last synced: 29 Apr 2026
https://github.com/jarif87/dna-based-identification-of-e.coli
Django web app predicting E. coli in DNA sequences using a machine learning model, with a responsive interface and client-side validation. Files generated by project.py.
classification django-application dna-sequences html-css-javascript mlp-classifier python3 scikit-learn
Last synced: 29 Apr 2026
https://github.com/abhinav330/instagram-influencers-analysis
This Jupyter Notebook focuses on preprocessing and visualizing data from an Instagram profiles dataset. It includes data loading, inspection, visualization, and some data preprocessing steps.
data data-science data-visualization exploratory-data-analysis exploratory-data-visualizations influncer-products instagram scikit-learn sklearn
Last synced: 08 Jun 2026
https://github.com/rishi-sutar/healwise-ai-your-way-to-wellness
Healwise-AI is a health diagnostic tool that uses a Support Vector Classifier (SVC) model to predict diseases based on user-reported symptoms. After predicting, it offers detailed health advice, including descriptions, diets, medications, and workouts related to the diagnosis.
machine-learning scikit-learn support-vector-machine
Last synced: 30 Apr 2026
https://github.com/ledsouza/machine-learning-semisupervisionado
Este projeto utiliza algoritmos de aprendizado de máquina semi-supervisionado para classificar a qualidade do leite como alta, média ou baixa.
data-science joblib machine-learning machine-learning-algorithms pandas python scikit-learn
Last synced: 30 Apr 2026
https://github.com/das-debjit/emotion-detection
A simple ML-powered web app for real-time emotion detection from text using Streamlit and TF-IDF-based classification.
machine-learning nlp python scikit-learn sentiment-analysis streamlit text-classification tfidf web-app
Last synced: 30 Apr 2026
https://github.com/pramodyasahan/grade-predictor
This project aims to predict student performance based on various features such as job, study time, failures, absences, and first and second period grades. The project utilizes a linear regression model from the scikit-learn library in Python.
machine-learning matplotlib numpy pandas python regression scikit-learn
Last synced: 30 Apr 2026
https://github.com/sayed-ashfaq/delhivery-dataanalysis
In this project, I conducted basic analysis, feature engineering, normalization, and outlier handling, along with statistical and non-parametric testing to extract insights.
feature-engineering normalization outlier-detection pandas python scikit-learn statistcal-tests statistical-analysis
Last synced: 30 Apr 2026
https://github.com/fikri-rouzan/burnaway-capstone-data-science
Dashboard analitik interaktif untuk memetakan faktor fisik dan pola kerja pemicu burnout pada software developer.
jupyter-notebook matplotlib pandas pillow plotly python scikit-learn seaborn statsmodels streamlit
Last synced: 08 Jun 2026
https://github.com/ttsudipto/recurrence-pred-genomics
ML-based prediction of NSCLC recurrence with gene expression data
boruta gene-expression imbalanced-learn machine-learning mcfs multilayer-perceptron non-small-cell-lung-cancer python r random-forest recurrence-prediction rna-seq scikit-learn smote support-vector-machine
Last synced: 30 Apr 2026
https://github.com/kumailn/machinelearning
Machine learning with Python
machine-learning python scikit-learn tensorflow
Last synced: 30 Apr 2026
https://github.com/boladjivinny/fire-prediction
Notebook for the Fire fighting using data on Zindi. Ranked number 5 on the public leaderboard and 8 on the private leaderboard. https://zindi.africa/hackathons/cmu-africa-fighting-fire-with-data
feature-engineering hackhathon machine-learning regression scikit-learn stacking
Last synced: 30 Apr 2026