scikit-learn
scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.
- GitHub: https://github.com/topics/scikit-learn
- Wikipedia: https://en.wikipedia.org/wiki/Scikit-learn
- Repo: https://github.com/scikit-learn/scikit-learn
- Created by: David Cournapeau
- Released: January 05, 2010
- Related Topics: scikit, python,
- Aliases: sklearn,
- Last updated: 2026-06-25 00:23:58 UTC
- JSON Representation
https://github.com/sudothearkknight/15-machinelearningprojects
A curation of 15 Machine Learning projects in various fields that are helping me gain a better understanding of the different machine learning tools, techniques, algorithms and methodalogies.
classification-algorithm machine-learning machine-learning-algorithms natural-language-processing pycharm-ide python3 regression-models scikit-learn scikitlearn-machine-learning spam-detection
Last synced: 19 Jan 2026
https://github.com/farhad-here/predict_student_performance
Predict Student Performance, is a data analysis and machine learning project aimed at predicting students' final performance (g3) based on demographic, family, and academic features. The project supports both Regression (predicting exact grades) and classification (Pass/Fail categories).
classification data-analysis data-visualization linear-regression machine-learning numpy pandas postgresql powerbi scikit-learn streamlit
Last synced: 14 Apr 2026
https://github.com/darkdk123/customer-churn-prediction-innobytes
Predicting Customer churns as an Internship project at Innobytes services.
data-science python scikit-learn streamlit xgboost-classifier
Last synced: 14 Apr 2026
https://github.com/eesunmoon/machine_learning
[Spring 2021] Machine Learning
data-analysis kaggle machine-learning ml python scikit-learn sklearn
Last synced: 14 Apr 2026
https://github.com/navindafernando/iris
Hello world for ML :mushroom:
classification joblib k-nearest-neighbor machine-learning pandas scikit-learn
Last synced: 09 May 2026
https://github.com/cycle-sync-ai/student-score-analysis
A data-driven student performance analysis project using UCI dataset (396 students, 33 features). Implements machine learning models (K-means, PCA, Decision Tree, Random Forest, Linear Regression) to analyze academic patterns and predict student scores based on lifestyle, health, and study habits.
clustering clustering-algorithm decision-trees feature-engineering learning-management-system linear-regression machine-learning machine-learning-algorithms matplotlib numpy pandas pca pickle prediction prediction-algorithm scikit-learn score seaborn student
Last synced: 06 May 2026
https://github.com/hassan11196/churn-nn
A simple Churn Predictor using Scikit's Multi-Layer Perceptron Classifier
jupyter-notebook machine-learning ml neural-network python scikit-learn
Last synced: 14 Apr 2026
https://github.com/mecha-aima/fake-bills-detection
This Python project implements a simple classification model comparison using scikit-learn to classify banknotes as either "Authentic" or "Counterfeit" based on four features
classification-model machine-learning model-selection scikit-learn
Last synced: 27 Jan 2026
https://github.com/icepanorama/internship-visualizations-and-demonstrations
A collection of some of the programs that I've written over the course of my internship.
artificial-intelligence machine-learning matplotlib numpy pandas python3 pytorch scikit-learn
Last synced: 14 Apr 2026
https://github.com/lintangwisesa/pdb_mti_ui_lab1_k6
Tugas Lab 1 Pengelolaan Data Besar MTI UI 2023
machine-learning python3 scikit-learn
Last synced: 06 May 2026
https://github.com/mahdi-meyghani/movie-recommendation-system
A Python-based movie recommendation system utilizing popularity-based, content-based, and collaborative filtering models with data science and machine learning techniques.
data-analysis data-science machine-learning recommendation-system scikit-learn scikitlearn-machine-learning
Last synced: 23 Jan 2026
https://github.com/djdhairya/rooftop-solar-detection
data-processing data-science deep-learning eda machine-learning pandas scikit-learn tif
Last synced: 27 Jan 2026
https://github.com/1adityakadam/tweet-classification-using-nlp-techniques
This project classifies tweets as toxic or non-toxic using NLP and machine learning. It includes preprocessing, feature engineering, and models like Logistic Regression, Random Forest, and XGBoost on labeled tweet datasets. Technologies: Python, Pandas, NLTK, Scikit-learn, XGBoost.
nltk pandas python scikit-learn xgboost
Last synced: 05 May 2026
https://github.com/nikshithmenta/fake-news-detector
This repository contains a Streamlit web app designed for fake news detection. Users can input a news article, and the app will predict whether it's real or fake based on its content. It also allows users to choose between different vectorizers (TF-IDF or Bag of Words) and classifiers (Linear SVM or Naive Bayes) to customize the prediction model.
bag-of-words fake-news-detection linear-svc naive-bayes-classifier scikit-learn streamlit-application tf-idf
Last synced: 15 May 2026
https://github.com/juzershakir/student-intervention
Predicting the likelihood that a given student will pass or not!
adaboost classification f1-score gridsearchcv logistic-regression machine-learning python2 random-forest scikit-learn student-intervention supervised-learning
Last synced: 22 Oct 2025
https://github.com/gabrielmazzotta/nlp-clustering--movie-similarity-from-plot-summaries
A Python-based movie recommendation system leveraging NLP and clustering techniques. This project includes data processing, vectorization of plot summaries, and the implementation of recommendation algorithms to suggest similar movies based on user input.
clustering cosine-similarity hierarchical-clustering kmeans lemmatization nlp recommendation-engine scikit-learn similarity-score spacy tokenization
Last synced: 21 Jan 2026
https://github.com/shubhamsoni98/prediction-with-binomial-logistic-regression
To predict client subscription to term deposits and optimize marketing strategies by identifying potential subscribers.
binomial data data-science eda machine-learning matplotlib pipeline python scikit-learn seaborn sklearn sql visualization
Last synced: 06 Feb 2026
https://github.com/trilokida/named-entity-recognition-and-classification
Given a string statement, the aim is to identify the B-Protein entity in the statement.
bagging-ensemble classification dictvectorizer ensemble-learning machine-learning naive-bayes-classifier named-entity-recognition ner random-forest scikit-learn
Last synced: 17 May 2026
https://github.com/haseeeb21/machine-learning-models
Machine Learning Models trained on Scikit-learn datasets. This repository contains the code files and saved models trained on Toy datasets (Classification & Regression), and Real World dataset.
anaconda classification classification-models jupyter-notebook knn knn-classification machine-learning machine-learning-algorithms python3 regression regression-models scikit-learn scikit-learn-python scikitlearn-machine-learning svm svm-classifier vscode
Last synced: 07 May 2026
https://github.com/chaakshay/heartdrive
A Streamlet-based tool that analyzes cardiovascular health data, predicts population risk using ML, and suggests targeted government actions like awareness campaigns, health checkups, and policy changes.
csv mathplotlib numpy pandas pandas-dataframe python scikit-learn seaborn streamlit
Last synced: 05 Apr 2026
https://github.com/ricardorobledo/paymentcardfrauddetection2025
Comparative analysis of probabilistic classification models for credit card fraud detection, focusing on model calibration and threshold optimization in highly imbalanced datasets.
imbalanced-learn matplotlib numpy pandas python3 scikit-learn search
Last synced: 14 Apr 2026
https://github.com/stewartpark/sklearn2gem
⚡ sklearn2gem ports your scikit-learn model into a fast ruby C binding!
ruby rubygem scikit-learn sklearn
Last synced: 01 Mar 2026
https://github.com/messierandromeda/sentiment-analysis
Sentiment analysis with the IMDB movie review dataset.
imdb-dataset python scikit-learn sentiment-analysis
Last synced: 28 Jan 2026
https://github.com/afkewolczyk/data_science_bootcamp
A data science project to learn data science essentials such as: pandas, Matplotlib, Scikit learn
ai data-science machine-learning pandas scikit-learn
Last synced: 07 May 2026
https://github.com/sudarshanc00/smishing
This project aims to classify text messages to detect potential smishing (SMS phishing) attacks. Using machine learning, the project provides a classifier that can differentiate between legitimate messages and smishing attempts, helping to prevent scams.
nltk numpy pandas python scikit-learn scipy
Last synced: 14 Apr 2026
https://github.com/soumyapro/parkinson-disease-prediction
This project predicts Parkinson's disease using machine learning models.
logistic-regression numpy pandas scikit-learn svc xgboost
Last synced: 19 Jan 2026
https://github.com/bilgenurbekar/turkishcyberbullying
Contains fine-tuned BERT models and results in the text classification category using Turkish social media data
bert-fine-tuning huggingface-transformers matplotlib numpy pandas python pytorch scikit-learn transformers
Last synced: 07 Mar 2026
https://github.com/kartheekdama/salary-prediction
This salary prediction model leverages machine learning techniques, including Random Forest, Decision Tree, and Linear Regression, to estimate salaries based on individual attributes such as age, gender, education level, job title, and years of experience. The Random Forest model outperforms the others, achieving the highest R-squared score.
decision-tree exploratory-data-analysis feature-importance linear-regression machine-learning random-forest scikit-learn
Last synced: 06 May 2026
https://github.com/alexliap/sk_serve
Deployment of a Scikit-Learn model and it's column transformations made easy.
machine-learning mlops model-deployment scikit-learn
Last synced: 24 Oct 2025
https://github.com/pratanup/linear-regression-from-scratch-in-python
Linear Regression from Scratch in Python
algorithms data-science evaluation linear-regression machine-learning machine-learning-algorithms programming python scikit-learn
Last synced: 06 May 2026