scikit-learn
scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.
- GitHub: https://github.com/topics/scikit-learn
- Wikipedia: https://en.wikipedia.org/wiki/Scikit-learn
- Repo: https://github.com/scikit-learn/scikit-learn
- Created by: David Cournapeau
- Released: January 05, 2010
- Related Topics: scikit, python
- Aliases: sklearn
- Last updated: 2026-06-25 00:23:58 UTC
https://github.com/ejw-data/ml-playground
Testing the limitations, inabilities, and strengths of models with synthetic data
machine-learning python scikit-learn
Last synced: 06 May 2026
https://github.com/tharunkumar666/employee_attrition_prediction
Predict employee attrition using Logistic Regression. Use Python with Pandas and Scikit-learn to analyze factors like salary, satisfaction, and promotion history. The model classifies whether an employee will stay or leave, helping HR take proactive retention measures.
pandas python regression-models scikit-learn
Last synced: 06 May 2026
https://github.com/williyam-m/company-registration-trends
Utilized Linear Regression from scikit-learn to predict future company registration trends.
flask matplotlib numpy pandas-python scikit-learn
Last synced: 06 May 2026
https://github.com/lintangwisesa/pdb_mti_ui_lab1_k6
Lab Assignment 1, Big Data Management, MTI UI 2023
machine-learning python3 scikit-learn
Last synced: 06 May 2026
https://github.com/kartheekdama/salary-prediction
This salary prediction model leverages machine learning techniques, including Random Forest, Decision Tree, and Linear Regression, to estimate salaries based on individual attributes such as age, gender, education level, job title, and years of experience. The Random Forest model outperforms the others, achieving the highest R-squared score.
decision-tree exploratory-data-analysis feature-importance linear-regression machine-learning random-forest scikit-learn
Last synced: 06 May 2026
https://github.com/sahilmate/ebm-breast-cancer-classifier
This repository implements an Explainable Boosting Machine (EBM) model for breast cancer classification using scikit-learn and interpret. The project includes data preprocessing, model training, accuracy evaluation, and feature importance visualization.
breast-cancer-classification data-visualization explainable-boosting-machine feature-importance interpret machine-learning scikit-learn
Last synced: 06 May 2026
https://github.com/rafay-imraan/recommendation-system
A machine learning model that outputs personalized similar movie recommendations for people based on the ones they have rated positively.
machine-learning pandas python scikit-learn
Last synced: 06 May 2026
https://github.com/theweird-kid/ml-notes
Machine Learning Notes
machine-learning matplotlib numpy scikit-learn seaborn
Last synced: 06 May 2026
https://github.com/galaxy092/samsung-innovation-campus-big-data-capstone-project
Samsung Innovation Campus Big Data Capstone Project - Weather Prediction
hadoop jupyter-notebook pandas pyspark scikit-learn sparksql
Last synced: 06 May 2026
https://github.com/blacknahil/spam-detection
A simple web application for detecting spam messages using a machine learning model. The application is built using Flask and provides an interactive interface for users to input a message and get a prediction whether it is spam or ham along with the probability.
flask html-css-javascript pandas scikit-learn
Last synced: 06 May 2026
https://github.com/jbizzlefoshizzle/ibm_capstone_project
Used K-means clustering and mapping libraries to determine the best cities in San Diego to open a Mexican restaurant
beautifulsoup4 folium-maps geopy pandas-python scikit-learn
Last synced: 06 May 2026
https://github.com/superjarle/jarles-lekekasse
DataCamp's Data Science, RJDemetra, and more. Just for fun.
data-validation data-visualizations datacamp datacamp-course pandas rjdemetra scikit-learn scipy statistics
Last synced: 07 May 2026
https://github.com/eshrathaziz/heart-disease-risk-assessment
Predicting heart disease risk using machine learning for Healthcare Insights.
data-science jupyter-notebook learning machine python scikit-learn
Last synced: 07 May 2026
https://github.com/n1k1f0rm/car-price-predict
Predict a car's price from its characteristics
fastapi ml scikit-learn streamlit
Last synced: 07 May 2026
https://github.com/z-fran/walmart-store-sales-forecasting
Data analysis and machine learning solution in Python for the Kaggle competition Walmart Recruiting - Store Sales Forecasting.
machine-learning sales-analysis sales-forecasting sales-prediction scikit-learn walmart-sales-forecasting
Last synced: 07 May 2026
https://github.com/govind-prakash/machinelearning
A collection of my machine learning projects, tutorial exercises, algorithm implementations, and related code.
decision-trees gradientboostinclassifier linear-regression logistic-regression scikit-learn unsupervised-learning
Last synced: 07 May 2026
https://github.com/rishi035/advanced-house-price-predictions
This is my first project, which I also entered in a Kaggle competition
linear-regression machine-learning python random random-forest regressor-models scikit-learn
Last synced: 07 May 2026
https://github.com/pspanoudakis/machine-learning-nlp
NLP 🤖 📖 projects on Vaccine Sentiment Classification 💉 and Question Answering 💬
bert-fine-tuning glove-embeddings neural-networks pytorch question-answering rnn scikit-learn sentiment-classification softmax-regression squad
Last synced: 07 May 2026
https://github.com/sumdiboii/loan-prediction-decision-trees
A Decision Tree Classifier was implemented to predict personal loan acceptance using a dataset of 5,000 customers. Key features included income, education, mortgage, and credit card usage. The model achieved 97% accuracy, with 92% precision and 76% recall for positive loan predictions, validated using a classification report and confusion matrix.
classification data-visualisation decision-trees loan-prediction machine-learning python scikit-learn supervised-learning
Last synced: 07 May 2026
https://github.com/jonperk318/machine-learning-in-python
ML models built from scratch in Python 3.9.13
classification clustering feature-extraction jupyter-notebook linear-classification linear-regression machine-learning mnist pca principal-component-analysis python scikit-learn
Last synced: 07 May 2026
https://github.com/mwasifanwar/automl_framework
Comprehensive AutoML framework that automates data preprocessing, feature engineering, model selection, hyperparameter tuning, and deployment. Features neural architecture search and automated data cleaning pipelines.
automl automl-algorithms data-science data-science-projects feature-engineering feature-engineering-algorithm feature-engineering-ml hyperparameter-optimization machine-learning machine-learning-algorithms machine-learning-models mlops mlops-workflow python scikit-learn scikit-learn-python
Last synced: 07 May 2026
https://github.com/tedim52/discjockey
a content-based recommender system for your party playlist preferences
jupyter-notebook matplotlib pandas scikit-learn spotify-web-api
Last synced: 07 May 2026
https://github.com/jimmymugendi/bulding-a-decision-tree-to-predict-customer-churn
This repo describes building a decision tree to predict customer churn in a given organisation
accuracy-score decision-tree-classifier matplotlib-pyplot numpy pandas-dataframe scikit-learn
Last synced: 07 May 2026
https://github.com/cnoret/hexa-watts
Interactive data visualization and machine learning app for energy consumption analysis and prediction in France, built with Streamlit. (Text in French)
data-visualization electricity-forecasting energy-analysis france machine-learning scikit-learn streamlit
Last synced: 07 May 2026
https://github.com/ynsrc/machine-learning
Machine Learning Examples
classification data-science machine-learning machine-learning-algorithms matplotlib nlp nlp-machine-learning regression scikit-learn sklearn
Last synced: 07 May 2026
https://github.com/henrytseng/example_docker_scikit-learn
A quick example of using Scikit-Learn from a Docker container
Last synced: 08 May 2026
https://github.com/anusha-me/disease-x-detection-ml-project
A machine learning classification system for early detection of Disease X based on patient symptoms using Python, Scikit-learn, and Streamlit.
classification data-science disease-prediction healthcare-ai machine-learning medicaldata scikit-learn streamlit
Last synced: 08 May 2026
https://github.com/aravindnathan02/machine-learning-projects
Machine Learning and Deep Learning projects that mainly focus on predictive modeling.
deep-learning machine-learning neural-networks predictive-modeling python scikit-learn tensorflow
Last synced: 08 May 2026
https://github.com/samjoesilvano/password_strength_prediction_using_nlp
Developed a predictive model to categorize passwords as Strong, Good, or Weak, enhancing security and reducing breach risks. The project involves cleaning and analyzing data from an SQL database, using the TF-IDF technique for transformation, and implementing a Logistic Regression model to achieve accurate classifications.
data-analysis data-classification data-cleaning data-visualization logistic-regression machine-learning natural-language-processing pandas password-security password-strength python scikit-learn sql tf-idf
Last synced: 08 May 2026
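The TF-IDF plus Logistic Regression approach named in the password strength entry above could look roughly like the sketch below. It is an illustration only, not the repository's code: the passwords and labels are a hypothetical toy sample (the project's SQL data is not reproduced here), and character-level TF-IDF is an assumed choice that treats each password as a bag of characters.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Hypothetical toy sample, invented for illustration only.
passwords = [
    "12345", "password", "abc123", "qwerty",
    "Summer2023", "blueSky42", "Harbor2021", "greenTea77",
    "T7#kLp9!xQ2z", "v@9Lm$3Rt!8w", "Zq4&nB7*eH1s", "p#8Wd!5Kx@2m",
]
labels = ["Weak"] * 4 + ["Good"] * 4 + ["Strong"] * 4

# Character-level TF-IDF (assumed choice); case is kept because it
# carries information about password strength.
model = make_pipeline(
    TfidfVectorizer(analyzer="char", lowercase=False),
    LogisticRegression(max_iter=1000),
)
model.fit(passwords, labels)

# Classify an unseen password into one of the three categories.
prediction = model.predict(["hello1"])[0]
print(prediction)
```

With so few samples the prediction itself is not meaningful; the point is only the shape of the pipeline.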
https://github.com/prajjwal6969/recommender-system-using-python
A collection of content-based recommendation systems for songs and movies using Python and machine learning.
content-based-filtering cosine-similarity machine-learning movie-recommendation python recommender-system scikit-learn song-recommendation
Last synced: 08 May 2026
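Content-based recommendation with cosine similarity, as tagged in the entry above, can be sketched in a few lines. The catalogue below is hypothetical and the helper name `recommend` is an assumption for illustration; the repository's own features and data may differ.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

# Hypothetical catalogue: titles with short keyword descriptions.
titles = ["Space Wars", "Galaxy Quest", "Love in Paris", "Paris Romance"]
descriptions = [
    "space battle aliens starship",
    "starship crew aliens comedy space",
    "romance paris love story",
    "love story romance paris cafe",
]

# Turn descriptions into TF-IDF vectors and compare every pair of items.
tfidf = TfidfVectorizer().fit_transform(descriptions)
similarity = cosine_similarity(tfidf)


def recommend(index, top_n=1):
    """Return the titles most similar to the item at `index`."""
    ranked = similarity[index].argsort()[::-1]  # most similar first
    return [titles[i] for i in ranked if i != index][:top_n]


result = recommend(0)
print(result)
```

Items that share no vocabulary get a similarity of zero, so the ranking here is driven entirely by overlapping description terms.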
https://github.com/fareedkhan-dev/sales-linear-regression-sklearn
linear-regression machine-learning python scikit-learn
Last synced: 08 May 2026
https://github.com/msikorski93/detecting-panic-disorder
Panic disorder detection using machine learning techniques.
artificial-neural-networks classification knn logistic-regression machine-learning panic-disorder random-forest scikit-learn sgd svm tensorflow xgboost
Last synced: 08 May 2026
https://github.com/samkazan/fraud-detection-ml
Machine learning models for enhanced fraud detection in e-commerce transactions, exploring feature engineering, distance prediction, and clustering analysis.
clustering data-science data-visualization dataanalytics dbscan eda hierarchical-clustering kmeans-clustering knn-imputer matplotlib mlxtend python scikit-learn seaborn xgboost
Last synced: 08 May 2026
https://github.com/icejan/predicton-systems
Various systems that train on data and generate a prediction
lightfm machine-learning numpy python scikit-learn
Last synced: 08 May 2026
https://github.com/seyha1007/amazon-reviews-analysis
🧐 This project analyzes Amazon Fine Food Reviews to investigate whether negative reviews are more emotionally intense and lexically repetitive than positive ones. Using R, we apply sentiment analysis and lexical diversity metrics to uncover patterns in consumer review language.
acp amazon-reviews bert data-analytics glove jupyter-notebook lstm-sentiment-analysis machine-learning nltk random-forest scikit-learn sentiment-classification sentimental-analysis support-vector-machine
Last synced: 08 May 2026
https://github.com/shahzadmustafa15/dbscan-clustering
DBSCAN clustering algorithm applied on synthetic non-linear data (make_moons dataset).
data-science data-visualization dbscan-clustering density-based-clustering machine-learning ml-projects python scikit-learn unsupervised-learning
Last synced: 08 May 2026
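A minimal sketch of DBSCAN on the `make_moons` dataset, the setup described in the entry above. The sample size, noise level, `eps`, and `min_samples` values are assumptions chosen for illustration, not the repository's settings.

```python
from sklearn.cluster import DBSCAN
from sklearn.datasets import make_moons
from sklearn.preprocessing import StandardScaler

# Two interleaving half-circles: a shape that centroid-based methods handle poorly.
X, _ = make_moons(n_samples=300, noise=0.05, random_state=0)
X = StandardScaler().fit_transform(X)

# eps and min_samples are assumed values; they usually need tuning per dataset.
labels = DBSCAN(eps=0.3, min_samples=5).fit_predict(X)

# DBSCAN marks noise points with the label -1, so exclude it from the count.
n_clusters = len(set(labels)) - (1 if -1 in labels else 0)
print("clusters found:", n_clusters)
```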
https://github.com/sundarmd/breast-cancer-detection
Breast-Cancer-Detection is a machine learning project that utilizes logistic regression to predict whether a tumor is benign or malignant based on the Breast Cancer Wisconsin (Diagnostic) dataset. The project demonstrates data preprocessing, model training, and evaluation using the `scikit-learn` library.
logistic-regression machine-learning python scikit-learn
Last synced: 09 May 2026
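The workflow in the entry above (preprocessing, training, and evaluation with logistic regression on the Breast Cancer Wisconsin (Diagnostic) dataset) might be sketched as follows. This uses the copy of the dataset bundled with scikit-learn; the split ratio, scaling step, and `random_state` are assumptions, not details taken from the repository.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# In this bundled dataset, target 0 is malignant and 1 is benign.
X, y = load_breast_cancer(return_X_y=True)

# Hold out 20% of the samples, keeping the class ratio the same in both splits.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42
)

# Scaling inside the pipeline means the scaler is fit on training data only.
model = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))
model.fit(X_train, y_train)

accuracy = model.score(X_test, y_test)
print(f"test accuracy: {accuracy:.3f}")
```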
https://github.com/aasjunior/mlapp-api
This API provides endpoints for applying machine learning algorithms such as K-Nearest Neighbors (KNN), Decision Tree, and Genetic Algorithm. Completed as an assignment for the Mobile Laboratory/Natural Computing course in the 5th semester of Multiplatform Software Development.
fastapi machine-learning python scikit-learn
Last synced: 09 May 2026
https://github.com/vijaykumarr1452/customer-churn-prediction
Analysis of a telecom company's data, with insights gained to reduce customer churn.
anaconda jupyter-notebook machine-learning pandas prediction scikit-learn
Last synced: 09 May 2026
https://github.com/radoslawregula/iris-classification
Jupyter notebook implementing an efficient machine learning method to classify flowers from the Iris data set.
classification iris-dataset jupyter-notebook machine-learning python scikit-learn softmax-classifier
Last synced: 09 May 2026
https://github.com/navindafernando/heart-risk-prediction-model
Heart Risk Level Predicting Regression Model 💔
joblib linear-regression machine-learning pandas r-squared regression-models scikit-learn supervised
Last synced: 09 May 2026
https://github.com/mayankanand007/yfraud
Credit card fraud detection platform using scikit-learn and xgboost 💳
knearest-neighbor-algorithm linear-regression machine-learning predictive-analytics python3 scikit-learn svm xgboost
Last synced: 09 May 2026
https://github.com/peterchain/titanic
Script for the Titanic dataset for evaluating which passengers survived
kaggle machine-learning pandas-dataframe python3 scikit-learn
Last synced: 09 May 2026
https://github.com/otuemre/viginids
VigiNIDS: A machine learning-based system for detecting malicious network traffic using the UNSW-NB15 dataset. It distinguishes between normal and attack activities, providing a data-driven approach to network security.
classification cybersecurity intrusion-detection-system machine-learning network-intrusion-detection python scikit-learn unsw-nb15 xgboost
Last synced: 09 May 2026
https://github.com/roggersanguzu/tomato-disease-detector
This project uses transfer learning with MobileNetV2 to accurately classify tomato leaf diseases including Mosaic Virus, Septoria Leaf Spot, Blight, and Healthy leaves.
deep-learning python scikit-learn transfer-learning
Last synced: 09 May 2026
https://github.com/callmerajesh/ames-housing-price-prediction
Predicting house prices using Decision Tree Regressor on the Ames dataset
ames-housing data-science decision-tree machine-learning python regression scikit-learn
Last synced: 09 May 2026
https://github.com/saahilanande/naivebayes
Implementing a Naive Bayes classifier from scratch for sentiment analysis of the IMDB dataset
machine-learning naive-bayes-classifier python-3 scikit-learn
Last synced: 09 May 2026
https://github.com/malisha4065/flightdelaypredictiongroup99
This project focuses on predicting flight delays in the United States domestic air traffic system from more than 500,000 records using machine learning techniques. Leveraging a dataset from the Bureau of Transportation Statistics for the year 2020, we aim to develop a predictive model that can anticipate flight delays with 93.1% accuracy.
k-nearest-neighbors machine-learning python scikit-learn support-vector-machine
Last synced: 09 May 2026
https://github.com/jaswanthv99/basic_ml-model_understanding
This project explains basic ML models (KNN, Naive Bayes, Logistic Regression, SVM, and a neural network)
matplotlib-python pandas-python scikit-learn tensorflow
Last synced: 09 May 2026
https://github.com/piras-s/braincancerclassifier
Classifying brain tumors using Gaussian Naive Bayes with MRI-derived features. Includes feature selection, model evaluation, prediction uncertainty, and probability calibration.
baysian-inference calibrated-classification classification data-visualization feature-selection machine-learning medical-imaging naive-bayes-classifier python scikit-learn uncertainty-estimation
Last synced: 09 May 2026
https://github.com/njaffe/eda_example_2025
Sample end-to-end data analysis walkthrough using Python and Scikit-learn.
data-science data-visualization jupyter-notebooks machine-learning python regression scikit-learn
Last synced: 09 May 2026
https://github.com/vivprime/diabetes-prediction-system
MERISKILL INTERNSHIP: Predicting whether an individual has diabetes or not
Last synced: 09 May 2026
https://github.com/bhoomikaniranjan/pulmotrainer
A Deep Learning-based Lung Cancer Detection application using a 3D CNN model with TensorFlow and OpenCV, featuring an interactive Tkinter GUI for easy data processing and training.
matplotlib numpy-pandas opencv python scikit-learn seaborn tensorflow-keras
Last synced: 09 May 2026
https://github.com/ledsouza/reg-linear
Linear regression study project
data-science data-visualization pandas pickles scikit-learn seaborn sklearn vitrinedev
Last synced: 10 May 2026
https://github.com/mpolinowski/fisher-discriminant-analysis
LDA is a widely used dimensionality reduction technique built on Fisher’s linear discriminant.
linear-discriminant-analysis matplotlib-pyplot python scikit-learn
Last synced: 10 May 2026
https://github.com/naufal-yafi/text-mining-nb.model
Text mining using Naive Bayes algorithm
clasification data-science machine-learning naive-bayes-algorithm python3 scikit-learn streamlit-webapp text-mining
Last synced: 10 May 2026
https://github.com/rudrakhp/ir-project-blog-recommender
machine-learning python scikit-learn
Last synced: 10 May 2026
https://github.com/macdon112/credit-card-fraud-detection
Comparing ML models (Random Forest, KNN, Decision Tree) for credit card fraud detection using SMOTE and stratified cross-validation.
classification data-analysis fraud-detection imbalanced-data machine-learning python scikit-learn
Last synced: 10 May 2026
https://github.com/aneeshmurali-n/nlp-emotion-classification-in-text
Develop machine learning models to classify emotions in text samples.
bag-of-words data emotion-classification feature-extraction machine-learning naive-bayes natural-language-processing nlp nltk preprocessing python scikit-learn svm text-classification tf-idf tokenizer vectorizer
Last synced: 10 May 2026
https://github.com/hassanislam463/nyc_airbnb_eda
This project is a comprehensive data analysis of Airbnb listings in New York City, exploring pricing trends, seasonality effects, host market dynamics, rental preferences, and revenue estimation. It provides valuable insights for hosts, investors, and policymakers to optimize Airbnb operations and understand the short-term rental landscape in NYC.
exploratory-data-analysis matplotlib python scikit-learn seaborn
Last synced: 10 May 2026
https://github.com/djdhairya/student-attendance-management
folium matplotlib pandas scikit-learn
Last synced: 10 May 2026
https://github.com/afonsojramos/feup-iart
Projects developed for Artificial Intelligence class.
feup feup-iart iart neural-network python scikit-learn tensorflow
Last synced: 10 May 2026
https://github.com/tnleite/real-estate-opportunities-analysis
This repository presents an analysis of opportunities in the real estate market, combining time series, clustering, and forecasting to identify states with the greatest growth potential and guide efficient expansion strategies.
catboostregressor cluster-analysis data-science kmeans-clustering lightgbm-regressor machine-learning-algorithms numpy regression-models scikit-learn xgboost-regression
Last synced: 10 May 2026
https://github.com/i30101/mathworks2024
Coding tools for 2024 MathWorks Math Modeling Challenge
machine-learning mathematical-modelling python scikit-learn
Last synced: 10 Jun 2026
https://github.com/alphacrypto246/student-learning-style-prediction
An interactive web application built with Streamlit that predicts a student's preferred learning style (visual, auditory, or kinesthetic) using machine learning, aiding educators in personalizing teaching strategies.
machine-learning scikit-learn scikitlearn-machine-learning streamlit
Last synced: 11 May 2026
https://github.com/dtroupe18/statsfinalproject
Simple ML project using UCI dataset
abalone jupyter-notebook linear-regression machine-learning mathplotlib python3 scikit-learn uci-machine-learning
Last synced: 11 May 2026
https://github.com/monarch1108/customerinsights-kmeans
Understanding customers using KMeans and RFM (recency, frequency, and monetary) analysis
data-analysis data-visualization kmeans-clustering machine-learning matplotlib numpy pandas scikit-learn
Last synced: 11 May 2026
https://github.com/pdoup/atml-notebooks
Proposed assignment notebooks for Advanced Topics in Machine Learning tasks
active-learning cost-sensitive-learning imbalanced-data machine-learning multi-instance-learning multi-label-classification numpy scikit-learn
Last synced: 11 May 2026
https://github.com/theladev/machine-learning
This repository is focused on showing my personal projects and interests in Machine Learning and Data Science. Hope you enjoy it.
data-science machine-learning machine-learning-models pandas python scikit-learn
Last synced: 11 May 2026
https://github.com/johannesvc/data-science-portfolio
A curated portfolio of applied data science projects focused on machine learning, NLP, and social impact.
academic-portfolio data-science deep-learning keras machine-learning media-bias nlp pandas scikit-learn
Last synced: 11 May 2026
https://github.com/cplaza0997/py-ml
Machine learning
clustering linear-regression logistic-regression ml pyspark python scikit-learn sparkml
Last synced: 11 May 2026
https://github.com/sharvesh1401/inverse-design-patch-antenna
A machine learning approach to the inverse design of microstrip patch antennas by predicting optimal physical dimensions from desired performance metrics.
antenna-design deep-learning engineering-project gradio jupyter-notebook machine-learning patch-antenna python regression-model scikit-learn
Last synced: 11 May 2026
https://github.com/antonio-f/k_nearest_neighbors
Quick k-nearest neighbors example
easy k-nearest-neighbors knn machine-learning matplotlib python scikit-learn visualization
Last synced: 11 May 2026
https://github.com/cptanalatriste/copycat-detector
A Naive-Bayes classifier for detecting plagiarism.
amazon-sagemaker naive-bayes-classifier scikit-learn
Last synced: 12 May 2026
https://github.com/capsuleismail/rt-iot2022
RT-IoT2022 is a dataset obtained from a real-time IoT infrastructure. This project aims to compare the accuracy of three machine learning models: XGBoost and LGBMClassifier.
datascience jupyter-notebook machinelearning-python scikit-learn
Last synced: 12 May 2026
https://github.com/msikorski93/seed-clustering
Performing basic clustering on a seeds dataset.
agglomerative clustering dbscan gaussian-mixture-model gmm mini-batch-kmeans scikit-learn seeds
Last synced: 13 May 2026
https://github.com/johanneswiesner/skplot
A python package for extracting, plotting and reporting information from one or multiple sklearn classification & prediction pipelines.
plotting python scikit-learn sklearn visualization
Last synced: 14 May 2026
https://github.com/sedefkjamili/dengai-ml-prediction
Machine learning project for predicting dengue fever outbreaks using climate and environmental data.
data-science dengue gradient-boosting healthcare machine-learning python scikit-learn time-series
Last synced: 12 Jun 2026
https://github.com/muditnautiyal-21/mudra-ml
Glass-box autonomous data science in Python. Profiles data, builds leakage-safe pipelines, recommends and tunes models, and logs every decision behind the result.
automl classification clustering data-science explainable machine-learning pipeline python regression scikit-learn
Last synced: 12 Jun 2026
https://github.com/neelimabonangi/defect-detection-hot-rolling
Defect Detection in Hot Rolling Using Machine Learning
classification data-analysis data-science defect-detection jupyter-notebook machine-learning manufacturing numpy pandas predictive-analytics python random-forest scikit-learn
Last synced: 12 Jun 2026
https://github.com/tomdewildt/interactive-and-explainable-ai-design
Code for The Interactive And Explainable AI Design course of my master's degree
jupyter lime numpy pandas python scikit-learn shap
Last synced: 18 Jun 2026
https://github.com/rb-thompson/machine-learning-basics
Implement a machine learning pipeline on the Iris flower dataset.
data-preprocessing data-visualization model-training python scikit-learn
Last synced: 18 Jun 2026
https://github.com/jayemscript/lab-to-code
A complete Python learning roadmap for scientists and researchers — covering data science, biology, chemistry, physics, and mathematics with curated libraries, tools, and resources.
bioinformatics chemistry data-science jupyter-notebook machine-learning mathematics numpy pandas physics python research roadmap scientific-computing scikit-learn
Last synced: 19 Jun 2026
https://github.com/hyoaru/anonalyze-process
A machine learning project, part of the anonalyze initiative, focused on sentiment and emotion classification through Natural Language Processing (NLP). Utilizing a Multinomial Naive Bayes classifier, this system analyzes Twitter data to predict sentiments (positive, negative, neutral) and emotions (sadness, joy, love, anger, fear, surprise).
feature-engineering machine-learning multinomial naive-bayes natural-language-processing python scikit-learn tf-idf
Last synced: 19 Jun 2026
https://github.com/royxlead/production-drift-detection
Production ML monitoring library - KL, PSI, MMD, and ADWIN drift detectors with empirical benchmarks, confidence tracking, and a 6-page FastAPI dashboard.
data-drift drift-detection fastapi kl-divergence mlops mmd model-monitoring production-ml psi pytorch scikit-learn uncertainty-quantification
Last synced: 23 Jun 2026
https://github.com/nafis2508/maternal-neonatal-outcome-prediction
Predicting Maternal and Neonatal Birth Outcomes using Machine Learning on 61,018 Healthcare Records from Kenya and Uganda
data-science decision-tree eda healthcare-ai healthcare-analytics machine-learning maternal-health predictive-modeling python random-forest scikit-learn
Last synced: 24 Jun 2026
https://github.com/josepablodmg/python--linear-regression---housing-exercise
A predictive analysis exploring the relationship between household characteristics and median income in California. Using linear regression, the project investigates whether blocks with fewer households correspond to higher median incomes.
california data-analysis data-science exploratory-data-analysis housing-data linear-regression machine-learning python regression scikit-learn statistics visualization
Last synced: 05 Oct 2025
https://github.com/imosudi/unsupervised-ml-kmeans-analysis
K-Means clustering analysis using synthetic datasets generated with scikit-learn, including meshgrid visualisation, silhouette score evaluation, and investigation of cluster count and random seed effects.
clustering data-analysis jupyter-notebook kmeans kmeans-clustering machine-learning matplotlib python3 scikit-learn silhouette-score unsupervised-learning
Last synced: 25 Jun 2026
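The kind of analysis described in the entry above (K-Means on synthetic scikit-learn data, scored with the silhouette coefficient across cluster counts) can be sketched as below. The blob parameters, the range of `k`, and the seed are assumptions for illustration, not the repository's values.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_score

# Synthetic data with four well-separated groups (assumed parameters).
X, _ = make_blobs(n_samples=400, centers=4, cluster_std=0.6, random_state=0)

# Fit K-Means for several cluster counts and record the silhouette score of each.
scores = {}
for k in range(2, 7):
    labels = KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(X)
    scores[k] = silhouette_score(X, labels)

# The cluster count with the highest silhouette score is the preferred one.
best_k = max(scores, key=scores.get)
for k, score in scores.items():
    print(f"k={k}: silhouette={score:.3f}")
print("best k:", best_k)
```

Changing `random_state` in `KMeans` is a simple way to probe the random-seed sensitivity the entry mentions.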
https://github.com/robertfmath/mammal-taxonomy-gradient-boosting-classifier
Predicting mammalian taxonomic order based on ecological, geographic, and life-history traits
classification gradient-boosting gradient-boosting-classifier machine-learning mammals python scikit-learn taxonomic-classification taxonomy
Last synced: 06 Oct 2025
https://github.com/vivekky57/car-price-prediction
Now you can get Car Price with this wonderful end-to-end project.
flask machine-learning machine-learning-algorithms python python3 random-forest-classifier scikit-learn
Last synced: 13 Apr 2026
https://github.com/dearabhin/girlfriend-predictor
Using machine learning to solve the ultimate college classification problem. A fun project applying Python and Logistic Regression to predict relationship outcomes based on a (hilariously) synthetic dataset. 📊❤️
classification data-science fun-project google-colab jyputer-notebook jypyternotebook logistic-regression machine-learning pandas python scikit-learn
Last synced: 06 Oct 2025
https://github.com/sora468/best-of-ml-python
🏆 Discover top-ranked Python libraries for machine learning, updated weekly to help you find the best tools for your projects.
airport airport-simulation chatgpt configuration data-analysis data-science data-visualization data-visualizations gpt keras machine-learning nlp python scikit-learn tensorflow transformer usg-ai-training-data usg-artificial-intelligence
Last synced: 09 May 2026
https://github.com/muellerconstantin/house-prices
Data analysis about house prices in Ames (Iowa) with advanced regression techniques.
dvc jupyter-notebook python python3 scikit-learn
Last synced: 14 Apr 2026
https://github.com/harris-giki/e-comdataanalysis_ml
E-commerce Customer Analysis with Linear Regression: analyzes customer behavior within an e-commerce setting and predicts yearly customer spending based on various features using a linear regression model.
development ecommerce linear-regression machine-learning model prediction-model python scikit-learn
Last synced: 14 Apr 2026
https://github.com/dukebw/ml-model-selection
Machine learning model selection using Dlib and scikit-learn.
dlib machine-learning ranking scikit-learn
Last synced: 07 Oct 2025
https://github.com/sducournau/ign_lidar_hd_dataset
🏗️ Comprehensive Python library for processing IGN LiDAR HD data into machine learning-ready datasets for Building Level of Detail (LOD) classification. Features GPU/CPU processing, smart data management, and complete ML pipeline integration.
building-classification data-processing dataset france geospatial gis ign lidar lidar-hd numpy point-cloud scikit-learn
Last synced: 20 Jan 2026
https://github.com/albarji/teachingcontainer
A Docker container I use for my lectures
docker keras machine-learning scikit-learn
Last synced: 14 Apr 2026
https://github.com/prarthana-singh/bangalore-house-price-predictor
🏡 Bangalore House Price Prediction – A Machine Learning model to predict house prices in Bangalore using real estate data. Built with Linear Regression, Python, Pandas, NumPy, and Scikit-Learn.
data-analysis eda house-price-prediction linear-regression machine-learning numpy pandas python real-estate regression scikit-learn
Last synced: 19 Apr 2026
https://github.com/jyablonski/nba_elt_mlflow
ML Pipeline for NBA ELT Project
Last synced: 17 Jan 2026