scikit-learn
scikit-learn is a widely used Python module for classic machine learning. It is built on top of SciPy.
- GitHub: https://github.com/topics/scikit-learn
- Wikipedia: https://en.wikipedia.org/wiki/Scikit-learn
- Repo: https://github.com/scikit-learn/scikit-learn
- Created by: David Cournapeau
- Released: January 05, 2010
- Related Topics: scikit, python
- Aliases: sklearn
- Last updated: 2026-07-02 00:27:34 UTC
https://github.com/kunalpisolkar24/dsbda_lab
Collection of practical codes for Savitribai Phule Pune University's Data Science and Big Data Analytics Laboratory (310256).
data-analytics data-preprocessing data-science data-wrangling descriptive-statistics linear-regression logistic-regression mapreduce scala scikit-learn sppu-computer-engineering tf-idf
Last synced: 05 May 2026
https://github.com/divinenaman/color-extraction-api
Extract colours from images using K-means, along with a FastAPI pipeline.
fastapi k-means-clustering scikit-learn
Last synced: 05 May 2026
https://github.com/zenitsu272/fault-detection-ml
Machine Learning based Fault Detection in machines using sensor data
artificial-intelligence decsion-tree machine-learning pandas pandas-dataframe pandas-python scikit-learn
Last synced: 05 May 2026
https://github.com/sadmansakib93/mental-resilience-analysis-using-machine-learning
Utilized supervised and unsupervised ML techniques to analyze mental health and resilience levels of medical students [Project completed in December 2019]
artificial-intelligence classification clustering correlation linear-regression machine-learning machine-learning-algorithms mental-health python regression resilience scikit-learn statistical-analysis
Last synced: 06 May 2026
https://github.com/rishisolanke/twitter-sentiment-analysis-using-machine-learning-
A research project that classifies tweets as positive, negative, or neutral using ML algorithms (Logistic Regression, Naïve Bayes, SVM) with NLP preprocessing.
data-science data-visualization logistic-regression machine-learning ml-models naive-bayes natural-language-processing nlp scikit-learn sentiment-analysis svm text-classification twitter-data
Last synced: 06 May 2026
https://github.com/samia35-2973/living-type-classification-from-codon-usage
Machine learning project to classify living types based on codon usage data using Random Forest and XGBoost classifiers.
classification codon-usage data-cleaning data-preprocessing excel exploratory-data-analysis living-type machine-learning python random-forest-classifier scikit-learn supervised-learning xgboost-classifier
Last synced: 06 May 2026
https://github.com/nicolas-giacomelli/modelo_regressao_linear_vendas
Linear regression model for sales forecasting. Challenge from RocketSeat's AI course.
matplotlib pandas python3 scikit-learn
Last synced: 06 May 2026
https://github.com/billgewrgoulas/recommendation-systems
Algorithms for joke rating prediction using the joke data-set from Kaggle.
algorithm clustering collaborative-filtering machine-learning numpy pandas recommender-system scikit-learn scypi
Last synced: 06 May 2026
https://github.com/lazarust/jupyternotebooks
Storage spot for all my Jupyter Notebooks. Check some of them out!!
jupyter-notebook jupyter-notebooks keras scikit-learn sklearn
Last synced: 06 May 2026
https://github.com/erick957/saleprice-prediction-dataset-analysis-and-cleaning-advance-regression
🏠 Predict house prices using advanced regression techniques with this comprehensive analysis and cleaning project, from data loading to model deployment.
data-analysis data-science eda google-colab machine-learning numpy pandas python scikit-learn scikit-learn-python
Last synced: 06 May 2026
https://github.com/andrewsy1004/logistic-regression-spam-classifier
This project implements a spam email classifier using Logistic Regression.
Last synced: 06 May 2026
https://github.com/sabin74/boston_house_prediction
This project aims to predict the median value of owner-occupied homes in Boston suburbs using various machine learning regression models. Multiple regression techniques were applied, including Linear Regression, Decision Tree, Random Forest, Gradient Boosting and dimensionality reduction with PCA. Hyperparameter tuning was performed.
boston-housing-price-prediction hyperparameter-tuning kaggle-dataset pca-analysis python3 regression-models scikit-learn
Last synced: 06 May 2026
https://github.com/adesartika33/proyek-analisis-data-dataset-iris
This project aims to analyze the Iris dataset, one of the classic datasets in Machine Learning and Data Science. The dataset consists of 150 samples of Iris flowers from three species (Setosa, Versicolor, and Virginica).
classification data-science data-visualization eda exploratory-data-analysis iris-dataset machine-learning python random-forest scikit-learn
Last synced: 06 May 2026
https://github.com/msikorski93/predicting-prices-on-king-county-housing-dataset
Predicting house prices using different regression analysis models.
catboost eda gradient-boosting king-county lightgbm linear-regression machine-learning neural-network polynomial-regression real-estate regression-models scikit-learn tensorflow xgboost
Last synced: 06 May 2026
https://github.com/deshwalx/diabetes-prediction-svm
My first ML project using SVM to predict diabetes
beginner-project classification diabetes machine-learning python scikit-learn svm svm-classifier
Last synced: 06 May 2026
https://github.com/douglas-data-analyst/predictive-analysis
Predictive model for sales forecasting using scikit-learn and machine learning
data-science machine-learning predictive-analytics python sales-forecasting scikit-learn time-series
Last synced: 06 May 2026
https://github.com/pradeep-r04/spam-email-classification
Spam Email Classification Using NLP and Machine Learning involves building a system to identify and categorize emails as either spam or non-spam (ham). This process typically uses Natural Language Processing (NLP) techniques to analyze and preprocess text data and machine learning algorithms to train a model for classification.
artificial-intelligence machine-learning naive-bayes-classifier nlp pkl python scikit-learn streamlit
Last synced: 06 May 2026
https://github.com/williyam-m/company-registration-trends
Utilized Linear Regression from scikit-learn to predict future company registration trends.
flask matplotlib numpy pandas-python scikit-learn
Last synced: 06 May 2026
https://github.com/lintangwisesa/pdb_mti_ui_lab1_k6
Lab 1 assignment for Big Data Management (Pengelolaan Data Besar), MTI UI 2023
machine-learning python3 scikit-learn
Last synced: 06 May 2026
https://github.com/kartheekdama/salary-prediction
This salary prediction model leverages machine learning techniques, including Random Forest, Decision Tree, and Linear Regression, to estimate salaries based on individual attributes such as age, gender, education level, job title, and years of experience. The Random Forest model outperforms the others, achieving the highest R-squared score.
decision-tree exploratory-data-analysis feature-importance linear-regression machine-learning random-forest scikit-learn
Last synced: 06 May 2026
https://github.com/sahilmate/ebm-breast-cancer-classifier
This repository implements an Explainable Boosting Machine (EBM) model for breast cancer classification using scikit-learn and interpret. The project includes data preprocessing, model training, accuracy evaluation, and feature importance visualization.
breast-cancer-classification data-visualization explainable-boosting-machine feature-importance interpret machine-learning scikit-learn
Last synced: 06 May 2026
https://github.com/josepablodmg/python--linear-regression-advertising
A linear regression analysis to predict sales based on advertising spending across TV, radio, and newspaper channels. The project includes exploratory data analysis, model training, coefficient visualization, and residual analysis.
advertising data-analysis exploratory-data-analysis linear-regression machine-learning python regression scikit-learn visualization
Last synced: 06 May 2026
https://github.com/rafay-imraan/recommendation-system
A machine learning model that outputs personalized similar movie recommendations for people based on the ones they have rated positively.
machine-learning pandas python scikit-learn
Last synced: 06 May 2026
https://github.com/theweird-kid/ml-notes
Machine Learning Notes
machine-learning matplotlib numpy scikit-learn seaborn
Last synced: 06 May 2026
https://github.com/avtorgenii/ml-playground
A repository for exploring and experimenting with datasets, building machine learning models, and testing various techniques in data preprocessing, feature engineering, and model evaluation.
matplotlib ml pandas scikit-learn
Last synced: 06 May 2026
https://github.com/samudraneel05/stanford-open-policing
The Stanford Open Policing Project (SOPP) aims to bring transparency to police interactions by collecting and analyzing data on traffic stops across the United States. It accumulates a vast dataset on traffic stops, encompassing details such as demographics, location, and outcomes.
clustering heirarchical-clustering k-means-clustering machine-learning matplotlib pandas python scikit-learn
Last synced: 06 May 2026
https://github.com/jbizzlefoshizzle/ibm_capstone_project
Used K-means clustering and mapping libraries to determine the best cities in San Diego to open a Mexican restaurant
beautifulsoup4 folium-maps geopy pandas-python scikit-learn
Last synced: 06 May 2026
https://github.com/michael95-m/packaging-insurance-claim-model
Packaging a regression model from scikit-learn
feature-engineering machine-learning python python-package scikit-learn
Last synced: 07 May 2026
https://github.com/taquynhnga2001/regression-calories-burnt-prediction
Develop regression models which can predict the total calories a person has burnt during a workout based on some biological measures.
machine-learning python regression-analysis scikit-learn
Last synced: 07 May 2026
https://github.com/eshrathaziz/heart-disease-risk-assessment
Predicting heart disease risk using machine learning for Healthcare Insights.
data-science jupyter-notebook learning machine python scikit-learn
Last synced: 07 May 2026
https://github.com/kirillshiryaev61/customer_activity_prediction
Predicting a decline in customer purchasing activity at an online store. An ML-based model identifies customers at risk of churn to improve retention. Educational project.
jupyter pandas python scikit-learn
Last synced: 07 May 2026
https://github.com/ayaarbi/prediction_des_maladies_cardiovasculaires_avec_ml
This project, developed as part of a Machine Learning course, uses supervised classification algorithms to predict the presence of cardiovascular disease from medical data published on Kaggle.
cardiovascular-diseases jupyter-notebook machine-learning matplotlib pandas python scikit-learn
Last synced: 07 May 2026
https://github.com/z-fran/walmart-store-sales-forecasting
Data analysis and machine learning solution in Python for the Kaggle competition Walmart Recruiting - Store Sales Forecasting.
machine-learning sales-analysis sales-forecasting sales-prediction scikit-learn walmart-sales-forecasting
Last synced: 07 May 2026
https://github.com/rishi035/advanced-house-price-predictions
This is my first project, which I also entered in a Kaggle competition.
linear-regression machine-learning python random random-forest regressor-models scikit-learn
Last synced: 07 May 2026
https://github.com/tony123105/comp4423_garbage_classification
Garbage classification using traditional machine learning approaches (HOG, LBP, SIFT features with SVM, KNN, Random Forest classifiers) and an ensemble method to categorize waste into 10 types.
computer-vision feature-extraction garbage-classification hog image-classification knn lbp machine-learning opencv python random-forest scikit-learn sift svm
Last synced: 07 May 2026
https://github.com/pspanoudakis/machine-learning-nlp
NLP 🤖 📖 projects on Vaccine Sentiment Classification 💉 and Question Answering 💬
bert-fine-tuning glove-embeddings neural-networks pytorch question-answering rnn scikit-learn sentiment-classification softmax-regression squad
Last synced: 07 May 2026
https://github.com/sumdiboii/loan-prediction-decision-trees
A Decision Tree Classifier was implemented to predict personal loan acceptance using a dataset of 5,000 customers. Key features included income, education, mortgage, and credit card usage. The model achieved 97% accuracy, with 92% precision and 76% recall for positive loan predictions, validated using a classification report and confusion matrix.
classification data-visualisation decision-trees loan-prediction machine-learning python scikit-learn supervised-learning
Last synced: 07 May 2026
https://github.com/jennynzhuang/bootstrap_ml_model_evaluation
Enhancing ML Model Evaluation with Bootstrapping
bootstrapping computational-statistics jupyter-notebook machine-learning python scikit-learn
Last synced: 07 May 2026
https://github.com/andrewsy1004/linear-regression-model-for-house-price-prediction
A linear regression model to predict house prices based on features like size, location, and number of rooms. This project demonstrates the application of machine learning in real estate price estimation
linear-regression python scikit-learn xgbregressor
Last synced: 07 May 2026
https://github.com/therayyanshariff/cinereview
A Machine Learning web app for sentiment analysis, using a Scikit-learn NLP model with a custom-styled Streamlit UI.
machine-learning nlp python scikit-learn sentiment-analysis streamlit
Last synced: 04 May 2026
https://github.com/jimmymugendi/bulding-a-decision-tree-to-predict-customer-churn
This repo describes building a decision tree to predict customer churn in a given organisation
accuracy-score decision-tree-classifier matplotlib-pyplot numpy pandas-dataframe scikit-learn
Last synced: 07 May 2026
https://github.com/ynsrc/machine-learning
Machine Learning Examples
classification data-science machine-learning machine-learning-algorithms matplotlib nlp nlp-machine-learning regression scikit-learn sklearn
Last synced: 07 May 2026
https://github.com/henrytseng/example_docker_scikit-learn
A quick example of using Scikit-Learn from a Docker container
Last synced: 08 May 2026
https://github.com/anmolian/cassava_leaf_disease_detection
Image Classification
computer-vision deep-learning machine-learning scikit-learn tensorflow
Last synced: 08 May 2026
https://github.com/aravindnathan02/machine-learning-projects
Machine Learning and Deep Learning projects that mainly focus on predictive modeling.
deep-learning machine-learning neural-networks predictive-modeling python scikit-learn tensorflow
Last synced: 08 May 2026
https://github.com/amiegirl/fellowship_ai
Sentiment Analysis of Movies Dataset
decision-tree-classification logisticregression matplotlib pandas random-forest-classification scikit-learn wordcloud
Last synced: 08 May 2026
https://github.com/prajjwal6969/recommender-system-using-python
A collection of content-based recommendation systems for songs and movies using Python and machine learning.
content-based-filtering cosine-similarity machine-learning movie-recommendation python recommender-system scikit-learn song-recommendation
Last synced: 08 May 2026
https://github.com/jatin-mehra119/churn_modeling
This repository is dedicated to predicting customer churn using machine learning techniques. It includes comprehensive scripts for data preprocessing, model training, and evaluation, along with detailed visualizations and insights.
classification-model datavisualization pandas scikit-learn
Last synced: 08 May 2026
https://github.com/deepanshkhurana/udacityproject-prediciting-boston-housing-prices
This is a Udacity Project for the Machine Learning Nanodegree. Here, we are trying to predict Boston Housing Prices using sklearn.
data-analysis data-science machine-learning python scikit-learn udacity
Last synced: 08 May 2026
https://github.com/samkazan/fraud-detection-ml
Machine learning models for enhanced fraud detection in e-commerce transactions, exploring feature engineering, distance prediction, and clustering analysis.
clustering data-science data-visualization dataanalytics dbscan eda hierarchical-clustering kmeans-clustering knn-imputer matplotlib mlxtend python scikit-learn seaborn xgboost
Last synced: 08 May 2026
https://github.com/icejan/predicton-systems
Various systems that train on data and generate a prediction
lightfm machine-learning numpy python scikit-learn
Last synced: 08 May 2026
https://github.com/oriolventur/assignment-2-model-creation
Assignment 2 from Artificial Intelligence 1 course: Model creation using synthetic data and scikit-learn.
jupyter-notebook model-creation python scikit-learn
Last synced: 08 May 2026
https://github.com/shahzadmustafa15/dbscan-clustering
DBSCAN clustering algorithm applied on synthetic non-linear data (make_moons dataset).
data-science data-visualization dbscan-clustering density-based-clustering machine-learning ml-projects python scikit-learn unsupervised-learning
Last synced: 08 May 2026
https://github.com/laksh2005/fashtag
Fashion Attribute Classification App
beatifulsoup fastapi nextjs pandas python scikit-learn selenium torch torchvision typescript
Last synced: 10 Jun 2026
https://github.com/sundarmd/breast-cancer-detection
Breast-Cancer-Detection is a machine learning project that utilizes logistic regression to predict whether a tumor is benign or malignant based on the Breast Cancer Wisconsin (Diagnostic) dataset. The project demonstrates data preprocessing, model training, and evaluation using the `scikit-learn` library.
logistic-regression machine-learning python scikit-learn
Last synced: 09 May 2026
https://github.com/aasjunior/mlapp-api
This API provides endpoints for applying machine learning algorithms such as K-Nearest Neighbors (KNN), Decision Tree, and Genetic Algorithm. Carried out as an assignment for the Mobile Laboratory/Natural Computing course in the 5th semester of Multiplatform Software Development.
fastapi machine-learning python scikit-learn
Last synced: 09 May 2026
https://github.com/davidrpugh/kaust-cs-294w
Course materials for KAUST CS 294W
deep-learning machine-learning pytorch scikit-learn
Last synced: 09 May 2026
https://github.com/ahmed122000/ml_model_deployment
The HR Analytics: Job Change Predictor is a Flask-based web application that uses machine learning to predict whether an employee will stay with a company or leave. It allows users to train models, evaluate their performance, and make predictions based on employee data, providing valuable insights for HR decision-making.
classification flask machine-learning python3 rest-api scikit-learn
Last synced: 09 May 2026
https://github.com/l1ght14/customer-churn-prediction
Predict customer churn using machine learning models like Logistic Regression and Random Forest. Includes data preprocessing, model evaluation, feature importance, and insights to drive retention strategies.
churn-prediction classification customer-churn customer-churn-prediction data-analysis logistic-regression machine-learning python random-forest scikit-learn telecom
Last synced: 09 May 2026
https://github.com/alphacrypto246/employee-attrition
This project analyzes employee attrition data to uncover key factors driving employee turnover. Using Python, it employs data preprocessing, exploratory data analysis, and machine learning models to predict attrition and provide actionable insights for improving employee retention strategies.
decision-tree-classifier machine-learning machine-learning-algorithms python scikit-learn scikitlearn-machine-learning
Last synced: 09 May 2026
https://github.com/peterchain/titanic
Script for the Titanic dataset for evaluating which passengers survived
kaggle machine-learning pandas-dataframe python3 scikit-learn
Last synced: 09 May 2026
https://github.com/otuemre/viginids
VigiNIDS: A machine learning-based system for detecting malicious network traffic using the UNSW-NB15 dataset. It distinguishes between normal and attack activities, providing a data-driven approach to network security.
classification cybersecurity intrusion-detection-system machine-learning network-intrusion-detection python scikit-learn unsw-nb15 xgboost
Last synced: 09 May 2026
https://github.com/mpolinowski/multi-dimensional-scaling
Multidimensional Scaling is a family of statistical methods that focus on creating mappings of items based on distance.
matplotlib-pyplot multi-dimensional-scaling python scikit-learn
Last synced: 09 May 2026
https://github.com/payall03/spam-mail-detection
A Web App for Detecting Spam Messages using Machine Learning | Flask · TfidfVectorizer · Naive Bayes
css deploy-to-render flask html machine-learning ml-project mlproject naive-bayes naive-bayes-classifier natural-language-processing nlp python scikit-learn spam-detection spam-filter text-classification tfidf-joblib webapp
Last synced: 09 May 2026
https://github.com/saahilanande/naivebayes
Implementing a Naive Bayes classifier from scratch for sentiment analysis of the IMDB dataset
machine-learning naive-bayes-classifier python-3 scikit-learn
Last synced: 09 May 2026
https://github.com/rajan-bhateja/aqi-predictor
Different models trained on Indian Cities to predict AQI
machine-learning-algorithms model-comparison neural-networks scikit-learn tensorflow
Last synced: 09 May 2026
https://github.com/samuelson777/iris-flower-classification
Iris Flower Classification: A machine learning project that classifies iris flowers into three species based on sepal and petal dimensions. Includes data exploration, visualization, and model evaluation using Python and scikit-learn.
classification data-science data-visualization iris-dataset jupyter-notebook machine-learning python scikit-learn
Last synced: 09 May 2026
https://github.com/suvasish114/house-price-estimation
A machine learning model that estimates housing prices in California using the California census data
jupyter-notebook machine-learning python scikit-learn
Last synced: 09 May 2026
https://github.com/mpolinowski/fisher-discriminant-analysis
LDA is a widely used dimensionality reduction technique built on Fisher’s linear discriminant.
linear-discriminant-analysis matplotlib-pyplot python scikit-learn
Last synced: 10 May 2026
https://github.com/naufal-yafi/text-mining-nb.model
Text mining using Naive Bayes algorithm
clasification data-science machine-learning naive-bayes-algorithm python3 scikit-learn streamlit-webapp text-mining
Last synced: 10 May 2026
https://github.com/rudrakhp/ir-project-blog-recommender
machine-learning python scikit-learn
Last synced: 10 May 2026
https://github.com/linggarm/text-summarization-using-tf-idf-vectorizer
Simple Text Summarization using TF-IDF Vectorizer and NLTK library
artificial-intelligence data-science google-colab jupyter-notebook machine-learning natural-language-processing nlp nltk python scikit-learn summarization text-summarization tf-idf
Last synced: 10 May 2026
https://github.com/macdon112/credit-card-fraud-detection
Comparing ML models (Random Forest, KNN, Decision Tree) for credit card fraud detection using SMOTE and stratified cross-validation.
classification data-analysis fraud-detection imbalanced-data machine-learning python scikit-learn
Last synced: 10 May 2026
https://github.com/aneeshmurali-n/nlp-emotion-classification-in-text
Develop machine learning models to classify emotions in text samples.
bag-of-words data emotion-classification feature-extraction machine-learning naive-bayes natural-language-processing nlp nltk preprocessing python scikit-learn svm text-classification tf-idf tokenizer vectorizer
Last synced: 10 May 2026
https://github.com/hassanislam463/nyc_airbnb_eda
This project is a comprehensive data analysis of Airbnb listings in New York City, exploring pricing trends, seasonality effects, host market dynamics, rental preferences, and revenue estimation. It provides valuable insights for hosts, investors, and policymakers to optimize Airbnb operations and understand the short-term rental landscape in NYC.
exploratory-data-analysis matplotlib python scikit-learn seaborn
Last synced: 10 May 2026
https://github.com/zescalante/data1030-final-project
Final project for DATA1030
data-science machine-learning scikit-learn
Last synced: 10 May 2026
https://github.com/djdhairya/student-attendance-management
folium matplotlib pandas scikit-learn
Last synced: 10 May 2026
https://github.com/tnleite/real-estate-opportunities-analysis
This repository presents an analysis of opportunities in the real estate market, combining time series, clustering, and forecasting to identify states with the greatest growth potential and guide efficient expansion strategies.
catboostregressor cluster-analysis data-science kmeans-clustering lightgbm-regressor machine-learning-algorithms numpy regression-models scikit-learn xgboost-regression
Last synced: 10 May 2026
https://github.com/i30101/mathworks2024
Coding tools for 2024 MathWorks Math Modeling Challenge
machine-learning mathematical-modelling python scikit-learn
Last synced: 10 Jun 2026
https://github.com/alphacrypto246/student-learning-style-prediction
An interactive web application built with Streamlit that predicts a student's preferred learning style (visual, auditory, or kinesthetic) using machine learning, aiding educators in personalizing teaching strategies.
machine-learning scikit-learn scikitlearn-machine-learning streamlit
Last synced: 11 May 2026
https://github.com/dtroupe18/statsfinalproject
Simple ML project using UCI dataset
abalone jupyter-notebook linear-regression machine-learning mathplotlib python3 scikit-learn uci-machine-learning
Last synced: 11 May 2026
https://github.com/jazib-2004/prediction-classification-and-clustering-on-public-expenses-dataset
Applies an end-to-end ML pipeline: EDA to understand the data, preprocessing to prepare it for modelling, then regression to predict one feature's value, classification to classify one feature, and K-means for clustering and its analysis.
data-preprocessing exploratory-data-analysis k-means-clustering lasso-regression logistic-regression matplotlib ml-pipeline python scikit-learn
Last synced: 11 May 2026
https://github.com/monarch1108/customerinsights-kmeans
Understanding customers using KMeans and RFM (recency, frequency, and monetary) analysis
data-analysis data-visualization kmeans-clustering machine-learning matplotlib numpy pandas scikit-learn
Last synced: 11 May 2026
https://github.com/bheemisme/brain-tumor-classification
Brain tumor classification using machine learning
deep-learning machine-learning pytorch scikit-learn xgboost
Last synced: 11 May 2026
https://github.com/shridhar1504/titanic-survivor-prediction-datascience-classification-project
This project predicts whether a passenger on the Titanic survived, using machine learning algorithms on the given passenger details.
classification-algorithm data-analysis data-analytics data-cleaning data-preprocessing data-science data-visualization eda exploratory-data-analysis gradient-boosting jupyter-notebook machine-learning machine-learning-algorithms matplotlib naive-bayes-classifier predictive-modeling python3 scikit-learn seaborn supervised-learning
Last synced: 11 May 2026
https://github.com/theladev/machine-learning
This repository showcases my personal projects and interests in Machine Learning and Data Science. Hope you enjoy it.
data-science machine-learning machine-learning-models pandas python scikit-learn
Last synced: 11 May 2026
https://github.com/johannesvc/data-science-portfolio
A curated portfolio of applied data science projects focused on machine learning, NLP, and social impact.
academic-portfolio data-science deep-learning keras machine-learning media-bias nlp pandas scikit-learn
Last synced: 11 May 2026
https://github.com/deaneeth/churn-prediction-model-training
Step-by-step guide to building machine learning models for customer churn prediction, continuing from the data preprocessing phase. The repo covers training, evaluation, and saving of models, with weekly updates.
churn-prediction data-science-projects jupyter-notebook machine-learning model-evaluation model-training model-training-and-evaluation python scikit-learn
Last synced: 11 May 2026
https://github.com/cplaza0997/py-ml
Machine learning
clustering linear-regression logistic-regression ml pyspark python scikit-learn sparkml
Last synced: 11 May 2026
https://github.com/sharvesh1401/inverse-design-patch-antenna
A machine learning approach to the inverse design of microstrip patch antennas by predicting optimal physical dimensions from desired performance metrics.
antenna-design deep-learning engineering-project gradio jupyter-notebook machine-learning patch-antenna python regression-model scikit-learn
Last synced: 11 May 2026
https://github.com/antonio-f/k_nearest_neighbors
Quick k-nearest neighbors example
easy k-nearest-neighbors knn machine-learning matplotlib python scikit-learn visualization
Last synced: 11 May 2026
https://github.com/cptanalatriste/copycat-detector
A Naive-Bayes classifier for detecting plagiarism.
amazon-sagemaker naive-bayes-classifier scikit-learn
Last synced: 12 May 2026
https://github.com/capsuleismail/rt-iot2022
RT-IoT2022 is a dataset obtained from a real-time IoT infrastructure. This project aims to compare the accuracy of three machine learning models, among them XGBoost and LGBMClassifier.
datascience jupyter-notebook machinelearning-python scikit-learn
Last synced: 12 May 2026
https://github.com/msikorski93/seed-clustering
Performing basic clustering on a seeds dataset.
agglomerative clustering dbscan gaussian-mixture-model gmm mini-batch-kmeans scikit-learn seeds
Last synced: 13 May 2026
https://github.com/mateusoliveira30/house-prices
This project was developed for the Kaggle competition "House Prices - Advanced Regression Techniques." The goal is to predict house sale prices using advanced regression techniques, including feature engineering, Random Forests, and Gradient Boosting.
kaggle-competition machine-learning scikit-learn
Last synced: 13 May 2026
https://github.com/janek1842/mlbyjan-sandbox
Testbed for private ML investigations
Last synced: 14 May 2026
https://github.com/fulviofavilla/cvd-prediction-ml
Comparative ML analysis for CVD prediction. Winner of the 2023 HPCC Systems Poster Competition.
data-science ecl healthcare hpcc-systems machine-learning pandas python scikit-learn
Last synced: 11 Jun 2026
https://github.com/sedefkjamili/dengai-ml-prediction
Machine learning project for predicting dengue fever outbreaks using climate and environmental data.
data-science dengue gradient-boosting healthcare machine-learning python scikit-learn time-series
Last synced: 12 Jun 2026