scikit-learn
scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.
- GitHub: https://github.com/topics/scikit-learn
- Wikipedia: https://en.wikipedia.org/wiki/Scikit-learn
- Repo: https://github.com/scikit-learn/scikit-learn
- Created by: David Cournapeau
- Released: January 05, 2010
- Related Topics: scikit, python,
- Aliases: sklearn,
- Last updated: 2026-06-25 00:23:58 UTC
- JSON Representation
https://github.com/fadlani-aditya/iris-plant-classification
This project focuses on classifying different species of Iris flowers using the Random Forest algorithm. The dataset, sourced from Scikit-learn, contains four key features: sepal length, sepal width, petal length, and petal width, which are used to predict the flower species (Setosa, Versicolor, and Virginica).
agriculture data-science iris-dataset machine-learning python scikit-learn supervised-learning
Last synced: 01 May 2026
https://github.com/myahninsi/customer-segmentation-recommendation-ml
This project addressed challenges in understanding customer behavior and personalizing shopping experiences for an e-commerce platform. Developed ML solutions including K-Means clustering for segmentation, Random Forest regression for CLV prediction, and collaborative filtering for product recommendations.
collaborative-filtering k-means-clustering pandas python random-forest scikit-learn
Last synced: 01 May 2026
https://github.com/afondiel/intro-to-artificial-intelligence-free-course-le-wagon
Intro to Artificial Intelligence Free Course from @LeWagon
ai ai-courses artificial-intelligence data-science lewagon machine-learning numpy pandas prophet-facebook regression-models scikit-learn sklearn
Last synced: 01 May 2026
https://github.com/himanshugoyal77/shell-detection-frontend
Fraud detection of companies using Machine learning and django
Last synced: 01 May 2026
https://github.com/vansh-khaneja/spam-email-detection
This is a spam email detection model
machine-learning naive-bayes-classifier scikit-learn spam-detection
Last synced: 01 May 2026
https://github.com/kristishqau/sentimentanalysis_nlp
A project for sentiment analysis of tweets using various NLP techniques and machine learning models.
datascience jupyter-notebook machine-learning nlp nltk python scikit-learn sentiment-analysis xgboost
Last synced: 01 May 2026
https://github.com/k-karna/ml-scikit
Machine Learning feature engineering through Scikit-Learn
cross-validation-grid-search dimensionality-reduction gradient-boosting machine-learning poisson-regression python3 scikit-learn t-sne
Last synced: 01 May 2026
https://github.com/30mb1/ml-linear-algorithms
Using linear models for classification.
classification linear-algorithms linear-models machine-learning machine-learning-algorithms matplotlib perceptron quality scikit-learn scikitlearn-machine-learning svm svm-classifier
Last synced: 01 May 2026
https://github.com/jlee9503/medical-readmission
Conduct an analysis of medical readmission status using hospital patient data and the Social Determinants of Health dataset. Identify key factors influencing readmission rates to provide insights for improving healthcare outcomes.
python random-forest-regression scikit-learn tableau
Last synced: 01 May 2026
https://github.com/maxwelllzh/linearizer
Linearizing parameters for linear regression
data-analysis machine-learning scikit-learn
Last synced: 02 May 2026
https://github.com/sundanc/btcprediction
Predict Bitcoin prices based on historical data using machine learning techniques
bitcoin-prediction keras machine-learning pandas python python3 scikit-learn scikitlearn-machine-learning
Last synced: 02 May 2026
https://github.com/vutsalsinghal/churn-prediction
Predict if a customer will stay or not
data-science jupyter-notebook machine-learning pandas python scikit-learn
Last synced: 02 May 2026
https://github.com/bishopce16/cryptocurrencies
An analysis on cryptocurrencies dataset using unsupervised machine learning, PCA algorithm, and K-means clustering.
hvplot jupyter-notebook pandas plotly python scikit-learn unsupervised-machine-learning visual-studio-code
Last synced: 02 May 2026
https://github.com/jofaval/red-wine-quality
Data Analysis of the Red Portuguese's Wine's Quality in 2009
classification data-analysis data-science data-visualization google-colab kaggle logistic-regression machine-learning python scikit-learn wine-quality xgboost
Last synced: 03 May 2026
https://github.com/moritzkoerber/data_science_posts
This repository hosts the code for my data science related blog posts.
hyperparameter-tuning machine-learning pipeline python scikit-learn
Last synced: 03 May 2026
https://github.com/viniciusds2020/ml_pycaret_classificacao
Sistema de preprocessamento e treinamento de modelos de machine learning utilizando PyCaret. Uma metodologia low-code para processos de MLops
machine-learning mlops preprocessing pycaret python scikit-learn
Last synced: 03 May 2026
https://github.com/alessandromonolo/fraud-detection-binary-classification-model
This project builds a machine learning model to classify fraudulent clients using a banking dataset. Data preprocessing, statistical analysis, and feature selection were performed before training KNN and Random Forest Classifier. Model performance was evaluated using accuracy, precision, recall, and F1-score.
classification-model fraud-detection knn-classification machine-learning pandas python random-forest scikit-learn statistical-analysis
Last synced: 03 May 2026
https://github.com/arrhythmia-detection/authorprovidedfeaturescombineddt
Deploys a vanilla Decision Tree for Arrhythmia classification using Chapman ECG dataset on Arduino UNO board
arduino-uno arrhythmia-classification atmega328p chapman-ecg decision-tree-classifier eloquent scikit-learn
Last synced: 09 Jun 2026
https://github.com/zhenglinlei/zdmp
Industry 4.0 Optimization with Machine Learning AI
industry-4 knn-classification machine-learning pandas python scikit-learn
Last synced: 03 May 2026
https://github.com/apfirebolt/movie_recommendation_using_scikitlearn_and_pyqt5
A movie recommendation system built using KNN model from scikit-learn library. GUI components are powered by pyQt5, a library to create GUI applications in Python
cosine-similarity jupyter-notebook knn-algorithm movie-recommedation pandas python scikit-learn
Last synced: 03 May 2026
https://github.com/furk4nbulut/uygulamalarla-makine-ogrenmesi-ve-derin-ogrenme-atolyesi
Bu repository, Manisa'da gerçekleştirilen BTK Akademi Uygulamalı Makine Öğrenmesi ve Derin Öğrenme Atölyesi'ne ait eğitim sürecini kapsamaktadır. Atölyede katılımcılar, ileri düzey makine öğrenmesi ve derin öğrenme teknikleriyle ilgili teorik ve pratik bilgiler edinmektedir.
matplotlib numpy pandas scikit-learn seaborn
Last synced: 03 May 2026
https://github.com/pramodyasahan/binary-classifier
This repository houses the code for a machine learning model designed to predict customer churn. The model is built using Support Vector Machine (SVM) from the scikit-learn library and incorporates preprocessing, pipeline, and grid search techniques for optimal performance.
Last synced: 03 May 2026
https://github.com/alestankiewicz/credit-card-fraud-detection
Credit Card Fraud Detection Excercise In Python
pandas plotly python3 scikit-learn xgboost
Last synced: 03 May 2026
https://github.com/arnavk-09/phishing-detection
🎣 Detect Phishing URLs with Data Pre-fitted... API & Web UI
csv data fastapi flask python scikit-learn
Last synced: 03 May 2026
https://github.com/atchayaah/home-value-insights-kc
Data-driven project predicting King County housing prices using EDA, regression models, and ML techniques, developed as part of IBM’s Data Analysis with Python course on Coursera.
joblib matplotlib numpy pandas pickle python scikit-learn seaborn
Last synced: 03 May 2026
https://github.com/lucs1590/commom_segmentations
The purpose of this repository is to document and expose code samples using common threading techniques.
computational-vision machine-learning open-source opencv python scikit-image scikit-learn segmentation sklearn
Last synced: 03 May 2026
https://github.com/ceodaniyal/telecom_customer_churn_prediction
A machine learning project that predicts whether a telecom customer will churn (leave the service) using customer demographics, account information, and service usage. The repository includes data preprocessing, model training (with logistic regression), feature scaling, and example predictions.
classification customer-churn-prediction data-science logistic-regression machine-learning ml-project pandas prediction python scikit-learn streamlit telecom
Last synced: 04 May 2026
https://github.com/codejsha/machine-learning-examples
Examples of machine learning using scikit-learn
Last synced: 04 May 2026
https://github.com/danielwohlr/delivery_time_series
Time series forecasting of food delivery service data
forecasting-time-series python scikit-learn
Last synced: 04 May 2026
https://github.com/homebackend/pdf-title-page-splitter
Splits a pdf based on identified title pages using ML trained model
machine-learning opencv pdf-splitter pdf2image pypdf2 scikit-learn tensorflow
Last synced: 04 May 2026
https://github.com/mariiasam/stroke-prediction
A model for predicting the risk of stroke in a patient
balanced-random-forest-classifier decission-tree-classifier gradient-boosting imbalanced-learning joblib logistic-regression matplotlib numpy random-forest-classifier scikit-learn seaborn streamlit
Last synced: 04 May 2026
https://github.com/joel-beck/airbnb-oslo
Price Prediction Models for Airbnb Apartments in Oslo | Winter Term 2021/22
prediction python pytorch scikit-learn
Last synced: 04 May 2026
https://github.com/suguru-n/temp_easyai
学部生向け機械学習体験プログラム
google-colab jupyter-notebook linearregression python scikit-learn
Last synced: 04 May 2026
https://github.com/dakii24/credit-card-fraud-detection
This repository contains a machine learning project focused on detecting fraudulent credit card transactions. The project includes data preprocessing, model training, and evaluation to identify and prevent fraudulent activities.
capstone-project class-imbalance classification-algorithm credit-card credit-card-fraud data-science decision-trees fraud machine-learning open-data python scikit-learn svm svm-classifier
Last synced: 04 May 2026
https://github.com/madhu26sree/diabetes-prediction
This project leverages the Support Vector Machine (SVM) algorithm to predict whether a person is likely to have diabetes or not, using the Diabetes dataset. It covers data preprocessing, model building, evaluation using Python.
machine-learning python scikit-learn
Last synced: 04 May 2026
https://github.com/chathumiamarasinghe/nn-training-model
A comprehensive project for training neural networks to solve real-world problems. This repository includes customizable code for building, training, and evaluating neural network architectures using popular deep learning frameworks.
jupyter-notebook matplotlib numpy phyton scikit-learn
Last synced: 04 May 2026
https://github.com/siddhantborse/atmosviz
Atmos Viz is a Python-based project designed to analyze, visualize, and predict global temperature trends across various cities and countries using time-series analysis and advanced data science techniques. Leveraging historical climate data, this project integrates machine learning models, geospatial mapping, and interactive visualizations to unco
geopandas geospatial-analysis gis matplotlib numpy pandas plotly python scikit-learn seaborn shapefiles time timeseries-analysis timeseries-data
Last synced: 05 May 2026
https://github.com/navindafernando/face-recognition
Training a SVM Pipeline for Face Recognition :anger:
classification haar-cascade-classifier joblib machine-learning numpy opencv pipeline principal-component-analysis scikit-learn ski support-vector-machines
Last synced: 05 May 2026
https://github.com/jcm-ai/personal-data-science-projects
This page contains all of my personal data science projects. 📊📈📉👨💻
data-analysis data-visualization exploratory-data-analysis jupyter-notebooks machine-learning-algorithms matplotlib-pyplot numpy-library pandas-python personal-project predictive-modeling programming python3 scikit-learn scipy seaborn statistical-analysis
Last synced: 05 May 2026
https://github.com/himanshkr03/comparative_performance_on_fashionmnist
This repository explores various machine learning and deep learning models for classifying images from the Fashion MNIST dataset. It includes data exploration, model training, evaluation, and visualization techniques to gain insights into the classification task.
deep-learning fashion-mnist fine hybrid-model image-classification keras machine-learning scikit-learn tensorflow xgboost-algorithm
Last synced: 05 May 2026
https://github.com/s-matke/eco-forecast
Machine learning model used for predicting European country with most green surplus energy generated
data-science green-energy machine-learning scikit-learn supervised-learning
Last synced: 05 May 2026
https://github.com/kriss024/anaconda-python-for-data-science-basic
Anaconda Python 3.8.8 with TensorFlow 2 Docker image
anaconda3 data-science docker jupyter-notebook machine-learning python python3 scikit-learn tensorflow2
Last synced: 05 May 2026
https://github.com/marconicivitavecchia/stazione-monitoraggio-ambientale
Codice in MicroPython per ESP32 per il corso tenuto dalla nostra scuola rivolto ai docenti sulla creazione di una stazione di monitoraggio ambientale che copre gli argomenti di Python, IoT ed Intelligenza Artificiale.
ai esp32 micropython micropython-esp32 python school-project scikit-learn
Last synced: 05 May 2026
https://github.com/zafir100100/cancer-stage-prediction
This code predicts cancer data using various regression models, calculates their average R-squared scores, and prints the best model.
cross-validation data-analysis data-preprocessing decision-trees gradient-boosting linear-regression machine-learning-algorithms numpy pandas random-forest regression scikit-learn
Last synced: 05 May 2026
https://github.com/hitthecodelabs/petalanalyticsstreamlit
Web application developed with Streamlit that predicts the Iris flower type based on its physical features
matplotlib model numpy pickle python scikit-learn sklearn streamlit
Last synced: 05 May 2026
https://github.com/smaddanki/pattern-pursuit-challenge
A personal challenge to build a production-ready trading signal system for S&P 500 stocks using deep learning. This project progresses from basic ML models to a complete trading infrastructure, focusing on 5-day forward return prediction and signal generation.
deep-learning machine-learning pytorch quantative-trading quantitative-finance quantitative-research scikit-learn
Last synced: 05 May 2026
https://github.com/patilsukanya/house-price-prediction
Libraries Used
matplotlib numpy pandas scikit-learn seaborn
Last synced: 05 May 2026
https://github.com/teja-1403/coursera-machine-learning-with-python-honors
This project involves building a classifier to predict rainfall for the next day based on weather data from the Australian Government's Bureau of Meteorology. Various machine learning techniques such as Linear Regression, KNN, Decision Trees, Logistic Regression, and SVM were implemented and evaluated.
classification hierarchical-clustering machine-learning regression scikit-learn scipy
Last synced: 05 May 2026
https://github.com/nandinimarepalli/ai_ml_internship_projects
Projects completed during my AI/ML and Data Expert internship, including EDA, machine learning models, and dashboard development using Python, pandas, scikit-learn, and visualization libraries.
matplotlib numpy pandas python scikit-learn seaborn
Last synced: 05 May 2026
https://github.com/vedanty3/bulldozer-price-prediction
A machine learning project aiming to build a machine learning model which could predict the sales price of bulldozer.
andrew-ng-machine-learning ensemble-machine-learning gridsearchcv jupyter-notebook machine-learning matplotlib numpy pandas python randomforestregressor randomizedsearchcv scikit-learn ztm
Last synced: 05 Apr 2026
https://github.com/rohit1901/py-cluster
Classifier and Cluster Analysis in Data Science
classification clustering data-science k-means-clustering machine-learning pytest python python3 ruff scikit-learn
Last synced: 05 May 2026
https://github.com/aryar-06/linear-regression
A Python project demonstrating basic linear regression with gradient descent and matrix operations, alongside scikit-learn comparison.
data-analysis data-preprocessing educational-project gradient-descent linear-regression machine-learning python regression-algorithms scikit-learn
Last synced: 05 May 2026
https://github.com/antoniskl/un-general-debate-corpus-classification
The aim of this project is to classify UNGDC speeches with regards to climate change. As a secondary objective, a correlation is being examined between these speeches, the forestation and the happiness index of the countries.
classification data-science jupyter-notebook machine-learning nlp python regression scikit-learn text-classification text-preprocessing
Last synced: 05 May 2026
https://github.com/kefrankk/ml-fraud-detection
I built a predictive model to detect fraud in financial transactions.
Last synced: 05 May 2026
https://github.com/debangshu9183/spam-mail-detection
To detect spam mail
logistic-regression machine-learning numpy pandas scikit-learn seaborn
Last synced: 05 May 2026
https://github.com/kunalpisolkar24/dsbda_lab
Collection of practical codes for Savitribai Phule Pune University's Data Science and Big Data Analytics Laboratory (310256).
data-analytics data-preprocessing data-science data-wrangling descriptive-statistics linear-regression logistic-regression mapreduce scala scikit-learn sppu-computer-engineering tf-idf
Last synced: 05 May 2026
https://github.com/sevilaymuni/project-no.6-tree-based-models
Random Forest Assisted Suggestions for Salifort Motors Employee Retention: Plan, Analyze, Construct and Execute
data-science decision-trees evaluation-metrics gridsearchcv logistic-regression machine-learning matplotlib python random-forest-classifier scikit-learn seaborn-plots
Last synced: 05 May 2026
https://github.com/pjj11005/ml_with_pytorch_study
[머신 러닝 교과서: 파이토치 편] -> 학습한 코드 저장소
deep-learning graph-neural-networks machine-learning neural-networks pytorch scikit-learn transformer
Last synced: 06 May 2026
https://github.com/grandechowhiskey/fcc-machine_learning-boilerplates
A collection of projects completed as part of the FreeCodeCamp "Machine Learning with Python" certification. These projects focus on implementing machine learning models, data preprocessing, and predictive analysis using libraries like scikit-learn and TensorFlow.
ai ml python3 scikit-learn tensorflow
Last synced: 06 May 2026
https://github.com/keneandita/iris-intel
Iris Flower Classifier is a simple web app built with Streamlit that predicts the species of an Iris flower based on user-input flower features. It uses pre-trained machine learning models including Logistic Regression, K-Nearest Neighbors, SVM, and Decision Tree to make real-time predictions.
iris-classification jupyter-notebook machine-learning python scikit-learn streamlit
Last synced: 06 May 2026
https://github.com/eshansugeesh/fico-score-loan-default-modeling-project
Credit risk assessment using FICO score segmentation, loan default modeling, discretization techniques, and log-likelihood evaluation for predictive analytics in financial services.
bucketing classification credit-risk customer-segmentation data-science discretization fico-score financial-analytics loan-analysis loan-default log-likelihood machine-learning numpy pandas predictive-modeling risk-modeling scikit-learn segmentation statistical-modelling
Last synced: 06 May 2026
https://github.com/billgewrgoulas/recommendation-systems
Algorithms for joke rating prediction using the joke data-set from Kaggle.
algorithm clustering collaborative-filtering machine-learning numpy pandas recommender-system scikit-learn scypi
Last synced: 06 May 2026
https://github.com/anu-gtb/campusplacement
Link to Streamlit app :
binary-classification campus-placements data-science data-visualization machine-learning matplotlib ml-algorithms predictive-modeling python3 scikit-learn seaborn statistics streamlit
Last synced: 06 May 2026
https://github.com/sabin74/boston_house_prediction
This project aims to predict the median value of owner-occupied homes in Boston suburbs using various machine learning regression models. Multiple regression techniques were applied, including Linear Regression, Decision Tree, Random Forest, Gradient Boosting and dimensionality reduction with PCA. Hyperparameter tuning was performed.
boston-housing-price-prediction hyperparameter-tuning kaggle-dataset pca-analysis python3 regression-models scikit-learn
Last synced: 06 May 2026
https://github.com/adesartika33/proyek-analisis-data-dataset-iris
Proyek ini bertujuan untuk menganalisis dataset Iris, salah satu dataset klasik dalam bidang Machine Learning dan Data Science. Dataset ini terdiri dari 150 sampel bunga Iris dari tiga spesies (Setosa, Versicolor, dan Virginica)
classification data-science data-visualization eda exploratory-data-analysis iris-dataset machine-learning python random-forest scikit-learn
Last synced: 06 May 2026
https://github.com/msikorski93/predicting-prices-on-king-county-housing-dataset
Predicting house prices using different regression analysis models.
catboost eda gradient-boosting king-county lightgbm linear-regression machine-learning neural-network polynomial-regression real-estate regression-models scikit-learn tensorflow xgboost
Last synced: 06 May 2026
https://github.com/pimakarov/textkd-p4-fewshot-distilbert
📊 Compare few-shot text classification with DistilBERT and TF-IDF + SVM using IMDB data, analyzing performance across various sample sizes.
bert distilbert few-shot-learning nlp python pytorch scikit-learn text-classification transfer-learning trasformer
Last synced: 06 May 2026
https://github.com/tharunkumar666/employee_attrition_prediction
Predict employee attrition using Logistic Regression. Use Python with Pandas and Scikit-learn to analyze factors like salary, satisfaction, and promotion history. Model classifies if an employee will stay or leave, helping HR take proactive retention measures.
pandas python regression-models scikit-learn
Last synced: 06 May 2026
https://github.com/pradeep-r04/spam-email-classification
Spam Email Classification Using NLP and Machine Learning involves building a system to identify and categorize emails as either spam or non-spam (ham). This process typically uses Natural Language Processing (NLP) techniques to analyze and preprocess text data and machine learning algorithms to train a model for classification.
artificial-intelligence machine-learning naive-bayes-classifier nlp pkl python scikit-learn streamlit
Last synced: 06 May 2026
https://github.com/williyam-m/company-registration-trends
Utilized Linear Regression from scikit-learn to predict future company registration trends.
flask matplotlib numpy pandas-python scikit-learn
Last synced: 06 May 2026
https://github.com/lintangwisesa/pdb_mti_ui_lab1_k6
Tugas Lab 1 Pengelolaan Data Besar MTI UI 2023
machine-learning python3 scikit-learn
Last synced: 06 May 2026
https://github.com/bhavyac16/flairifyme
FlairifyMe is a Reddit Flair Detector for r/india subreddit, that takes a post's URL as user input and predicts the flair for the post using a model generated by Logistic Regression.
flair-prediction flask hacktoberfest linear-svm logistic-regression naive-bayes-classifier nltk praw-reddit reddit-flair-detector scikit-learn scraped-data subreddit text-classification
Last synced: 06 May 2026
https://github.com/sahilmate/ebm-breast-cancer-classifier
This repository implements an Explainable Boosting Machine (EBM) model for breast cancer classification using scikit-learn and interpret. The project includes data preprocessing, model training, accuracy evaluation, and feature importance visualization.
breast-cancer-classification data-visualization explainable-boosting-machine feature-importance interpret machine-learning scikit-learn
Last synced: 06 May 2026
https://github.com/kumar-o/customer-churn-prediction
🔍 Predict customer churn using a synthetic dataset with advanced models and metrics to enhance business retention strategies and decision-making.
churn-prediction customer-churn-prediction customer-survival-analysis data-science factorization-machines flask-application gridsearchcv libsvm machine-learning model-evaluation numpy pipelines roc-auc scikit-learn seaborn smote survival-analysis xgboost4j
Last synced: 06 May 2026
https://github.com/rafay-imraan/recommendation-system
A machine learning model that outputs personalized similar movie recommendations for people based on the ones they have rated positively.
machine-learning pandas python scikit-learn
Last synced: 06 May 2026
https://github.com/ccastleberry/hands_on_machine_learning
Notebooks and files created while working through the book Hands on Machine Learning
data-science jupyter-notebook scikit-learn tensorflow
Last synced: 06 May 2026
https://github.com/avtorgenii/ml-playground
A repository for exploring and experimenting with datasets, building machine learning models, and testing various techniques in data preprocessing, feature engineering, and model evaluation.
matplotlib ml pandas scikit-learn
Last synced: 06 May 2026
https://github.com/galaxy092/samsung-innovation-campus-big-data-capstone-project
Samsung Innovation Campus Big Data Capstone Project - Weather Prediction
hadoop jupyter-notebook pandas pyspark scikit-learn sparksql
Last synced: 06 May 2026
https://github.com/samudraneel05/stanford-open-policing
The Stanford Open Policing Project (SOPP) aims to bring transparency to police interactions by collecting and analyzing data on traffic stops across the United States. It accumulates a vast dataset on traffic stops, encompassing details such as demographics, location, and outcomes.
clustering heirarchical-clustering k-means-clustering machine-learning matplotlib pandas python scikit-learn
Last synced: 06 May 2026
https://github.com/kianaabrisham/svm-from-scratch
Linear SVM from scratch with hinge loss + decision boundaries
classification from-scratch fundamentals hinge-loss numpy optimization scikit-learn svm
Last synced: 07 May 2026
https://github.com/taquynhnga2001/regression-calories-burnt-prediction
Develop regression models which can predict the total calories a person has burnt during a workout based on some biological measures.
machine-learning python regression-analysis scikit-learn
Last synced: 07 May 2026
https://github.com/kirillshiryaev61/customer_activity_prediction
Прогнозирование снижения покупательской активности в интернет-магазине. Модель на основе ML выявляет клиентов с риском оттока для повышения удержания. Учебный проект.
jupyter pandas python scikit-learn
Last synced: 07 May 2026
https://github.com/ayaarbi/prediction_des_maladies_cardiovasculaires_avec_ml
Ce projet, développé au sein de cours de Machine Learning, utilise des algorithmes de classification supervisée pour prédire la présence de maladies cardio-vasculaires à partir de données médicales publiées sur Kaggle.
cardiovascular-diseases jupyter-notebook machine-learning matplotlib pandas python scikit-learn
Last synced: 07 May 2026
https://github.com/z-fran/walmart-store-sales-forecasting
Data analysis and machine learning solution in Python for the Kaggle competition Walmart Recruiting - Store Sales Forecasting.
machine-learning sales-analysis sales-forecasting sales-prediction scikit-learn walmart-sales-forecasting
Last synced: 07 May 2026
https://github.com/govind-prakash/machinelearning
A collection of my machine learning projects, tutorial exercises, algorithm implementations, and related code.
decision-trees gradientboostinclassifier linear-regression logistic-regression scikit-learn unsupervised-learning
Last synced: 07 May 2026
https://github.com/tony123105/comp4423_garbage_classification
Garbage classification using traditional machine learning approaches (HOG, LBP, SIFT features with SVM, KNN, Random Forest classifiers) and an ensemble method to categorize waste into 10 types.
computer-vision feature-extraction garbage-classification hog image-classification knn lbp machine-learning opencv python random-forest scikit-learn sift svm
Last synced: 07 May 2026
https://github.com/sumdiboii/loan-prediction-decision-trees
A Decision Tree Classifier was implemented to predict personal loan acceptance using a dataset of 5,000 customers. Key features included income, education, mortgage, and credit card usage. The model achieved 97% accuracy, with 92% precision and 76% recall for positive loan predictions, validated using a classification report and confusion matrix.
classification data-visualisation decision-trees loan-prediction machine-learning python scikit-learn supervised-learning
Last synced: 07 May 2026
https://github.com/andrewsy1004/linear-regression-model-for-house-price-prediction
A linear regression model to predict house prices based on features like size, location, and number of rooms. This project demonstrates the application of machine learning in real estate price estimation
linear-regression python scikit-learn xgbregressor
Last synced: 07 May 2026
https://github.com/mwasifanwar/automl_framework
Comprehensive AutoML framework that automates data preprocessing, feature engineering, model selection, hyperparameter tuning, and deployment. Features neural architecture search and automated data cleaning pipelines.
automl automl-algorithms data-science data-science-projects feature-engineering feature-engineering-algorithm feature-engineering-ml hyperparameter-optimization machine-learning machine-learning-algorithms machine-learning-models mlops mlops-workflow python scikit-learn scikit-learn-python
Last synced: 07 May 2026
https://github.com/jimmymugendi/bulding-a-decision-tree-to-predict-customer-churn
This repo desribes bulding a decision tree to predict customer churn in a given organisation
accuracy-score decision-tree-classifier matplotlib-pyplot numpy pandas-dataframe scikit-learn
Last synced: 07 May 2026
https://github.com/cnoret/hexa-watts
Interactive data visualization and machine learning app for energy consumption analysis and prediction in France, built with Streamlit. (Text in French)
data-visualization electricity-forecasting energy-analysis france machine-learning scikit-learn streamlit
Last synced: 07 May 2026
https://github.com/moustafamohamed01/mall-customer-segmentation-data
Customer segmentation using K-Means clustering based on annual income and spending score.
data-science data-visualization k-means-clustering machine-learning python scikit-learn unsupervised-learning
Last synced: 08 May 2026
https://github.com/anmolian/cassava_leaf_disease_detection
Image Classification
computer-vision deep-learning machine-learning scikit-learn tensorflow
Last synced: 08 May 2026
https://github.com/jahanostg/linear-regression_ml-algorithm
Linear Regression Algorithm
colab-notebook matplotlib numpy pandas scikit-learn seaborn
Last synced: 08 May 2026
https://github.com/amiegirl/fellowship_ai
Sentiment Analysis of Movies Dataset
decision-tree-classification logisticregression matplotlib pandas random-forest-classification scikit-learn wordcloud
Last synced: 08 May 2026
https://github.com/thekartikeyamishra/data-preprocessor
A Google Colab module for interactive data preprocessing. Handles missing values, categorical encoding (One-Hot, Label), and numerical scaling (Standard, MinMax). Outputs a cleaned dataset
ipywidgets numpy pandas python scikit-learn
Last synced: 08 May 2026
https://github.com/jatin-mehra119/churn_modeling
This repository is dedicated to predicting customer churn using machine learning techniques. It includes comprehensive scripts for data preprocessing, model training, and evaluation, along with detailed visualizations and insights.
classification-model datavisualization pandas scikit-learn
Last synced: 08 May 2026