scikit-learn
scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.
- GitHub: https://github.com/topics/scikit-learn
- Wikipedia: https://en.wikipedia.org/wiki/Scikit-learn
- Repo: https://github.com/scikit-learn/scikit-learn
- Created by: David Cournapeau
- Released: January 05, 2010
- Related Topics: scikit, python,
- Aliases: sklearn,
- Last updated: 2026-06-25 00:23:58 UTC
- JSON Representation
https://github.com/inesruizblach/data-science-project
A data science project exploring Portuguese "Vinho Verde" wine quality prediction. Features EDA, feature engineering, ML models, and evaluation using Python, pandas, scikit-learn, and visualization tools.
binary-classification classification data-science exploratory-data-analysis feature-engineering imbalanced-learn jupyter-notebook machine-learning model-evaluation pandas regression scikit-learn seaborn uci-dataset wine-quality
Last synced: 09 May 2026
https://github.com/kianaabrisham/naive-bayes-sentiment
Sentiment classification using Multinomial NB (scratch + sklearn)
bag-of-words naive-bayes nlp scikit-learn sentiment-analysis text-classification
Last synced: 14 May 2026
https://github.com/scorchinghot/core-machine-learning-exploration
This repository provides a hands-on exploration of classical machine learning algorithms applied to the MovieLens 100k dataset, aiming to build intuition and understanding of core ML concepts.
core-ml data-science hands-on machine-learning ml-algorithms python scikit-learn tutorial
Last synced: 05 Oct 2025
https://github.com/vedanty3/bulldozer-price-prediction
A machine learning project aiming to build a machine learning model which could predict the sales price of bulldozer.
andrew-ng-machine-learning ensemble-machine-learning gridsearchcv jupyter-notebook machine-learning matplotlib numpy pandas python randomforestregressor randomizedsearchcv scikit-learn ztm
Last synced: 05 Apr 2026
https://github.com/srisaihariharan/mic_sentiment_analysis_v
Sentiment analysis of IMDb movie reviews using Python, Scikit-learn, and TF-IDF.
machine-learning natural-language-processing nlp python scikit-learn sentiment-analysis sentiment-classification
Last synced: 03 May 2026
https://github.com/apfirebolt/movie_recommendation_using_scikitlearn_and_pyqt5
A movie recommendation system built using KNN model from scikit-learn library. GUI components are powered by pyQt5, a library to create GUI applications in Python
cosine-similarity jupyter-notebook knn-algorithm movie-recommedation pandas python scikit-learn
Last synced: 03 May 2026
https://github.com/furk4nbulut/uygulamalarla-makine-ogrenmesi-ve-derin-ogrenme-atolyesi
Bu repository, Manisa'da gerçekleştirilen BTK Akademi Uygulamalı Makine Öğrenmesi ve Derin Öğrenme Atölyesi'ne ait eğitim sürecini kapsamaktadır. Atölyede katılımcılar, ileri düzey makine öğrenmesi ve derin öğrenme teknikleriyle ilgili teorik ve pratik bilgiler edinmektedir.
matplotlib numpy pandas scikit-learn seaborn
Last synced: 03 May 2026
https://github.com/jonad/finding_donors
Predicting income with UCI Census Income Dataset using supervised machine learning algorithms
numpy pandas scikit-learn scikitlearn-machine-learning
Last synced: 03 May 2026
https://github.com/thmslmr/scikitlearn-examples
💡Scikit Learn examples - Python
Last synced: 03 May 2026
https://github.com/srilaasya/breast-cancer-classifier
Used several Python libraries to make a K-Nearest Neighbor classifier that is trained to predict whether a patient has breast cancer
knearest-neighbor-classifier python scikit-learn
Last synced: 03 May 2026
https://github.com/stefan-500/ml-network-traffic-classification-unsw-nb15
Network traffic classification using Machine Learning
classification machine-learning network-traffic-classification python scikit-learn support-vector-machines svc-model unsw-nb15-dataset
Last synced: 03 May 2026
https://github.com/ceodaniyal/telecom_customer_churn_prediction
A machine learning project that predicts whether a telecom customer will churn (leave the service) using customer demographics, account information, and service usage. The repository includes data preprocessing, model training (with logistic regression), feature scaling, and example predictions.
classification customer-churn-prediction data-science logistic-regression machine-learning ml-project pandas prediction python scikit-learn streamlit telecom
Last synced: 04 May 2026
https://github.com/abdullahalzubaer/feature-selection-ranking
In-depth analysis regarding feature selection and ranking.
feature-ranking feature-selection random scikit-learn
Last synced: 04 May 2026
https://github.com/vyclarks/gestational-diabetes-prediction-ml
Predicting gestational diabetes from the Pima dataset — Python (scikit-learn); reproducible notebook, metrics, and report.
healthcare-analysis machine-learning python scikit-learn
Last synced: 04 May 2026
https://github.com/danielwohlr/delivery_time_series
Time series forecasting of food delivery service data
forecasting-time-series python scikit-learn
Last synced: 04 May 2026
https://github.com/homebackend/pdf-title-page-splitter
Splits a pdf based on identified title pages using ML trained model
machine-learning opencv pdf-splitter pdf2image pypdf2 scikit-learn tensorflow
Last synced: 04 May 2026
https://github.com/marionchaff/real-estate-price-prediction-france
Real estate price prediction using French public database DVF
data-analysis dvf-data machine-learning price-prediction python real-estate scikit-learn
Last synced: 04 May 2026
https://github.com/mariiasam/stroke-prediction
A model for predicting the risk of stroke in a patient
balanced-random-forest-classifier decission-tree-classifier gradient-boosting imbalanced-learning joblib logistic-regression matplotlib numpy random-forest-classifier scikit-learn seaborn streamlit
Last synced: 04 May 2026
https://github.com/bhawnamehbubani/airline-passenger-referral-program-development-with-classification-techniques
Prediction of airline passenger referrals using Logistic Regression, GridSearchCV, and TF-IDF vectorization with Python, Pandas, Scikit-learn, and Excel.
excel gridsearchcv logistic-regression pandas python3 scikit-learn tf-idf-vectorization
Last synced: 04 May 2026
https://github.com/suguru-n/temp_easyai
学部生向け機械学習体験プログラム
google-colab jupyter-notebook linearregression python scikit-learn
Last synced: 04 May 2026
https://github.com/drod75/nyc-arrests-analysis
This is a simple Data Science Project made to analyze and display data and trends found within the NYC Arrests Year to Date Dataset.
data-analysis data-visualization folium jupyter-notebook matplotlib-pyplot nyc-opendata nypd python scikit-learn seaborn
Last synced: 04 May 2026
https://github.com/chathumiamarasinghe/nn-training-model
A comprehensive project for training neural networks to solve real-world problems. This repository includes customizable code for building, training, and evaluating neural network architectures using popular deep learning frameworks.
jupyter-notebook matplotlib numpy phyton scikit-learn
Last synced: 04 May 2026
https://github.com/aqueeqazam/machine-learning-using-scikit
This repository contains all of the algorithms used to train the machine learning models using the Scikit library.
Last synced: 04 May 2026
https://github.com/sxv357/xtern-artificial-intelligence-work-based-assessment
This application takes in data regarding undergraduate college students in the state of Indiana such as their year, what major they're pursuing, which university they attend, and makes a prediction about their food order.
jupyter-notebook matplotlib pandas pickle scikit-learn seaborn
Last synced: 05 May 2026
https://github.com/navindafernando/face-recognition
Training a SVM Pipeline for Face Recognition :anger:
classification haar-cascade-classifier joblib machine-learning numpy opencv pipeline principal-component-analysis scikit-learn ski support-vector-machines
Last synced: 05 May 2026
https://github.com/thekartikeyamishra/resumeevaluatorapp
The Automated Resume Evaluator is a Python-based application that helps evaluate resumes against job descriptions. It calculates an Applicant Tracking System (ATS) score, which is the percentage of keywords from the job description found in the resume.
flask machine-learning matplotlib nlp nltk pypdf python scikit-learn spacy textblob
Last synced: 05 May 2026
https://github.com/himanshkr03/comparative_performance_on_fashionmnist
This repository explores various machine learning and deep learning models for classifying images from the Fashion MNIST dataset. It includes data exploration, model training, evaluation, and visualization techniques to gain insights into the classification task.
deep-learning fashion-mnist fine hybrid-model image-classification keras machine-learning scikit-learn tensorflow xgboost-algorithm
Last synced: 05 May 2026
https://github.com/s-matke/eco-forecast
Machine learning model used for predicting European country with most green surplus energy generated
data-science green-energy machine-learning scikit-learn supervised-learning
Last synced: 05 May 2026
https://github.com/antonio-f/ridge_regression_example
Easy Ridge Regression example
easy matplotlib-pyplot polynomial-regression python ridge-regression scikit-learn tutorial
Last synced: 05 May 2026
https://github.com/celineboutinon/bookworms
OpenClassrooms Data Analyst 2022-2023 - Projet 6
apriori-algorithm data-analysis data-analytics data-visualisation dataframes matplotlib-pyplot mlxtend numpy pandas python scikit-learn scikit-posthocs scikitlearn seaborn statsmodels
Last synced: 05 May 2026
https://github.com/zafir100100/cancer-stage-prediction
This code predicts cancer data using various regression models, calculates their average R-squared scores, and prints the best model.
cross-validation data-analysis data-preprocessing decision-trees gradient-boosting linear-regression machine-learning-algorithms numpy pandas random-forest regression scikit-learn
Last synced: 05 May 2026
https://github.com/markdouthwaite/lingo-demo
A demo project showing how to effectively deploy Scikit-Learn Linear Models in Go into Google Cloud Run.
go golang google-cloud-platform python scikit-learn
Last synced: 05 May 2026
https://github.com/teja-1403/coursera-machine-learning-with-python-honors
This project involves building a classifier to predict rainfall for the next day based on weather data from the Australian Government's Bureau of Meteorology. Various machine learning techniques such as Linear Regression, KNN, Decision Trees, Logistic Regression, and SVM were implemented and evaluated.
classification hierarchical-clustering machine-learning regression scikit-learn scipy
Last synced: 05 May 2026
https://github.com/zuhairzia/customer-segmentation
📖 About Customer Segmentation using KMeans clustering to analyze demographics, income, and spending. Helps businesses with targeted marketing and customer insights.
joblib matplotlib numpy pandas scikit-learn seaborn
Last synced: 05 May 2026
https://github.com/rohra-mehak/sciencesync
System for Personalized Google Scholar Alerts Processing and Data Management, and provision of ML based clustering analysis
agglomerative-clustering clustering crossref-api customtkinter google-api google-scholar graph-api machine-learning numpy pandas python3 scientific-article-analysis scikit-learn sqlite3
Last synced: 05 May 2026
https://github.com/rohansardar/iris_flower
A basic ML project on the iris flower classification
data-science iris-classification iris-dataset ml python scikit-learn
Last synced: 05 May 2026
https://github.com/gbourniq/cnn-multiclass-classification-gear
Using Machine Learning and Deep Learning to predict the category of outdoor equipment
image-classification keras-tensorflow multiclass-classification python scikit-learn tensorboard-visualizations
Last synced: 05 May 2026
https://github.com/aryar-06/linear-regression
A Python project demonstrating basic linear regression with gradient descent and matrix operations, alongside scikit-learn comparison.
data-analysis data-preprocessing educational-project gradient-descent linear-regression machine-learning python regression-algorithms scikit-learn
Last synced: 05 May 2026
https://github.com/antoniskl/un-general-debate-corpus-classification
The aim of this project is to classify UNGDC speeches with regards to climate change. As a secondary objective, a correlation is being examined between these speeches, the forestation and the happiness index of the countries.
classification data-science jupyter-notebook machine-learning nlp python regression scikit-learn text-classification text-preprocessing
Last synced: 05 May 2026
https://github.com/kefrankk/ml-fraud-detection
I built a predictive model to detect fraud in financial transactions.
Last synced: 05 May 2026
https://github.com/debangshu9183/spam-mail-detection
To detect spam mail
logistic-regression machine-learning numpy pandas scikit-learn seaborn
Last synced: 05 May 2026
https://github.com/sevilaymuni/project-no.6-tree-based-models
Random Forest Assisted Suggestions for Salifort Motors Employee Retention: Plan, Analyze, Construct and Execute
data-science decision-trees evaluation-metrics gridsearchcv logistic-regression machine-learning matplotlib python random-forest-classifier scikit-learn seaborn-plots
Last synced: 05 May 2026
https://github.com/pjj11005/ml_with_pytorch_study
[머신 러닝 교과서: 파이토치 편] -> 학습한 코드 저장소
deep-learning graph-neural-networks machine-learning neural-networks pytorch scikit-learn transformer
Last synced: 06 May 2026
https://github.com/fahrettinsolak/ai-based-salary-scale-calculation-project
This project demonstrates a Polynomial Regression model using a dataset related to experience and salary. The model is built using Python with the pandas, matplotlib, and sklearn libraries. The dataset includes information on years of experience and corresponding salary.
artificial-intelligence deep-learning jupyter-notebook machine-learning matplotlib pandas pyhton scikit-learn
Last synced: 05 May 2026
https://github.com/samia35-2973/living-type-classification-from-codon-usage
Machine learning project to classify living types based on codon usage data using Random Forest and XGBoost classifiers.
classification codon-usage data-cleaning data-preprocessing excel exploratory-data-analysis living-type machine-learning python random-forest-classifier scikit-learn supervised-learning xgboost-classifier
Last synced: 06 May 2026
https://github.com/keneandita/iris-intel
Iris Flower Classifier is a simple web app built with Streamlit that predicts the species of an Iris flower based on user-input flower features. It uses pre-trained machine learning models including Logistic Regression, K-Nearest Neighbors, SVM, and Decision Tree to make real-time predictions.
iris-classification jupyter-notebook machine-learning python scikit-learn streamlit
Last synced: 06 May 2026
https://github.com/nicolas-giacomelli/modelo_regressao_linear_vendas
Modelo de regressão linear para previsão de vendas Desafio do curso de IA da RocketSeat
matplotlib pandas python3 scikit-learn
Last synced: 06 May 2026
https://github.com/billgewrgoulas/recommendation-systems
Algorithms for joke rating prediction using the joke data-set from Kaggle.
algorithm clustering collaborative-filtering machine-learning numpy pandas recommender-system scikit-learn scypi
Last synced: 06 May 2026
https://github.com/kaoutarmi/predition_price-old-cars
Ce projet de prédiction du prix des voitures utilise l’apprentissage automatique pour estimer la valeur des véhicules en fonction de leurs caractéristiques.
car-price-prediction data-preprocessing data-science decision-tree feature-engineering machine-learning regression scikit-learn
Last synced: 06 May 2026
https://github.com/andrewsy1004/logistic-regression-spam-classifier
This project implements a spam email classifier using Logistic Regression.
Last synced: 06 May 2026
https://github.com/anu-gtb/campusplacement
Link to Streamlit app :
binary-classification campus-placements data-science data-visualization machine-learning matplotlib ml-algorithms predictive-modeling python3 scikit-learn seaborn statistics streamlit
Last synced: 06 May 2026
https://github.com/5hraddha/optimize-oil-well-locations
In the quest for harnessing valuable energy resources, the OilyGiant mining company wants to expand its operations by discovering new oil well locations. To achieve this, a data-driven approach is adopted, leveraging geological exploration data from three distinct regions and employing techniques in data analysis and modeling.
linear-regression numpy pandas scikit-learn supervised-learning
Last synced: 06 May 2026
https://github.com/felipesbonatti/case-credit-risk-prediction
Projeto de classificação de risco de crédito construído com Python, Scikit-learn e Pandas. Demonstra um fluxo de trabalho de Machine Learning de ponta a ponta: pré-processamento de dados, feature engineering, treinamento de múltiplos algoritmos e avaliação de performance com métricas como AUC-ROC.
credit-risk machine-learning predictive-modeling python scikit-learn
Last synced: 06 May 2026
https://github.com/msikorski93/predicting-prices-on-king-county-housing-dataset
Predicting house prices using different regression analysis models.
catboost eda gradient-boosting king-county lightgbm linear-regression machine-learning neural-network polynomial-regression real-estate regression-models scikit-learn tensorflow xgboost
Last synced: 06 May 2026
https://github.com/pimakarov/textkd-p4-fewshot-distilbert
📊 Compare few-shot text classification with DistilBERT and TF-IDF + SVM using IMDB data, analyzing performance across various sample sizes.
bert distilbert few-shot-learning nlp python pytorch scikit-learn text-classification transfer-learning trasformer
Last synced: 06 May 2026
https://github.com/dwade-eng/customer-lead-conversion-analysis
This project explores a real-world lead conversion dataset, using a structured machine learning pipeline to classify leads into likely or unlikely converters. It includes complete steps from data wrangling and visualization to feature engineering and model evaluation.
html matplotlib pandas python3 scikit-learn seaborn
Last synced: 06 May 2026
https://github.com/ejw-data/ml-playground
Testing the limitations, inabilities, and strengths of models with synthetic data
machine-learning python scikit-learn
Last synced: 06 May 2026
https://github.com/douglas-data-analyst/predictive-analysis
Modelo preditivo para previsão de vendas usando scikit-learn e machine learning
data-science machine-learning predictive-analytics python sales-forecasting scikit-learn time-series
Last synced: 06 May 2026
https://github.com/lintangwisesa/pdb_mti_ui_lab1_k6
Tugas Lab 1 Pengelolaan Data Besar MTI UI 2023
machine-learning python3 scikit-learn
Last synced: 06 May 2026
https://github.com/kartheekdama/salary-prediction
This salary prediction model leverages machine learning techniques, including Random Forest, Decision Tree, and Linear Regression, to estimate salaries based on individual attributes such as age, gender, education level, job title, and years of experience. The Random Forest model outperforms the others, achieving the highest R-squared score.
decision-tree exploratory-data-analysis feature-importance linear-regression machine-learning random-forest scikit-learn
Last synced: 06 May 2026
https://github.com/sahilmate/ebm-breast-cancer-classifier
This repository implements an Explainable Boosting Machine (EBM) model for breast cancer classification using scikit-learn and interpret. The project includes data preprocessing, model training, accuracy evaluation, and feature importance visualization.
breast-cancer-classification data-visualization explainable-boosting-machine feature-importance interpret machine-learning scikit-learn
Last synced: 06 May 2026
https://github.com/josepablodmg/python--linear-regression-advertising
A linear regression analysis to predict sales based on advertising spending across TV, radio, and newspaper channels. The project includes exploratory data analysis, model training, coefficient visualization, and residual analysis.
advertising data-analysis exploratory-data-analysis linear-regression machine-learning python regression scikit-learn visualization
Last synced: 06 May 2026
https://github.com/avtorgenii/ml-playground
A repository for exploring and experimenting with datasets, building machine learning models, and testing various techniques in data preprocessing, feature engineering, and model evaluation.
matplotlib ml pandas scikit-learn
Last synced: 06 May 2026
https://github.com/galaxy092/samsung-innovation-campus-big-data-capstone-project
Samsung Innovation Campus Big Data Capstone Project - Weather Prediction
hadoop jupyter-notebook pandas pyspark scikit-learn sparksql
Last synced: 06 May 2026
https://github.com/samudraneel05/stanford-open-policing
The Stanford Open Policing Project (SOPP) aims to bring transparency to police interactions by collecting and analyzing data on traffic stops across the United States. It accumulates a vast dataset on traffic stops, encompassing details such as demographics, location, and outcomes.
clustering heirarchical-clustering k-means-clustering machine-learning matplotlib pandas python scikit-learn
Last synced: 06 May 2026
https://github.com/jbizzlefoshizzle/ibm_capstone_project
Used K-means clustering and mapping libraries to determine best cities in San Diego to open a Mexican restaurant
beautifulsoup4 folium-maps geopy pandas-python scikit-learn
Last synced: 06 May 2026
https://github.com/taquynhnga2001/regression-calories-burnt-prediction
Develop regression models which can predict the total calories a person has burnt during a workout based on some biological measures.
machine-learning python regression-analysis scikit-learn
Last synced: 07 May 2026
https://github.com/kirillshiryaev61/customer_activity_prediction
Прогнозирование снижения покупательской активности в интернет-магазине. Модель на основе ML выявляет клиентов с риском оттока для повышения удержания. Учебный проект.
jupyter pandas python scikit-learn
Last synced: 07 May 2026
https://github.com/ayaarbi/prediction_des_maladies_cardiovasculaires_avec_ml
Ce projet, développé au sein de cours de Machine Learning, utilise des algorithmes de classification supervisée pour prédire la présence de maladies cardio-vasculaires à partir de données médicales publiées sur Kaggle.
cardiovascular-diseases jupyter-notebook machine-learning matplotlib pandas python scikit-learn
Last synced: 07 May 2026
https://github.com/z-fran/walmart-store-sales-forecasting
Data analysis and machine learning solution in Python for the Kaggle competition Walmart Recruiting - Store Sales Forecasting.
machine-learning sales-analysis sales-forecasting sales-prediction scikit-learn walmart-sales-forecasting
Last synced: 07 May 2026
https://github.com/garimarao24/customer-churn-project
This repository contains a Customer Churn Prediction project that leverages Machine Learning techniques to predict customer churn and segment customers using clustering.
customer-churn kmeans-clustering logistic-regression machine-learning pca scikit-learn
Last synced: 07 May 2026
https://github.com/tony123105/comp4423_garbage_classification
Garbage classification using traditional machine learning approaches (HOG, LBP, SIFT features with SVM, KNN, Random Forest classifiers) and an ensemble method to categorize waste into 10 types.
computer-vision feature-extraction garbage-classification hog image-classification knn lbp machine-learning opencv python random-forest scikit-learn sift svm
Last synced: 07 May 2026
https://github.com/saswatamcode/datascienceapi
This is a RESTful API built using Flask and Scikit-Learn. It provides a host of Classification and Regression algorithms that can be used readily and returns results in the form of predictions, confusion matrices, accuracy scores and more.
api flask ml python3 scikit-learn
Last synced: 07 May 2026
https://github.com/sumdiboii/loan-prediction-decision-trees
A Decision Tree Classifier was implemented to predict personal loan acceptance using a dataset of 5,000 customers. Key features included income, education, mortgage, and credit card usage. The model achieved 97% accuracy, with 92% precision and 76% recall for positive loan predictions, validated using a classification report and confusion matrix.
classification data-visualisation decision-trees loan-prediction machine-learning python scikit-learn supervised-learning
Last synced: 07 May 2026
https://github.com/jonperk318/machine-learning-in-python
ML models built from scratch in Python 3.9.13
classification clustering feature-extraction jupyter-notebook linear-classification linear-regression machine-learning mnist pca principal-component-analysis python scikit-learn
Last synced: 07 May 2026
https://github.com/alphacrypto246/titanic-survival
This project leverages machine learning techniques to predict passenger survival in the Titanic disaster using the Kaggle Titanic dataset. It includes data preprocessing, exploratory data analysis (EDA), and model building with algorithms like Logistic Regression and Random Forests to achieve reliable predictions.
logistic-regression machine-learning machine-learning-algorithms python scikit-learn scikitlearn-machine-learning
Last synced: 07 May 2026
https://github.com/dynle/2020f-ml
2020F Keio University - Machine Learning Laboratory
machine-learning python scikit-learn
Last synced: 07 May 2026
https://github.com/mwasifanwar/automl_framework
Comprehensive AutoML framework that automates data preprocessing, feature engineering, model selection, hyperparameter tuning, and deployment. Features neural architecture search and automated data cleaning pipelines.
automl automl-algorithms data-science data-science-projects feature-engineering feature-engineering-algorithm feature-engineering-ml hyperparameter-optimization machine-learning machine-learning-algorithms machine-learning-models mlops mlops-workflow python scikit-learn scikit-learn-python
Last synced: 07 May 2026
https://github.com/jimmymugendi/bulding-a-decision-tree-to-predict-customer-churn
This repo desribes bulding a decision tree to predict customer churn in a given organisation
accuracy-score decision-tree-classifier matplotlib-pyplot numpy pandas-dataframe scikit-learn
Last synced: 07 May 2026
https://github.com/ynsrc/machine-learning
Machine Learning Examples
classification data-science machine-learning machine-learning-algorithms matplotlib nlp nlp-machine-learning regression scikit-learn sklearn
Last synced: 07 May 2026
https://github.com/mark-mdo47/family-machine-learning-project-2017
We are doing a two-part Machine Learning project this summer with SciKit-Learn and Keras/TensorFlow
machine-learning python scikit-learn tensorflow
Last synced: 07 May 2026
https://github.com/henrytseng/example_docker_scikit-learn
A quick example of using Scikit-Learn from a Docker container
Last synced: 08 May 2026
https://github.com/jahanostg/linear-regression_ml-algorithm
Linear Regression Algorithm
colab-notebook matplotlib numpy pandas scikit-learn seaborn
Last synced: 08 May 2026
https://github.com/amiegirl/fellowship_ai
Sentiment Analysis of Movies Dataset
decision-tree-classification logisticregression matplotlib pandas random-forest-classification scikit-learn wordcloud
Last synced: 08 May 2026
https://github.com/thekartikeyamishra/data-preprocessor
A Google Colab module for interactive data preprocessing. Handles missing values, categorical encoding (One-Hot, Label), and numerical scaling (Standard, MinMax). Outputs a cleaned dataset
ipywidgets numpy pandas python scikit-learn
Last synced: 08 May 2026
https://github.com/jatin-mehra119/churn_modeling
This repository is dedicated to predicting customer churn using machine learning techniques. It includes comprehensive scripts for data preprocessing, model training, and evaluation, along with detailed visualizations and insights.
classification-model datavisualization pandas scikit-learn
Last synced: 08 May 2026
https://github.com/msikorski93/detecting-panic-disorder
Panic disorder detecting using machine learning techniques.
artificial-neural-networks classification knn logistic-regression machine-learning panic-disorder random-forest scikit-learn sgd svm tensorflow xgboost
Last synced: 08 May 2026
https://github.com/samkazan/fraud-detection-ml
Machine learning models for enhanced fraud detection in e-commerce transactions, exploring feature engineering, distance prediction, and clustering analysis.
clustering data-science data-visualization dataanalytics dbscan eda hierarchical-clustering kmeans-clustering knn-imputer matplotlib mlxtend python scikit-learn seaborn xgboost
Last synced: 08 May 2026
https://github.com/oriolventur/assignment-2-model-creation
Assignment 2 from Artificial Intelligence 1 course: Model creation using synthetic data and scikit-learn.
jupyter-notebook model-creation python scikit-learn
Last synced: 08 May 2026
https://github.com/shahzadmustafa15/dbscan-clustering
DBSCAN clustering algorithm applied on synthetic non-linear data (make_moons dataset).
data-science data-visualization dbscan-clustering density-based-clustering machine-learning ml-projects python scikit-learn unsupervised-learning
Last synced: 08 May 2026
https://github.com/labex-labs/supervised-learning-regression
Supervised Learning: Regression | This repo collects 7 of programming labs exercises for Supervised Learning: Regression. Supervised learning. If you are hearing or reading this term for the first time, then it may be completely unclear what it means. Don't worry. In this lab, you will get a comp...
challenges course exercises hands-on labex labs machine-learning playgroud programming scikit-learn
Last synced: 08 May 2026
https://github.com/aasjunior/mlapp-api
Esta API fornece endpoints para aplicar algoritmos de aprendizado de máquina, como K-Nearest Neighbors (KNN), Árvore de Decisão e Algoritmo Genético. Realizado como tarefa da disciplina de Laboratório Mobile/Computação Natural no 5º Semestre de Desenvolvimento de Software Multiplataforma.
fastapi machine-learning python scikit-learn
Last synced: 09 May 2026
https://github.com/vijaykumarr1452/customer-churn-prediction
Analysis the data of telecom company and insights gained to reduce customer churn.
anaconda jupyter-notebook machine-learning pandas prediction scikit-learn
Last synced: 09 May 2026
https://github.com/radoslawregula/iris-classification
Jupyter notebook implementing an efficient machine learning method to classify flowers from the Iris data set.
classification iris-dataset jupyter-notebook machine-learning python scikit-learn softmax-classifier
Last synced: 09 May 2026
https://github.com/santiagoasp98/spam-detection
SMS spam detection using Logistic Regression and Multinomial Naive Bayes.
classification logistic-regression machine-learning multinomial-naive-bayes python scikit-learn spam-detection
Last synced: 09 May 2026
https://github.com/alphacrypto246/employee-attrition
This project analyzes employee attrition data to uncover key factors driving employee turnover. Using Python, it employs data preprocessing, exploratory data analysis, and machine learning models to predict attrition and provide actionable insights for improving employee retention strategies.
decision-tree-classifier machine-learning machine-learning-algorithms python scikit-learn scikitlearn-machine-learning
Last synced: 09 May 2026
https://github.com/akwardhan/loan-default-prediction-xgboost-streamlit
Full-scale loan default prediction system using XGBoost, trained on 1.3M LendingClub loans. Includes feature-rich preprocessing, class imbalance handling, recall-focused ML pipeline, and Streamlit web deployment for real-time borrower risk scoring.
credit-risk data-science google-colab loan-default-prediction machine-learning python real-world-project scikit-learn streamlit xgboost
Last synced: 09 May 2026
https://github.com/callmerajesh/ames-housing-price-prediction
Predicting house prices using Decision Tree Regressor on the Ames dataset
ames-housing data-science decision-tree machine-learning python regression scikit-learn
Last synced: 09 May 2026
https://github.com/payall03/spam-mail-detection
A Web App for Detecting Spam Messages using Machine Learning | Flask · TfidfVectorizer · Naive Bayes
css deploy-to-render flask html machine-learning ml-project mlproject naive-bayes naive-bayes-classifier natural-language-processing nlp python scikit-learn spam-detection spam-filter text-classification tfidf-joblib webapp
Last synced: 09 May 2026