scikit-learn
scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.
- GitHub: https://github.com/topics/scikit-learn
- Wikipedia: https://en.wikipedia.org/wiki/Scikit-learn
- Repo: https://github.com/scikit-learn/scikit-learn
- Created by: David Cournapeau
- Released: January 05, 2010
- Related Topics: scikit, python,
- Aliases: sklearn,
- Last updated: 2026-06-25 00:23:58 UTC
- JSON Representation
https://github.com/visu512/spotify-music-recommender
Built Spotify Music recommendation system using Machine learning
csv-files dataset jupyter-notebook kmeans-clustering machine-learning matplotlib numpy panda python scikit-learn spotify-api stream vscode
Last synced: 12 Apr 2026
https://github.com/aleksandarbuk/machine-learning
The Machine Learning Library repository provides a collection of scripts and tools leveraging Scikit-Learn, Pandas, and NumPy for various machine learning tasks and data analysis.
matplotlib numpy python scikit-learn tensorflow
Last synced: 16 Apr 2026
https://github.com/gangula-karthik/bank-transaction-classification
Classifying bank transactions with precision—your first step towards smarter finance management 💳🤖📊
finance machine-learning nlp scikit-learn
Last synced: 09 Apr 2025
https://github.com/shivsharcode/diabetes-prediction-new
DIABETES PREDICTION MODEL
flask jupyter-notebook logistic-regression machine-learning matplotlib numpy pandas python scikit-learn
Last synced: 12 Apr 2026
https://github.com/programmersteve/price_regression_ml
jupyter machine-learning matplotlib numpy pandas python scikit-learn
Last synced: 12 Apr 2026
https://github.com/linggarm/covid-19-tweets-classification
Classification of Covid-19 Tweets using Multinomial Naive Bayes and TF-IDF Vectorizer to categorize tweets about Covid-19 into three main classes
artificial-intelligence classification covid-19 data-science google-colab machine-learning multinomial-naive-bayes naive-bayes natural-language-processing nlp python sastrawi scikit-learn text-classification tf-idf tweepy tweets tweets-classification twitter vaccine
Last synced: 11 May 2026
https://github.com/hrolive/recommendation-systems-ibm
Analyze the interactions that users have with articles on the IBM Watson Studio platform and make recommendations to them about new articles, using various recommendation engines.
machine-learning natural-language-processing pandas python recomendation-system scikit-learn
Last synced: 12 Apr 2026
https://github.com/abdelrahman-amen/housing-price
Predicting housing prices with machine learning regression models. This project implements Linear Regression, Random Forest, and Decision Tree models for accurate predictions.
decision-tree housing-price-prediction linear-regression machine-learning python random-forest regression-analysis scikit-learn
Last synced: 07 May 2026
https://github.com/murugavl/crop-prediction
This Crop Prediction System utilizes machine learning to recommend suitable crops based on environmental data. It helps farmers make informed decisions by analyzing factors like soil type and climate. The system aims to enhance agricultural efficiency and productivity.
flask machine-learning python scikit-learn
Last synced: 12 Jun 2025
https://github.com/mianmharoon/sentimentanalysis_coreml_emotionclassifier
Emotion classification iOS app using CoreML and SwiftUI – demo for sentiment and emotion analysis, with the model converted from Scikit-learn using coremltools.
ai coreml coreml-models emotionclassification ios machinelearning nlp python3 scikit-learn sentimentanalysis swift swiftui
Last synced: 12 Apr 2026
https://github.com/samkazan/structural_discovery_of_macromolecules_data_analysis
This research project uses machine learning techniques and neural network to uncover key factors that contribute to successful protein structure discovery using Python and R
classification clustering ipython-notebook jupyter-notebook keras-neural-networks keras-tensorflow machine-learning neural-network numpy python r rmarkdown scikit-learn scipy tensorflow
Last synced: 02 Feb 2026
https://github.com/hyzhak/scikit-learn-experiments
scikit-learn experiments
data-science scikit-learn sklearn
Last synced: 27 Apr 2026
https://github.com/mpoojithavigneswari/sentiment-analysis
The primary goal of this project is to build a sentiment analysis model that can predict the sentiment of a given review (positive or negative).
deep-learning keras machine-learning nlp python rnn-lstm scikit-learn tensorflow
Last synced: 04 Feb 2026
https://github.com/sravyatogarla/movie-recommendation-system
A complete Movie Recommendation System project implementing Popularity-Based, Content-Based, and Collaborative Filtering models using the MovieLens dataset. Built with Python, Pandas, and Plotly, featuring interactive inputs and visualizations.
capstone-project collaborative-filtering content-based-filtering data-science data-visualization edureka jupyter-notebook machine-learning movie-recomendation-system movielens pandas popularity-based-filtering python recommender-system scikit-learn sql
Last synced: 13 Apr 2026
https://github.com/murugavl/loan_approval_prediction
This project is a Loan Approval Prediction System that uses Machine Learning to determine whether a loan application should be approved or rejected based on various factors. It is deployed using Streamlit for an interactive user experience.
jupyter-notebook machine-learning numpy pandas python random-forest-classifier scikit-learn stremlit
Last synced: 13 Apr 2026
https://github.com/karimosman89/fraud-detection
Identify fraudulent transactions.Develop a machine learning model that can identify fraudulent transactions in credit card data or insurance claims.Show how your model can reduce financial losses by flagging suspicious activities.
anomaly-detection lightgbm pandas python scikit-learn xgboost
Last synced: 13 Apr 2026
https://github.com/mahdibehoftadeh/polynomial-regression-co2-emissions
A simple machine learning polynomial regression using a large dataset to learn and predict CO2 emission of a car by its built features like engine size and cylinders
machine-learning matplotlib numpy nural-network pandas polynomial-regression python scikit-learn
Last synced: 22 Feb 2026
https://github.com/connergroth/timbrality
Hybrid music recommender combining NMF collaborative filtering, two-tower content embeddings, audio feature synthesis, and meta-learning fusion for adaptive personalization.
beautifulsoup cloudscraper docker fastapi lastfm-api machine-learning music postgresql python pytorch redis scikit-learn spotify-api supabase
Last synced: 13 Apr 2026
https://github.com/murugavl/customer_churn_prediction
Customer churn prediction is a crucial task in business analytics, helping companies identify customers who are likely to leave and take preventive actions. This project utilizes Machine Learning (ML) techniques to analyze customer behavior and predict churn.
jupyter-notebook machine-learning matplotlib numpy pandas pickle python scikit-learn streamlit
Last synced: 13 Apr 2026
https://github.com/jpc963/previsao-de-vendas
Previsão de vendas que uma determinada empresa faria com base nas mídias utilizadas
jupyter-notebook machine-learning matplotlib pandas python scikit-learn seaborn
Last synced: 13 Apr 2026
https://github.com/mostafa-ghaith/deadlift-o-meter
Deadlift-o-Meter is a project that utilizes a Scikit-Learn model, Mediapipe, and Tkinter to count correct deadlift reps using a live webcam feed. The application analyzes the user's body movements and provides real-time feedback on their performance.
body-movement-classification computer-vision deadlift landmark-detection live-webcam-analysis mediapipe pose-estimation rep-counter scikit-learn tkinter
Last synced: 07 May 2026
https://github.com/andrewobwocha/titanicsurvival
🚢 End-to-end Python pipeline for Titanic survival classification. Demonstrates EDA, preprocessing, feature engineering, and Logistic Regression evaluation using Scikit-learn.
classification data-preprocessing data-visualization exploratory-data-analysis feature-engineering machine-learning pandas python scikit-learn titanic
Last synced: 13 Jun 2025
https://github.com/kwerulik/learning-ml-hands-on
My personal study repository for the "Hands-On Machine Learning with Scikit-Learn, Keras & TensorFlow" book. Contains code implementations, notes, and exercise solutions.
hands-on-machine-learning keras machine-learning python scikit-learn study-repository tensorflow
Last synced: 13 Apr 2026
https://github.com/taimoorkhan10/ai-model-collection-zoo
A comprehensive collection of various AI models, accessible via a unified RESTful API and a user-friendly web interface.
artificial-intelligence cicd computer-vision data-science deep-learning docker examples keras machine-learning modeldeployment natural-language-processing neural-networks python pytorch restful-api scikit-learn tensorflow testing webapplication
Last synced: 13 Apr 2026
https://github.com/hilarionengarejr/movie-recommender-app
Sentiment analysis on user reviews for movie recommendations using Content Based Filtering.
docker flask nltk numpy pandas python3 scikit-learn selenium
Last synced: 10 Apr 2026
https://github.com/pratyush905/farecast-nyc-taxifare-predictor
Machine learning models to predict nyc taxi fare based on given dataset
jupiter-notebook kaggle machine-learning matplotlib numpy python regression-models scikit-learn
Last synced: 13 Apr 2026
https://github.com/eesunmoon/aml_project
[Project] Analyzing Fitness Patterns & Building Predictive Models
matplotlib numpy pandas python3 recommender-system scikit-learn seaborn tensorflow
Last synced: 13 Apr 2026
https://github.com/lucianoscarpaci/news-data-classification
Using the Reuters dataset, this example illustrates the process of data preprocessing, model definition and training, and performance evaluation.
keras model-definition model-training performance-evaluation preprocessing-data reuters scikit-learn seaborn tensorflow
Last synced: 13 Apr 2026
https://github.com/kizman-23/supervised_models
Classical prediction of future data using models trained by labeled data
numpy pandas scikit-learn supervised-machine-learning
Last synced: 13 Apr 2026
https://github.com/adarshpheonix2810/resume-screening
This project is designed to classify resumes into different job categories based on their content. Users can upload resumes in PDF, DOCX, or TXT format, and the application will predict the category using a pre-trained machine learning model. It aims to enhance the job application process for candidates and recruiters alike.
data-science machine-learning natural-language-processing python resume-classification scikit-learn streamlit
Last synced: 13 Apr 2026
https://github.com/busradeveci/student-performance-prediction
A machine learning project to predict student exam performance based on academic, social, and personal features. Built with Python and scikit-learn.
data-analysis kaggle linear-regression machine-learning predictive-modeling python scikit-learn student-performance
Last synced: 25 Apr 2025
https://github.com/psyhackological/heartbeat
:heart: Get your heart racing with HeartBeat! AI-ML project predicts heart disease for a cardio workout for your data and health.
heart machine-learning pandas python scikit-learn
Last synced: 13 Jun 2025
https://github.com/sahildev23/predictive-modeling
ML pipeline with automated preprocessing, cross-validation, and performance visualization. Achieved 28% accuracy improvement on 100k+ records
classification cross-validation data-science machine-learning python scikit-learn
Last synced: 13 Apr 2026
https://github.com/somenath203/movie-recommender-system
Click below to checkout the website
content-based-recommendation cosine-similarity huggingface-spaces movie-recommender-system python recommender-system scikit-learn streamlit streamlit-webapp
Last synced: 13 Apr 2026
https://github.com/prakharchoudhary/mlchallenge-2
My submission for machine learning challenge #2, organised by hackerEarth.
adaboost gradient-boosting-classifier jupyter-notebook machine-learning python scikit-learn
Last synced: 13 Apr 2026
https://github.com/tusharpandey003/iris-flower-classification
Iris flower classification using KNN and Random forest algorithm
data-science iris iris-classification iris-data iris-dataset iris-detection iris-flower-classification iris-flowers knn-classification machine-learning-algorithms random-forest scikit-learn streamlit
Last synced: 13 Apr 2026
https://github.com/sanchariii/order_amt_prediction
Order Amount Prediction is a machine learning project that predicts customer order amounts based on past behavior. It includes milestones for data cleaning, exploratory data analysis, feature engineering, and model building. The framework can be customized to suit specific needs and provides insights for better decision-making.
jupyter-notebook machine-learning python scikit-learn
Last synced: 13 Apr 2026
https://github.com/danishtalpur/sentiview-website
SentiView is a sentiment analysis tool designed to analyze and interpret the emotions behind tweets on Twitter. The platform processes textual data from user-generated tweets to determine the sentiment behind them—whether they are positive, negative, or neutral.
css flask html java naive-bayes-classifier scikit-learn twitter-sentiment-analysis
Last synced: 16 Apr 2026
https://github.com/joewlos/fantasy_football_monte_carlo_draft_simulator
Monte Carlo Fantasy Football Draft Simulator Featuring FastAPI, NextUI, and ODMantic
fantasy-football monte-carlo nextjs nextui odmantic pydantic python scikit-learn
Last synced: 13 Apr 2026
https://github.com/kianoushamirpour/end_to_end_text_classification
Developing feature engineering pipelines, building packages, automating tests, and creating FastAPI endpoints.
apache-airflow ci docker-compose factory-design-pattern fastapi feast grafana hyperopt mlflow prometheus pytorch scikit-learn tox transformers xgboost-classifier
Last synced: 08 Apr 2026
https://github.com/santoshn86/dlp-ev-system-for-pa-optimization
This system is a game-changer, enabling smarter energy management through predictive insights and personalized optimization strategies.
aiml django flask keras pytorch scikit-learn tensorflow typescript
Last synced: 13 Apr 2026
https://github.com/sorabh-kapoor/face-recognition-attendance-system
The Facial Recognition System is an AI-powered application USING FLASK designed to detect and recognize faces with high accuracy. This system can be integrated into various applications, including security systems, attendance management, and identity verification.
flask flask-application knn ml numpy opencv pandas python scikit-learn
Last synced: 13 Apr 2026
https://github.com/pramodyasahan/spaceship-titanic
This repository features a machine learning model designed to predict whether passengers of a space travel company are likely to be transported. The model employs CatBoostClassifier, a machine learning algorithm known for handling categorical data effectively.
machine-learning numpy pandas python scikit-learn
Last synced: 13 Apr 2026
https://github.com/muscaanmnmnm/breast-cancer-detector
A predictive model for breast cancer detection using K-Nearest Neighbors, demonstrating the impact of feature scaling on model performance and recall.
breast-cancer-wisconsin data-science feature-scaling jupyter-notebook knn-classification machine-learning pandas-dataframe python-3 scikit-learn
Last synced: 06 Sep 2025
https://github.com/1adore1/deadlock-match-tracker-bot
Telegram bot for tracking real-time Deadlock matches for top 250 players of the leaderboard. Fetches match data and predicts winners using a machine learning model.
aiogram api deadlock optuna pandas python scikit-learn
Last synced: 13 Apr 2026
https://github.com/adrianmarino/knn-cf-rec-sys
Similarity CF based RecSys examples
python recommender-system scikit-learn
Last synced: 08 May 2026
https://github.com/nikhilakki/predicting-the-gender-of-the-riders-of-new-york-s-citi-bikes
Predicting the Gender of the riders of New York Citi Bikes (2015-2017)
data-science decision-trees feature-engineering machine-learning pandas python scikit-learn
Last synced: 13 Apr 2026
https://github.com/grandechowhiskey/fcc-data_analysis-projects
A collection of projects completed as part of the FreeCodeCamp "Data Analysis with Python" certification. These projects cover statistical calculations, data visualization, and trend analysis using real-world datasets.
data-analysis data-visualization matplotlib pandas python3 scikit-learn seaborn
Last synced: 01 May 2026
https://github.com/otuemre/emailphishingdetection
A real-time phishing email detection system using Machine Learning (SVM, Logistic Regression, Naive Bayes) with FastAPI backend and custom domain deployment.
cybersecurity fastapi huggingface machine-learning nlp real-time scikit-learn spam-detection svm-classifier tfidf-vectorizer
Last synced: 13 Apr 2026
https://github.com/dineshh912/analysis_stock_price_data
Experiment analysis of stock price data with python3
data-analysis data-visualization financial-data python3 scikit-learn stock-price-prediction
Last synced: 24 Apr 2026
https://github.com/nicolascoiado/nivel-mar
Este projeto realiza uma análise detalhada do nível médio global do mar (GMSL), utilizando uma base de dados pública que abrange medições históricas. O objetivo é explorar tendências, calcular a taxa média de elevação e visualizar os dados por meio de gráficos.
google-colab jupyter-notebook matplotlib numpy pandas python python3 scikit-learn
Last synced: 11 Mar 2025
https://github.com/thinker84/real-time-stock-price-prediction-and-market-analysis-using-machine-learning
Real-time stock price prediction app using LSTM, Streamlit, and historical data (2010–2023). Forecasts next 10 days & visualizes trends.
data-science django lstm machine-learning numpy pandas pandas-datareader scikit-learn stock-market stock-price-prediction stooq streamlit yahoo-finance yahoo-finance-api
Last synced: 13 Jul 2025
https://github.com/hetuvpatel/brain-stroke-prediction
Machine Learning project for predicting stroke risk using healthcare data. Includes EDA, preprocessing, SMOTE, feature selection (RFE), evaluation of Logistic Regression, Decision Tree, Random Forest, KNN, SVM, and Stacked Ensemble models.
data-mining ensemble-learning healthcare machine-learning predictive-modeling python rfe scikit-learn smote
Last synced: 17 May 2026
https://github.com/pksvv/machinelearning_svm
Various implementations of Support Vector Machine Algo
machine-learning python scikit-learn support-vector-machine
Last synced: 04 May 2026
https://github.com/nirmaldeepponnada/codeclauseinternshipproject2
Python, NLTK, Scikit-Learn, Pandas, NumPy, Pickle, SciPy, and JSON are used for text preprocessing, feature engineering, multi-label classification, and model persistence.
nltk numpy pandas pickle python scikit-learn scipy
Last synced: 07 Apr 2026
https://github.com/mastermindromii/car-price-prediction-model
Here is My Regression Project based on Predicting Price of Car using Linear Regression.
linear-regression matplotlib numpy pandas python scikit-learn seaborn
Last synced: 13 Apr 2026
https://github.com/aml-hassan-abd-el-hamid/finding-donors-for-charityml
Predicting salary of the people based on various data about them
machine-learning python scikit-learn supervised-learning udacity-machine-learning-nanodegree
Last synced: 08 May 2026
https://github.com/codecraft-sanju/medvisionai-medical-image-ai-vision.
MedVisionAI is an AI-powered platform that analyzes ultrasound images to detect PCOS and provide actionable recommendations. Using CNN-based deep learning and generative AI, it ensures fast, accurate diagnosis, reduces errors, and supports clinicians with instant insights all while maintaining patient privacy and compliance.
deep-learning fastapi gemini-api genai keras-tensorflow machine-learning matplotlib python react scikit-learn seaborn tailwindcss tensorflow
Last synced: 07 Sep 2025
https://github.com/wilfordaf/ml-sect-introduction-task
Test task for students assosiation
classic-machine-learning keras machine-learning regression-models scikit-learn
Last synced: 28 Feb 2025
https://github.com/mmerlyn/analysis-of-tomato-prices
Forecasting tomato prices in Karnataka using machine learning to help farmers make better crop planning and selling decisions.
css flask html matplotlib numpy pandas python scikit-learn seaborn
Last synced: 06 Jul 2025
https://github.com/yanne0800/lung_cancer_prediction
This project predicts lung cancer risks using machine learning models like Random Forest, Logistic Regression, and SVM. It analyzes patient data with features such as age, smoking habits, and symptoms. Data preprocessing, visualization, and performance evaluation ensure accurate predictions for early diagnosis.
algorithm classification cnn decision-tree-classifier decision-trees deep-learning gradientboosting keras lung-cancer medical-image-processing navies-bayes-classifer neuralnetworks python scikit-learn
Last synced: 05 May 2026
https://github.com/lingumd/cryptocurrencies
Unsupervised machine learning models used to group the cryptocurrencies to help prepare for a new investment.
concatenate elbow-curves get-dummies hvplot jupyterlab kmeans matplotlib-pyplot minmaxscaler pandas path pca-analysis plotly-express scikit-learn unsupervised-machine-learning
Last synced: 13 Apr 2026
https://github.com/abhipatel35/svm-hyperparameter-optimization-for-breast-cancer
Utilizing SVM for breast cancer classification, this project compares model performance before and after hyperparameter tuning using GridSearchCV. Evaluation metrics like classification report showcase the effectiveness of the optimized model.
breast-cancer cancer-diagnosis classification data-analysis data-science gridsearchcv healthcare hyperparameter-tuning jupyter-notebook machine-learning medical-imaging pycharm python scikit-learn support-vector-machine svm
Last synced: 05 Feb 2026
https://github.com/oceanuz/car-price-regression
A comprehensive ML evaluation and improvement notebook for a car price prediction model. It includes topics such as scoring with r2, cross-validation, overfitting/underfitting diagnosis, and polynomial regression. *Ridge regression* is applied to reduce overfitting, and (GridSearchCV) techniques are used to find the best alpha hyperparameter.
cross-validation data-science grid-search hyperparameter-tuning machine-learning machine-learning-models model-evaluation overfitting python regression ridge-regression scikit-learn
Last synced: 11 Dec 2025
https://github.com/18mahi/digital_cave
An intermediate-level deep learning project that compares Convolutional Neural Networks (CNN) and Multi-Layer Perceptrons (MLP) on the MNIST handwritten digits dataset. This project demonstrates data augmentation, learning rate scheduling, and visual comparison of model performance
cnn confusion-matrix data-augmentation data-science deep-learning evaluation-metrics jupyter-notebook keras learning-rate-scheduler machine-learning matplotlib mlp numpy python3 scikit-learn seaborn tensorflow
Last synced: 13 Apr 2026
https://github.com/pranavsp108/time-series-forcasting
A time-series forecasting project to predict hourly energy consumption using Python, Pandas, and an XGBoost regression model.
data-analysis data-science energy-consumption forecasting matplotlib numpy pandas python scikit-learn sustainability time-series xgboost
Last synced: 10 Apr 2026
https://github.com/ahmadbuilds/fake-news-classifier
Classifies news articles as real or fake using an NLP pipeline with TF-IDF + n-grams and machine learning models. Includes text preprocessing, feature engineering, model training, and evaluation.
fastapi logistic-regression matplotlib n-grams nextjs nltk numpy pandas python3 random-forest-classifier react scikit-learn seaborn supervised-learning tf-idf typescript xgboost-classifier
Last synced: 11 Apr 2026
https://github.com/pranavsp108/financial-fraud-detection
A comprehensive machine learning project for detecting financial fraud using XGBoost and LightGBM, with a focus on advanced feature engineering, class imbalance handling, and hyperparameter tuning.
classification-model data-science feature-engineering fraud-detection hyperparameter-tuning lightgbm machine-learning pandas python scikit-learn xgboost
Last synced: 04 May 2026
https://github.com/kostadinlambov/time-series-forecasting
This project evaluates the predictive performance of a CNN-LSTM Hybrid deep learning model for Bitcoin price movement prediction.
keras-tensorflow matplotlib-pyplot mlflow numpy optuna pandas python scikit-learn seaborn statsmodels ta-lib tensorflow
Last synced: 07 Apr 2026
https://github.com/javedfazlulahf/customer-churn-prediction
📊 Predict customer churn in telecom using machine learning to enhance retention strategies and drive better business outcomes.
churn-prediction cross-validation data-science factorization-machines imbalanced-learn libsvm machine-learning model-evaluation pipelines plotly scikit-learn seaborn shap-values spark-ml survival-analysis tensorflow watson-studio xgboost4j
Last synced: 11 May 2026
https://github.com/blue-catblues/tieba-integratedanalysis
Python期末大作业—对百度贴吧进行爬虫采集(scrapy)、统计分析(pandas)、可视化展示(matplotlib),与机器学习分类(scikitLearn)的综合性数据分析
matplotlib nlp-machine-learning pandas python scikit-learn scrapy seaborn
Last synced: 05 Oct 2025
https://github.com/nihanthbhargav/time-series-stock-market
This project combines computer vision and NLP by segmenting pet images with a U-Net model and generating captions using CNN-RNN/LSTM. Using the Oxford-IIIT Pets dataset, it demonstrates a unified pipeline that integrates pixel-level segmentation with automatic caption generation for meaningful image understanding.
matplotlib numpy pandas plotly python scikit-learn seaborn
Last synced: 11 Apr 2026
https://github.com/inesruizblach/data-science-project
A data science project exploring Portuguese "Vinho Verde" wine quality prediction. Features EDA, feature engineering, ML models, and evaluation using Python, pandas, scikit-learn, and visualization tools.
binary-classification classification data-science exploratory-data-analysis feature-engineering imbalanced-learn jupyter-notebook machine-learning model-evaluation pandas regression scikit-learn seaborn uci-dataset wine-quality
Last synced: 09 May 2026
https://github.com/kianaabrisham/naive-bayes-sentiment
Sentiment classification using Multinomial NB (scratch + sklearn)
bag-of-words naive-bayes nlp scikit-learn sentiment-analysis text-classification
Last synced: 14 May 2026
https://github.com/therayyanshariff/cinereview
A Machine Learning web app for sentiment analysis, using a Scikit-learn NLP model with a custom-styled Streamlit UI.
machine-learning nlp python scikit-learn sentiment-analysis streamlit
Last synced: 04 May 2026
https://github.com/dearabhin/girlfriend-predictor
Using machine learning to solve the ultimate college classification problem. A fun project applying Python and Logistic Regression to predict relationship outcomes based on a (hilariously) synthetic dataset. 📊❤️
classification data-science fun-project google-colab jyputer-notebook jypyternotebook logistic-regression machine-learning pandas python scikit-learn
Last synced: 06 Oct 2025
https://github.com/harris-giki/e-comdataanalysis_ml
E-commerce Customer Analysis with Linear Regression: analyzes customer behavior within an e-commerce setting and predict yearly customer spending based on various features using a linear regression model.
development ecommerce linear-regression machine-learning model prediction-model python scikit-learn
Last synced: 14 Apr 2026
https://github.com/sducournau/ign_lidar_hd_dataset
🏗️ Comprehensive Python library for processing IGN LiDAR HD data into machine learning-ready datasets for Building Level of Detail (LOD) classification. Features GPU/CPU processing, smart data management, and complete ML pipeline integration.
building-classification data-processing dataset france geospatial gis ign lidar lidar-hd numpy point-cloud scikit-learn
Last synced: 20 Jan 2026
https://github.com/alrafiabdullah/titanic-kaggle
gridsearch pandas random-forest scikit-learn titanic-kaggle
Last synced: 21 Jan 2026
https://github.com/madsondeluna/mvp_pucrio_data_analytics_and_machine_learning
MVP referente a sprint de Machine Learning & Analytics (40530010056_20250_01) da Pós-Graduação em Data Science and Analytics da PUC-Rio.
comparative-analysis data-analytics data-science machine-learning-algorithms postgraduate-course python pytorch scikit-learn
Last synced: 03 May 2026
https://github.com/hiftd22/wpattern
📈 Analyze and visualize stock patterns with wPattern for better trading insights. Simplify your investment decisions through data-driven analysis.
cli finance financial-analysis financial-data matplotlib numpy pandas pattern-recognition python scikit-learn stock-scanner technical-analysis yfinance
Last synced: 29 Apr 2026
https://github.com/pranavsp108/market_basket_analysis-instacart
Customer segmentation and market basket analysis using the Instacart dataset with Python, Pandas, and K-Means clustering.
customer-segmentation-and-buying-behavior data-analysis data-visualization instacart jupyter-notebook kmeans-clustering market-basket-analysis pandas python scikit-learn
Last synced: 05 May 2026
https://github.com/fersihela/ml_project
Machine learning project developed for the Data Science course at Esprit School of Engineering. Focuses on predictive modeling using Python and scikit-learn.
deep-learning esprit-school-of-engineering machine-learning predective-modeling python scikit-learn
Last synced: 25 Feb 2026
https://github.com/jlee9503/telecommunication-churn
Analyze key factors influencing customer churn using Python data analytics technique. Explore key factors through data preprocessing, exploratory data analysis (EDA), and predictive modeling.
data-analysis data-visualization matplotlib pandas python scikit-learn
Last synced: 18 Jan 2026
https://github.com/lorenzorottigni/dl-houses
Deep Learning python bootcamp: exploratory data analysis & deep learning on houses price dataset
deep-learning ipynb machine-learning numpy pandas python scikit-learn seaborn tensorflow
Last synced: 11 Apr 2026
https://github.com/animesh-sh/selectanime
django django-rest-framework react redux scikit-learn
Last synced: 14 Apr 2026
https://github.com/shadmanshaikh/ml_algo_from_scratch
All standard machine learning algorithms from scratch in python 🐍
classification deep-learning machine-learning neural-nets python regression scikit-learn
Last synced: 09 May 2026
https://github.com/cinnaavox/loan-prediction
Machine Learning project predicting loan approvals using Decision Tree Classification. Includes data cleaning, feature engineering, model evaluation and key business insights.
decision-trees matplotlib numpy pandas python scikit-learn seaborn
Last synced: 14 Apr 2026
https://github.com/sharvesh1401/battsense
BattSense is a machine learning project focused on predicting the State of Health (SOH) of lithium-ion batteries using operational parameters such as voltage, current, temperature, and capacity. The model enables accurate, data-driven diagnostics for battery performance monitoring in electric vehicles and portable devices.
battery-diagnostics battery-health battery-health-prediction battery-soh data-analysis electric-vehicles energy-storage machine-learning predictive-maintenance python regression scikit-learn
Last synced: 07 May 2026
https://github.com/nicknobad/binaryimageclassification
Benchmark of sci-kit learn models and a tensorflow model for Binary Image Classification
binaryclassification image-classification scikit-learn scikit-model scikitlearn-machine-learning tensorflow tensorflow-examples tensorflow-experiments tensorflow-models
Last synced: 21 Jan 2026
https://github.com/allanreda/automated-k-means-clustering-engine
An interactive K-Means clustering tool built with Flask and Scikit-Learn, supporting Excel file uploads, cluster analysis, and data export, deployed on Google Cloud Run via Docker with CI/CD integration.
cicd css data-visualization deployment docker flask google-cloud-run html javascript k-means-clustering machine-learning matplotlib numpy pandas python scikit-learn
Last synced: 19 Jan 2026
https://github.com/khushirajurkar/exoplanet-habitability-prediction-model
Predicts whether an exoplanet is habitable using ML. Handles class imbalance with ADASYN, tests multiple models, and saves the best one. Includes confusion matrices, ROC curves, and a clean Jupyter notebook
adasyn astroinformatics confusion-matrix exoplanets logistic-regression machine-learning multiclass-classification python roc-curve scikit-learn smote
Last synced: 06 May 2026
https://github.com/lemma-osu/sknnr
scikit-learn compatible estimators for various kNN imputation methods
classification gnn gradient-nearest-neighbor imputation k-nearest-neighbor knn most-similar-neighbor msn random-forest-nearest-neighbor regression rfnn scikit-learn sklearn-estimator
Last synced: 23 Feb 2026
https://github.com/fanyicharllson/mobile-money-transaction-analysis
Machine learning pipeline for classifying mobile money users (MTN MoMo & Orange Money) into activity segments — CSC 3221 Final Project, ICT University Cameroon.
cameroon data-science ict-university jupyter jupyter-notebook machine-learning mtn-momo orange-money python scikit-learn
Last synced: 31 May 2026
https://github.com/probokrishnacahya/machine_learning
IMT01306601-A Machine Learning
ipynb machine-learning matplotlib numpy pandas python scikit-learn seaborn
Last synced: 14 Apr 2026
https://github.com/manome/python-supervised-learning
This project provides sample code for performing supervised learning.
conformal-prediction scikit-learn supervised-learning
Last synced: 19 Jan 2026
https://github.com/soumyapro/parkinson-disease-prediction
This project predicts Parkinson's disease using machine learning models.
logistic-regression numpy pandas scikit-learn svc xgboost
Last synced: 19 Jan 2026