scikit-learn
scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.
- GitHub: https://github.com/topics/scikit-learn
- Wikipedia: https://en.wikipedia.org/wiki/Scikit-learn
- Repo: https://github.com/scikit-learn/scikit-learn
- Created by: David Cournapeau
- Released: January 05, 2010
- Related Topics: scikit, python,
- Aliases: sklearn,
- Last updated: 2026-06-25 00:23:58 UTC
- JSON Representation
https://github.com/eugen-goebel/predictive-analytics-agent
Automated ML pipeline — data profiling, preprocessing, model training, and evaluation report generation
automation data-science docker machine-learning predictive-analytics python scikit-learn streamlit
Last synced: 05 Apr 2026
https://github.com/sundanc/weatherprediction
This project implements a weather prediction system that predicts the temperature based on real-time weather data, including features like humidity, wind speed, and day-related features (day of the week, month
machine-learning machinelearning numpy pandas programming python scikit-learn scikitlearn-machine-learning weather-prediction
Last synced: 18 Apr 2026
https://github.com/akhundmuzzammil/energyconsumptionprediction
This repository contains code and resources for training a linear regression model to predict energy consumption based on various building parameters.
data-analysis energy-consumption linear-regression machine-learning python scikit-learn streamlit visualization
Last synced: 18 Apr 2026
https://github.com/hariprasath-v/machinehack-analytics-olympiad-2022
Create a machine learning model to help an insurance company understand which claims are worth rejecting and the claims which should be accepted for reimbursement.
catboost-classifier exploratory-data-analysis logloss machinehack numpy optuna pandas python scikit-learn shap
Last synced: 18 Apr 2026
https://github.com/alezoon/movie-revenue-prediction
Sk-learn practice using Linear Regression, ML workflow practice.
jupyter machine-learning matplotlib-pyplot numpy pandas python scikit-learn
Last synced: 05 Apr 2026
https://github.com/ksasi/dog-breed-classifier
Dog Breed Classifier
cnn cnn-classification computer-vision deep-learning deep-neural-networks keras keras-neural-networks machine-learning numpy pandas python scikit-learn
Last synced: 05 Apr 2026
https://github.com/ricardorobledo/next_level_data_science
matplotlib numpy pandas python3 scikit-learn
Last synced: 05 Apr 2026
https://github.com/simrandalal/semantic-book-recommender
A semantic content-based book recommender using sentence-transformer embeddings, cosine similarity, and a Streamlit interface.
dotenv huggingface-transformers nlp-machine-learning pandas python scikit-learn similarity-search streamlit
Last synced: 05 Apr 2026
https://github.com/murugavl/flower-prediction
Flower Prediction is a machine learning project that uses the Iris dataset to classify iris flowers into three species: Setosa, Versicolor, and Virginica. The project includes data analysis, model training with various algorithms, and deployment via a Flask web application for user-friendly predictions.
flask machine-learning matplotlib numpy pandas python scikit-learn seaborn
Last synced: 05 Apr 2026
https://github.com/elprofesoriqo/kagglecompetitions
Kaggle competitions projects
artificial-intelligence machine-learning python pytorch scikit-learn
Last synced: 05 Apr 2026
https://github.com/taqsblaze/hush
Hush: A lightweight, context-aware text toxicity classifier. Leveraging NLP and Random Forest ensemble learning to detect and mitigate harmful language in real-time. Built for efficiency, safety, and cleaner digital communication.
content-moderation machine-learning nlp random-forest safety-tools scikit-learn text-classification toxicity-detection
Last synced: 05 Apr 2026
https://github.com/deliprofesor/game-search-volume-prediction-machine-learning-models-and-forecasting
This repository uses machine learning models like Random Forest, XGBoost, LightGBM, and time-series forecasting with Prophet to predict game search volumes. Additionally, Grid Search is applied for hyperparameter tuning of the LightGBM model.
data-cleaning data-science data-visualization feature-selection forecasting-models game-search grid-search hyperparameter-tuning lightgbm machine-learning pandas prophet python random-forest scikit-learn time-series-analysis time-series-forecasting xgboost
Last synced: 18 Apr 2026
https://github.com/malick08012/heart-disease-prediction
A machine learning project that predicts the risk of heart disease based on patient health data. Includes data cleaning, EDA, visualization, model training, evaluation and feature importance analysis
artificial-intelligence heartdisease-prediction logistic-regression machine-learning python scikit-learn
Last synced: 18 Apr 2026
https://github.com/manalisbhavsar/mall-customers-clustering
K-Means clustering to mall customer data, segmenting customers based on their annual income and spending score. To identify patterns and group customers for targeted marketing.
data-analysis data-visualization matplotlib numpy pandas python scikit-learn
Last synced: 18 Apr 2026
https://github.com/jeffandyalltogether/mlrecommendationsystem
project code for a recommendation system for Amazon using collaborative filtering, ranking, and matrix factorization to enhance customer satisfaction and product discovery.
eda matplotlib pandas python scikit-learn seaborn tensorflow
Last synced: 05 Apr 2026
https://github.com/naren1704/ml-approach-for-employee-performance-prediction
A Flask UI that predicts the performance of employee based on the XGBoost trained model.
css flask html python scikit-learn xgboost
Last synced: 05 Apr 2026
https://github.com/yashrajgithub/crop-recommendation
KrishiGyaan is a web app designed to help farmers make informed decisions on crop selection. By analyzing soil and environmental factors, the app provides personalized crop recommendations, enhancing agricultural productivity and promoting sustainable farming practices.
api artificial-intelligence crop-recommendation-system data-preprocessing data-visualization json machine-learning-algorithms pickle python random-forest-classifier scikit-learn streamlit supervised-learning train-test-split user-interface
Last synced: 05 Apr 2026
https://github.com/barek2k2/ml_ruby
Ruby gem uses Machine Learning(ML) techniques to make predictions and classifications, and it's powered by Python3 under the hood.
artificial-intelligence data-science machine-learning pandas prediction python3 ruby ruby-on-rails scikit-learn
Last synced: 05 Apr 2026
https://github.com/merekat/ml-shortcut-library
A Visual Studio Code shortcut library designed to simplify and accelerate machine learning development.
cnn coding data-science deep-learning efficiency extension extensions fnn machine-learning machinelearning nlp numpy pandas python scikit-learn shortcut shortcuts tensorflow visual-studio visual-studio-code
Last synced: 05 Apr 2026
https://github.com/emilyfelker/ieee_cis_fraud_detection
Which online transactions are fraudulent? Program that uses various machine learning algorithms to detect fraud.
decision-trees kaggle logistic-regression machine-learning neural-network pandas poetry pytest python scikit-learn sklearn tensorflow xgboost
Last synced: 05 Apr 2026
https://github.com/oadultradeepfield/galaxy10-anomaly-detection
A public API and experimental PyTorch pipeline for anomaly detection in the Galaxy10 DECals dataset using ResNet50, autoencoders, and clustering techniques
flask google-cloud-run kaggle pytorch scikit-learn
Last synced: 05 Apr 2026
https://github.com/lexxai/goit_python_ds_hw_04
Модуль 4. Класифікація та оцінка роботи моделі. Лінійна регресія: перенавчання та регуляризація
lasso-regression linear-regression numpy pandas python red regression ridge-regression scikit-learn
Last synced: 05 Apr 2026
https://github.com/nowon1/insurance-claim-prediction_version
This project aims to predict the insurance claim amounts based on various customer attributes using machine learning techniques. The project involves data preprocessing, exploratory data analysis, feature engineering, and model training and evaluation.
data-preprocessing data-science data-visualization exploratory-data-analysis feature-engineering insurance jupyter-notebook machine-learning numpy pandas predictive-modeling python random-forest regression-analysis scikit-learn
Last synced: 05 Apr 2026
https://github.com/perpendicooler/elementary-research-for-steamboat-willie-s-store-in-poland
An elementary research for a company to opening store in a city using gurobi and pulp optimization.
christofides-algorithm gurobipy numpy pandas pulp python3 scikit-learn travelling-salesman-problem
Last synced: 05 Apr 2026
https://github.com/billy0402/python-machine-learning
A learning project from NTUB machine learning course.
ai course jupyter-notebook python scikit-learn tensorflow
Last synced: 05 Apr 2026
https://github.com/lorenzorottigni/ml-movies
Machine Learning python bootcamp: Recommender Systems on movies dataset
ipynb machine-learning numpy pandas python recommender-system scikit-learn seaborn
Last synced: 05 Apr 2026
https://github.com/amiegirl/sentiment_analyzer_app
Sentiment Analysis and Deployment of Real-time Flipkart Product Reviews
aws aws-ec2 flask logistic-regression machine-learning naive-bayes-classifier pandas python random-forest-classifier scikit-learn wordcloud
Last synced: 05 Apr 2026
https://github.com/kenatsf/basic_data_analysis
Basic data science project: ETL, forecast and data visualization.
analysis data data-analysis data-science logistic-regression matplotlib matplotlib-pyplot numpy pandas powerbi python scikit-learn time-series time-series-analysis time-series-forecasting
Last synced: 05 Apr 2026
https://github.com/manojpatra1991/machine-learning-engineer-nanodegree
Machine Learning Engineer Nanodegree Projects - My Submissions
adaboost csv-files decision-tree html jupyter-notebook linear-regression machine-learning machine-learning-algorithms machine-learning-nanodegree naive-bayes-classifier python3 scikit-learn support-vector-machine
Last synced: 18 Apr 2026
https://github.com/mbarbetti/mediastinal-lymphoma-classification
Machine-learning-based classification of bulky mediastinal lymphomas using radiomic features
diagnosis-prediction lymphoma-classification machine-learning personalized-treatment precision-medicine radiomics-analysis scikit-learn texture-analysis
Last synced: 18 Apr 2026
https://github.com/thekartikeyamishra/ai-customer-feedback-summarizer
The AI Customer Feedback Summarizer is a Python-based application that processes customer feedback, extracts insights, and summarizes reviews. This basic version uses extractive summarization techniques, and the advanced version integrates advanced sentiment analysis, visualization, and industry-specific fine-tuning.
ai chatbot gpt machine-learning matplotlib nltk pandas python scikit-learn streamlit
Last synced: 18 Apr 2026
https://github.com/nabilshadman/python-classification-and-generative-models
Applications of classification and generative models with Python
classification data-science data-visualization generative-model machine-learning matplotlib numpy pandas scikit-learn
Last synced: 19 Apr 2026
https://github.com/vijaykumarr1452/black_friday_sales_analysis
Black Friday Sales Analysis python machine learning project using pandas and scikit-learn for data preprocessing, model training, and performance evaluation.
confusion-matrix jupyter-notebook machine-learning pandas python random-forest-classifier sales-analysis scikit-learn
Last synced: 19 Apr 2026
https://github.com/kheriberto/linear_regression_ecommerce
Simple project showcasing crafting a linear regression model with SciKit Learn
data-analysis jupyter-notebook linear-regression pandas python scikit-learn seaborn
Last synced: 19 Apr 2026
https://github.com/syzygianinfern0/ml-tutorial
Basics of ML to Help Beginners made in Jupyter Notebook
jupyter-notebook machine-learning machine-learning-algorithms machine-learning-coursera scikit-learn
Last synced: 19 Apr 2026
https://github.com/yassin522/heartbeat-categorization
This project is aimed at developing a machine learning model that can accurately classify heartbeats as either normal or abnormal. The model is trained on a dataset of ECG (electrocardiogram) signals, which were collected from patients and labeled by medical professionals.
cnn deep-learning keras machine-learning scikit-learn tensorflow
Last synced: 20 Apr 2026
https://github.com/kaladabrio2020/machine-learning-with-pytorch-and-scikit-learn
Progress on the book machine learning with pytorch and scikit-learn
deep-learning implementation machine-learning python3 pytorch scikit-learn
Last synced: 20 Apr 2026
https://github.com/vyjayanthipolapragada/car_mileage_prediction
Predicting the mileage of car using the linear regression model with Scikit-learn
kaggle-titanic linear-regression machine-learning numpy pandas predictive-modeling python scikit-learn
Last synced: 20 Apr 2026
https://github.com/prahaladhchandrahasan/housingprices_adavanced_regression
A machine learning model for "House Prices: Advanced Regression Techniques" kaggle competition.
machine-learning-algorithms matplotlib-pyplot numpy pandas python3 scikit-learn
Last synced: 20 Apr 2026
https://github.com/namratha2301/carprice_analysisandprediction
This project analyzes factors influencing vehicle prices using a dataset of various attributes, including Engine capacity, Power, Mileage, and Seating capacity.
data-analysis data-visualization exploratory-data-analysis machine-learning pandas predictive-modeling random-forest-classifier regression scikit-learn seaborn
Last synced: 20 Apr 2026
https://github.com/zawadi-wanjiru/house-prices-prediction-group-project
Predicting House Prices Using Regression Analysis
datacleaning datavisualization descriptive-statistics exploratory-data-analysis jupyter-notebook matplotlib modelling pandas-library predictive-analysis python regression-analysis scikit-learn seaborn-python
Last synced: 20 Apr 2026
https://github.com/bruceunx/ai-simulator
aiplayground 人工智能学习乐园
ai maching-learning scikit-learn
Last synced: 20 Apr 2026
https://github.com/dahsie/spam_classification
Ce fut mon prémier projet NLP où j'ai réalisé la détection de spam en utilisant les algorithmes d'embedding pour encorder mes textes. J'ai utilisé Random Forest et Milti-Layres Perceptrons pour la phase de classification. Ce qui a pemit l'obtension des précisions respective de 97% et 98%. J'ai aussi appris à documenter mes codes via sphinx
doc2vec fasttext-embeddings gensim glove-embeddings python scikit-learn sphinx-doc word2vec-algorithm
Last synced: 20 Apr 2026
https://github.com/grandechowhiskey/harvard-cs50-ai-projects
This project contains a collection of programming assignments from CS50’s Introduction to Artificial Intelligence with Python course.
html python scikit-learn tensorflow
Last synced: 20 Apr 2026
https://github.com/himasnhu-at/freecodecamp--ml
ML Models I built for my freeCodeCamp's Machine Learning with Python certification
freecodecamp freecodecamp-project machine-learning machine-learning-algorithms matplotlib pandas python scikit-learn
Last synced: 20 Apr 2026
https://github.com/tr-3n/-ai-powered-resume-analyzer-multi-source-job-matcher
AI-Powered Resume Analyzer & Multi-Source Job Matcher, is a web application built using Python and Streamlit that helps job seekers find the best job opportunities based on their resume. The app extracts text from uploaded resumes, matches it with job listings from multiple sources, and displays the most relevant jobs.
ai api html-css job job-recommendation job-search jobmatching natural-language-processing pandas pypdf2 python resume-analyzer scikit-learn streamlit web-development
Last synced: 20 Apr 2026
https://github.com/tryomar/data-miner
DataMiner is an interactive web application for data mining and machine learning. It helps users upload, clean, transform, and analyze datasets while building predictive models — all through a simple and powerful Streamlit interface.
data-cleaning data-mining data-preprocessing data-science data-visualization interactive-dashboards pandas python scikit-learn streamlit
Last synced: 20 Apr 2026
https://github.com/alphacrypto246/customer-churn
This project predicts customer churn using machine learning. It includes data preprocessing, exploratory analysis, model training, and evaluation to identify key factors driving churn and provide actionable insights for retention.
knn-classification machine-learning machine-learning-algorithms python scikit-learn scikitlearn-machine-learning
Last synced: 20 Apr 2026
https://github.com/abdel-17/facial-recognition
Facial recognition using Machine Learning in Python
machine-learning pca python scikit-learn
Last synced: 20 Apr 2026
https://github.com/simhayn/binary-classification
Alzheimer's disease detection using XGBoost and other prediction models.
alzheimer-disease-prediction binary-classification exploratory-data-analysis mental-health prediction-model python scikit-learn xgboost
Last synced: 20 Apr 2026
https://github.com/chdl17/lead-score-case-study
Lead scoring is the process of assigning a numerical value or score to each lead, based on factors such as demographics and behavior, to determine their potential value as customers.
machine-learning-algorithms matplotlib-pyplot python scikit-learn
Last synced: 20 Apr 2026
https://github.com/kimaya012/fake-news-detection
This project detects whether a news is fake or not using machine learning.
decision-tree-classifier fake-news-detection gradient-descent logistic-regression machine-learning python random-forest-classifier scikit-learn sklearn
Last synced: 20 Apr 2026
https://github.com/ghufranbarcha/linear-regression-training-app
This project is a Streamlit application that allows users to upload a CSV file, select variables, and train a linear regression model. The app provides an easy-to-use interface for selecting dependent and independent variables, scaling data, applying polynomial regression, and evaluating model performance.
data-science machine-learning python scikit-learn streamlit
Last synced: 20 Apr 2026
https://github.com/kerushani/sign-language-detection
A sign language detector.
mediapipe opencv python scikit-learn
Last synced: 20 Apr 2026
https://github.com/adityapradhan202/binge-trend
Media and entertainment recommendation website with AI powered recommendation system.
datascience-machinelearning natural-language-processing python scikit-learn spacy-nlp
Last synced: 21 Apr 2026
https://github.com/yogeshsinghkatoch9/advanced_nyc_housing_price_prediction
A robust ensemble learning framework for advanced NYC housing price prediction, leveraging global, clustered, and local ensembles with hyperparameter tuning.
data-science ensemble-learning housing-prices machine-learning new-york python scikit-learn
Last synced: 21 Apr 2026
https://github.com/sayan-mondal2022/mlops-assignment
A project for validating the Machine learning models
machine-learning scikit-learn streamlit
Last synced: 22 Apr 2026
https://github.com/h-sarhan/hate-speech-classifier
Automatic Detection of Hate Speech and Offensive Content
Last synced: 22 Apr 2026
https://github.com/5hraddha/megaline-plan-recommendations
Megaline is a telecom operator and it offers its clients two prepaid plans, Surf and Ultimate.Megaline has found out that many of their subscribers use legacy plans. They want to develop a model that would analyze subscribers' behavior and recommend one of Megaline's newer plans: Smart or Ultra.
decision-tree-classifier logistic-regression random-forest-classifier scikit-learn supervised-learning
Last synced: 22 Apr 2026
https://github.com/waikato-datamining/spectral-data-converter-sklearn
Scikit-learn plugins for the spectral-data-converter library.
kasperl scikit-learn sdc seppl spectral-data
Last synced: 24 Apr 2026
https://github.com/sabin74/movie_recommendation_system
A Python-based movie recommendation engine built using the MovieLens Dataset that supports:
collaborative-filtering content-based-filtering cosine-similarity movie-lens movie-recomendation-system pyhton3 scikit-learn tf-idf-vectorizer
Last synced: 24 Apr 2026
https://github.com/hoccyy/house-price-prediction
Machine learning model built with Scikit-learn to predict house prices based on various features.
linear-regression machine-learning ml pickle prediction-model scikit-learn scikitlearn-machine-learning
Last synced: 24 Apr 2026
https://github.com/mcp-tool-shop-org/runforge-vscode
RunForge VS Code Extension - Push-button ML training with presets
deterministic developer-tools machine-learning mcp python scikit-learn training typescript vscode vscode-extension
Last synced: 25 Apr 2026
https://github.com/jawwad-fida/data-science-salary-estimator
A tool that estimates data science salaries (MAE ~ $ 11K) to help data scientists negotiate their income when they get a job.
data-science machine-learning project scikit-learn
Last synced: 25 Apr 2026
https://github.com/capsuleismail/parkinsons-telemonitoring-dataset
Dataset used to predict Parkinson’s disease severity based on biomedical voice measurements.
data-science jupyter-notebook machinelearning-python scikit-learn
Last synced: 25 Apr 2026
https://github.com/sarangs1621/weather-prediction
Weather Prediction Using Machine Learning is a project that leverages machine learning algorithms to predict weather conditions based on historical data. It evaluates three popular ML models (Decision Tree, KNN, and Logistic Regression) and provides performance insights through metrics and visualizations.
data-analysis decision-tree jupyter-notebook knn logistic-regression machine-learning predictive-modeling python scikit-learn weather-prediction
Last synced: 25 Apr 2026
https://github.com/bp0609/decision-tree-implementation-from-scratch
This repo contains the decision tree implementation from scratch for all possible cases i) discrete features, discrete output; ii) discrete features, real output; iii) real features, discrete output; iv) real features, real output.
decision-tree-classifier decision-tree-regressor scikit-learn
Last synced: 26 Apr 2026
https://github.com/deliprofesor/cinematic-data-analytics-and-recommendation-platform
This project analyzes a movie dataset using machine learning algorithms to predict success, explore revenue-popularity relationships, and develop recommendation systems. It employs techniques like K-Means, DBSCAN, GMM, decision trees, PCA, and NLP for insights and personalized suggestions.
clustering content-based-recommendation data-analysis data-visualization decision-tree gmm k-means machine-learning natural-language-processing nlp pca predictive-modeling python recommendation-system scikit-learn user-based-recommendation
Last synced: 26 Apr 2026
https://github.com/a-n-i-t-t-a/credit_card_fraud_detection
Fraudulent transactions are a growing concern in the financial sector, and leveraging machine learning can help detect anomalies in real-time. I built a Credit Card Fraud Detection System using the K-Nearest Neighbors (KNN) algorithm, trained on a dataset with key transaction patterns.
flask knn-classifier machine-learning pandas python scikit-learn
Last synced: 27 Apr 2026
https://github.com/leolion3/smartnanotubes-smellinspector-companion
Companion software for the SmellInspector Devices from SmartNanoTubes. Allows specifying substances, connecting multiple devices, collecting data and performing machine learning.
docker machine-learning python3 reactjs scikit-learn smartnanotubes smellinspector
Last synced: 27 Apr 2026
https://github.com/mihirmakwana03/ci7521-cw1-notebook
Multi-class classification on imbalanced data — 8 sklearn classifiers + SMOTE + ROC-AUC benchmarking. Kingston CI7521 CW1.
classification hyperparameter-tuning imbalanced-data machine-learning scikit-learn smote
Last synced: 27 Apr 2026
https://github.com/toscdom/spam_detection
This repository contains a project focused on analyzing and classifying emails to detect SPAM. It includes: Training a machine learning classifier for SPAM detection. Identifying key topics in SPAM emails using NLP techniques. Calculating semantic distances to evaluate topic similarity. Tools used include Python libraries like nlp frameworks
classifier nlp nltk scikit-learn semantic-analysis spam-detection
Last synced: 27 Apr 2026
https://github.com/sundanc/movierecommendation
Movie recommendation system based on user input. Built with Streamlit
movie-recommendation-app python scikit-learn scikitlearn-machine-learning streamlib
Last synced: 27 Apr 2026
https://github.com/davidrpugh/kaust-dsa-201
Course materials for KAUST DSA 201
deep-learning machine-learning pytorch scikit-learn
Last synced: 27 Apr 2026
https://github.com/capsuleismail/spambase
Classifying Email as Spam or Non-Spam with RandomForestClassifier
datascience jupyter-notebook machinelearning-python scikit-learn
Last synced: 28 Apr 2026
https://github.com/tillscode/personal-finance-ml-analysis
Machine learning analysis of personal financial data with predictive modeling and interactive dashboard
dashboard data-analysis finance machine-learning python scikit-learn
Last synced: 28 Apr 2026
https://github.com/renoyegon/customer_segmentation_using_kmeans_clustering
This project applies KMeans clustering to segment customers in the Online Retail II dataset. Using powerful Python libraries such as pandas, scikit-learn, matplotlib, and seaborn, we uncover meaningful customer behavior patterns
kmeans-clustering matplotlib scikit-learn seaborn
Last synced: 28 Apr 2026
https://github.com/serdaraydem1r/10dayaichallenge101
In the 10-day camp, we experienced the basics of machine learning by coding
artificial-intelligence machine-learning-algorithms model-evaluation-and-selection scikit-learn
Last synced: 28 Apr 2026
https://github.com/lmriccardo/moments-learning
Repository for the First-Second Moments Learning project. In this repo you will find an implementation of a learning model to learn the relationship between time-series model parameters and the first two moments of its outputs
machine-learning mean mlp-regressor models random-forest scikit-learn time-series torch variance
Last synced: 28 Apr 2026
https://github.com/hai4320/ml_ai_notebook
All my note about ML, AI and Data Science
ai machine-learning numpy pandas scikit-learn
Last synced: 28 Apr 2026
https://github.com/dwade-eng/amazon-product-recommender-prototype-
This project is a content-based product recommendation engine inspired by Amazon's "Customers who viewed this item also viewed" feature. It uses a dataset of product metadata and user interactions to suggest similar items based on product titles, brands, and categories using TF-IDF vectorization and cosine similarity.
html numpy pandas python3 scikit-learn
Last synced: 28 Apr 2026
https://github.com/ronverse17/loan-recovery-strategy
End-to-end ML project for predicting high-risk borrowers and recommending recovery actions
classification data-science kmeans-clustering machine-learning matplotlib random-forest-classifier scikit-learn seaborn
Last synced: 28 Apr 2026
https://github.com/emmanuelletocs/steam-game-recommender
A powerful recommendation system for Steam games, combining Content-Based and Collaborative Filtering techniques. Built with Python, Scikit-learn, and Streamlit to deliver accurate, real-time game recommendations. Perfect for gamers and data scientists interested in building intelligent recommendation engines.
als-algorithm data-analysis gaming-industry knn machine-learning mds mysql ncf neural-network pyspark recommendation-engine recommendation-system scikit-learn spark
Last synced: 28 Apr 2026
https://github.com/rajivaleaakash/customer-churn-prediction
A machine learning project focused on predicting customer churn using various data analysis and modeling techniques. The repository includes data preprocessing, feature engineering, exploratory data analysis (EDA), model training, evaluation, and visualization to help businesses identify customers at risk of leaving.
churn-prediction classification customer-churn data-analysis data-science gridsearchcv imblearn machine-learning numpy pandas pyhton randomsearchcv scikit-learn
Last synced: 28 Apr 2026
https://github.com/brenofariasdasilva/dagster-education-model
Dagster Education Model using Dagster 1.3.11 and Python 3.7.17.
dagster makefile matplotlib pandas pyenv python3 scikit-learn seaborn shellscript
Last synced: 28 Apr 2026
https://github.com/akash-47-tank/predictive-customer-churn-analyzer
A professional-grade customer churn prediction system that not only predicts customer churn but also provides clear explanations for the predictions. Built with Python, XGBoost, and SHAP.
machine-learning pandas python scikit-learn shap streamlit xgboost
Last synced: 28 Apr 2026
https://github.com/findthehead/pentestpayload
A KNN algorithm based Web Application Payload search and modification engine with a nice red FLASK based GUI
knn-classification knn-regression machine-learning pentest-tool scikit-learn websecurity
Last synced: 28 Apr 2026
https://github.com/catcoder27/ai-portfolio
Reusable ML scaffold: notebooks, model cards, reports
data-science kaggle machine-learning pandas scikit-learn
Last synced: 28 Apr 2026
https://github.com/incalculable-driverslicence975/data-projects-portfolio
📊 Showcase data projects that highlight analytics, machine learning, and MLOps with reproducible code and clear business insights.
ai computer-vision dashboard data-science-projects data-visualization deep-learning etl excel finance hadoop hiveq keras machine-learning nlp pandas portfolio-project scikit-learn tableau-dashboards
Last synced: 28 Apr 2026
https://github.com/nexus69420/movie-recommender-streamlit
A hybrid movie recommendation system that combines content-based filtering using NLP and collaborative filtering using SVD. Built with Python, Streamlit, and trained on TMDB and MovieLens data. Delivers personalized recommendations with a simple web interface.
collaborative-filtering content-based-recommendation data-science machine-learning nlp python recommendation-system scikit-learn streamlit svd
Last synced: 28 Apr 2026
https://github.com/rubaramanan/irisclassifier_with_scikitlearn
iris-classification machine-learning numpy pandas scikit-learn
Last synced: 28 Apr 2026
https://github.com/arnab-0053/song-identifier
It identifies songs and artists from lyric snippets using two distinct methods - simple NLP based approach and BM25(Best Match 25) approach.
bm25 nlp nltk python rank-bm25 scikit-learn song-lyrics spotify-dataset text-preprocessing
Last synced: 28 Apr 2026
https://github.com/abhi227070/car-price-prediction
This project implements a machine learning model to predict the price of cars based on various features such as mileage, manufacturing date, fuel type, and more. Users can input car information, and the model will estimate the price of the car based on the provided data. This tool can be useful for both car buyers and sellers to estimate car price.
data-analysis machine-learning machine-learning-algorithms machinelearning python3 regression regression-models scikit-learn scikitlearn-machine-learning
Last synced: 28 Apr 2026
https://github.com/razalkr70/customer-segmentation-using-dataset
A data science project that segments mall customers using K-Means clustering. Based on age, income, and spending score, it identifies customer groups and visualizes them with 2D and 3D plots for targeted marketing insights.
clustering customer-segmentation data-science data-visualization kmeans machine-learning pca python scikit-learn
Last synced: 28 Apr 2026
https://github.com/rakibhhridoy/customersegmentation-clustering
Customer segmentation heavily use in business purpose. It is needed skill for business intelligence and applied machine learning engineer. This represent quite basic way the customer segmentation is done. In python the task is quite easy to do.
agglomerative-clustering clustering-algorithm customer ecommerce kmeans-clustering machine-learning scikit-learn scikitlearn-machine-learning segmentation unsupervised-learning unsupervised-machine-learning
Last synced: 28 Apr 2026
https://github.com/senaldolage/spam-text-classifier
A simple machine learning project to classify whether a given text message is spam or not
Last synced: 28 Apr 2026
https://github.com/tom-uchida/introduction_to_machine_learning
Machine learning private lesson.
Last synced: 28 Apr 2026