scikit-learn
scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.
- GitHub: https://github.com/topics/scikit-learn
- Wikipedia: https://en.wikipedia.org/wiki/Scikit-learn
- Repo: https://github.com/scikit-learn/scikit-learn
- Created by: David Cournapeau
- Released: January 05, 2010
- Related Topics: scikit, python,
- Aliases: sklearn,
- Last updated: 2026-06-23 00:27:46 UTC
- JSON Representation
https://github.com/snikumbh/archr
archR: Identifying promoter sequence architectures de novo using NMF
archr discovery nmf non-negative-matrix-factorization promoter-sequence-architectures r r-package scikit-learn sequence-architectures unsupervised-machine-learning
Last synced: 18 Apr 2026
https://github.com/mnj-tothetop/english-handwritten-characters-recognizer
A handwritten english character recognizer [0-9, A-Z, a-z] made by using a Dataset of 3409 images. Tensorflow, Keras, Scikit-learn, and OpenCV was used to implement the Convolution Neural Network (CNN). Matplotlib and Seaborn were used to visualize the data.
artificial-intelligence convolutional-neural-networks keras matplotlib opencv-python scikit-learn seaborn tensorflow
Last synced: 18 Apr 2026
https://github.com/27ahmad/movie-recommendation-system
Welcome to the Movie Recommendation System! This project uses Streamlit to provide personalized movie recommendations based on user preferences and similarity.
movie-recommendation numpy pandas python scikit-learn
Last synced: 04 Apr 2026
https://github.com/bjpcjp/scikit-learn
Updates in progress. Jupyter workbooks will be added as time allows.
Last synced: 18 Apr 2026
https://github.com/minhtran241/ml-dl-llm-genai
Showcasing ML/DL fundamentals, paper implementations, deep learning models, and other projects. The purpose of this repository is to provide a playground for me to explore and learn about PyTorch, deep learning, and generative AI.
deep-learning generative-ai llm machine-learning paper-implementations pytorch scikit-learn
Last synced: 18 Apr 2026
https://github.com/justsecret123/nba-players-stats-analysis
A quick interactive Notebook to visualize some NBA players stats (points, assists, steals, blocks...) and totals, rankings and comparisons. Feel free to add any player in the .csv data files. 🏀
csv ipython-notebook ipywidgets jupyter-notebook jupyterlab matplotlib pandas python scikit-learn seaborn
Last synced: 18 Apr 2026
https://github.com/gattsu001/telecom-churn-predictor
Predicts which telecom customers are likely to churn with 95% accuracy using engineered features from usage, billing, and support data. Implements Sturges-based binning, one-hot encoding, stratified 80/20 train-test split, and a two-level ensemble pipeline with soft voting. Achieves 94.60% accuracy, 0.8968 AUC, 0.8675 precision, 0.7423 recall.
churn-prediction classification classification-algorithm customer-retention data-science data-visualization feature-engineering joblib jupyter-notebook machine-learning pandas scikit-learn supervised-learning svm
Last synced: 18 Apr 2026
https://github.com/rescurib/random_forest_arduino_uno
Ejemplo de implementación de un clasificador de bosque aleatorio en un Arduino UNO usando scikit-learn y m2cgen.
Last synced: 18 Apr 2026
https://github.com/tanim-mishkat/data-science-prediction-model-pds-course-
Diabetes Progression Prediction Using Regression Analysis: This project uses regression analysis in Python to predict diabetes progression based on medical and physiological data. Includes data preprocessing, model training, evaluation, and visualizations.
data-science machine-learning python regression scikit-learn
Last synced: 19 Apr 2026
https://github.com/gregoritsch3/ml_clustering_eda_customersegmentation
An EDA and Machine Learning Clustering exercise on the Mall Customer Segmentation synthetic dataset demonstrating the use of KMeans Clustering and the Elbow Method. The clustering algorithm successfully segments the customer base into groups distinguishable by their annual income and spending score.
clustering kmeans-clustering machine-learning matplotlib numpy pandas scikit-learn scipy seaborn
Last synced: 04 Apr 2026
https://github.com/pedroteixeiraw/variational_quantum_circuit_binary_classification
This project focuses on developing a Variational Quantum Circuit capable of performing Binary Classification between two classes: red wine and white wine, based on their characteristics using machine learning.
binary-classification cost-function json machine-learning matplotlib numpy pandas qiskit qiskit-machine-learning quantum-machine-learning scikit-learn training-data variational-circuit
Last synced: 04 Apr 2026
https://github.com/sentinel-ml/sentinel_ai
Machine Learning Model to detect fraud in financial systems
ai python pytorch scikit-learn security security-tools tensorflow
Last synced: 04 Apr 2026
https://github.com/abdul-rafay19/california-housing-price-prediction
This project predicts California housing prices using machine learning regression models, including Random Forests and Decision Trees. It covers data preprocessing, exploratory analysis, model training, and hyperparameter tuning to optimize performance.
decision-trees gridsearchcv linear-regression matplotlib numpy pandas python random-forest randomsearch-cv scikit-learn scipy seaborn
Last synced: 04 Apr 2026
https://github.com/alainlebret/python-et-ia-1
Ressources personnelles du cours "Python & IA" en 2e année GPSE à l'ENSICAEN
artificial-intelligence image-processing machine-learning matplotlib numpy python scikit-image scikit-learn
Last synced: 04 Apr 2026
https://github.com/adhadse/hands-on-machine-learning-book-notes-and-practice
This repo holds the Jupyter notebooks and datasets containing notes/comments on things I learned from this book. Feel free to use and learned from them.
data-science deep-learning jupyter-notebooks keras machine-learning python scikit-learn tensorflow
Last synced: 04 Apr 2026
https://github.com/kaladabrio2020/livro-ml-with-pytorch-and-sk
Progresso em cada capitulo
jupyter-notebook matplotlib-pyplot nump pandas python3 pytorch scikit-learn
Last synced: 04 Apr 2026
https://github.com/mnitin-reddy/a-b-testing-and-regression-analysis-for-ad-performance-optimization
Analyzed the performance of Facebook and AdWords ads using A/B testing and regression analysis to identify trends, correlations, and cost-effectiveness. Key insights included distribution of clicks and conversions, monthly trends, and cost-per-conversion analysis to optimize ROI.
abtesting data-science hypothesis-testing machine-learning matplotlib numpy pandas scikit-learn scipy seaborn statsmodels
Last synced: 04 Apr 2026
https://github.com/yashsonaar/machine-learning-tasks
This repository has machine learning tasks which include classification, recommendation system, fraud detection system
classification jupyter-notebook machine-learning numpy pandas prediction python scikit-learn testing
Last synced: 04 Apr 2026
https://github.com/anushrey10/fuel_efficiency_predictor
Welcome to the Fuel Efficiency Predictor! This advanced tool uses machine learning to predict your vehicle's fuel efficiency based on various characteristics.
decision-tree gradient-boosting-classifier html-css-javascript linear-regression machile-learning matplotlib python random-forest scikit-learn tailwindcss
Last synced: 18 Apr 2026
https://github.com/chengetanaim/high-school-alcoholism-and-academic-performance
Student Alcoholism and Academic Performance Data Analysis
Last synced: 18 Apr 2026
https://github.com/giacomolat/object-detection-sperimental-thesis-for-degree
In this repository is my experimental thesis work on the recognition of museum works through object detection techniques.
convolutional-neural-networks detectron2 jupyter-notebook machine-learning neural-networks object-detection python pytorch rcnn rcnn-model scikit-learn
Last synced: 18 Apr 2026
https://github.com/eugen-goebel/predictive-analytics-agent
Automated ML pipeline — data profiling, preprocessing, model training, and evaluation report generation
automation data-science docker machine-learning predictive-analytics python scikit-learn streamlit
Last synced: 05 Apr 2026
https://github.com/sundanc/weatherprediction
This project implements a weather prediction system that predicts the temperature based on real-time weather data, including features like humidity, wind speed, and day-related features (day of the week, month
machine-learning machinelearning numpy pandas programming python scikit-learn scikitlearn-machine-learning weather-prediction
Last synced: 18 Apr 2026
https://github.com/akhundmuzzammil/energyconsumptionprediction
This repository contains code and resources for training a linear regression model to predict energy consumption based on various building parameters.
data-analysis energy-consumption linear-regression machine-learning python scikit-learn streamlit visualization
Last synced: 18 Apr 2026
https://github.com/hariprasath-v/machinehack-analytics-olympiad-2022
Create a machine learning model to help an insurance company understand which claims are worth rejecting and the claims which should be accepted for reimbursement.
catboost-classifier exploratory-data-analysis logloss machinehack numpy optuna pandas python scikit-learn shap
Last synced: 18 Apr 2026
https://github.com/alezoon/movie-revenue-prediction
Sk-learn practice using Linear Regression, ML workflow practice.
jupyter machine-learning matplotlib-pyplot numpy pandas python scikit-learn
Last synced: 05 Apr 2026
https://github.com/ksasi/dog-breed-classifier
Dog Breed Classifier
cnn cnn-classification computer-vision deep-learning deep-neural-networks keras keras-neural-networks machine-learning numpy pandas python scikit-learn
Last synced: 05 Apr 2026
https://github.com/ricardorobledo/next_level_data_science
matplotlib numpy pandas python3 scikit-learn
Last synced: 05 Apr 2026
https://github.com/simrandalal/semantic-book-recommender
A semantic content-based book recommender using sentence-transformer embeddings, cosine similarity, and a Streamlit interface.
dotenv huggingface-transformers nlp-machine-learning pandas python scikit-learn similarity-search streamlit
Last synced: 05 Apr 2026
https://github.com/murugavl/flower-prediction
Flower Prediction is a machine learning project that uses the Iris dataset to classify iris flowers into three species: Setosa, Versicolor, and Virginica. The project includes data analysis, model training with various algorithms, and deployment via a Flask web application for user-friendly predictions.
flask machine-learning matplotlib numpy pandas python scikit-learn seaborn
Last synced: 05 Apr 2026
https://github.com/elprofesoriqo/kagglecompetitions
Kaggle competitions projects
artificial-intelligence machine-learning python pytorch scikit-learn
Last synced: 05 Apr 2026
https://github.com/taqsblaze/hush
Hush: A lightweight, context-aware text toxicity classifier. Leveraging NLP and Random Forest ensemble learning to detect and mitigate harmful language in real-time. Built for efficiency, safety, and cleaner digital communication.
content-moderation machine-learning nlp random-forest safety-tools scikit-learn text-classification toxicity-detection
Last synced: 05 Apr 2026
https://github.com/deliprofesor/game-search-volume-prediction-machine-learning-models-and-forecasting
This repository uses machine learning models like Random Forest, XGBoost, LightGBM, and time-series forecasting with Prophet to predict game search volumes. Additionally, Grid Search is applied for hyperparameter tuning of the LightGBM model.
data-cleaning data-science data-visualization feature-selection forecasting-models game-search grid-search hyperparameter-tuning lightgbm machine-learning pandas prophet python random-forest scikit-learn time-series-analysis time-series-forecasting xgboost
Last synced: 18 Apr 2026
https://github.com/malick08012/heart-disease-prediction
A machine learning project that predicts the risk of heart disease based on patient health data. Includes data cleaning, EDA, visualization, model training, evaluation and feature importance analysis
artificial-intelligence heartdisease-prediction logistic-regression machine-learning python scikit-learn
Last synced: 18 Apr 2026
https://github.com/manalisbhavsar/mall-customers-clustering
K-Means clustering to mall customer data, segmenting customers based on their annual income and spending score. To identify patterns and group customers for targeted marketing.
data-analysis data-visualization matplotlib numpy pandas python scikit-learn
Last synced: 18 Apr 2026
https://github.com/jeffandyalltogether/mlrecommendationsystem
project code for a recommendation system for Amazon using collaborative filtering, ranking, and matrix factorization to enhance customer satisfaction and product discovery.
eda matplotlib pandas python scikit-learn seaborn tensorflow
Last synced: 05 Apr 2026
https://github.com/naren1704/ml-approach-for-employee-performance-prediction
A Flask UI that predicts the performance of employee based on the XGBoost trained model.
css flask html python scikit-learn xgboost
Last synced: 05 Apr 2026
https://github.com/yashrajgithub/crop-recommendation
KrishiGyaan is a web app designed to help farmers make informed decisions on crop selection. By analyzing soil and environmental factors, the app provides personalized crop recommendations, enhancing agricultural productivity and promoting sustainable farming practices.
api artificial-intelligence crop-recommendation-system data-preprocessing data-visualization json machine-learning-algorithms pickle python random-forest-classifier scikit-learn streamlit supervised-learning train-test-split user-interface
Last synced: 05 Apr 2026
https://github.com/barek2k2/ml_ruby
Ruby gem uses Machine Learning(ML) techniques to make predictions and classifications, and it's powered by Python3 under the hood.
artificial-intelligence data-science machine-learning pandas prediction python3 ruby ruby-on-rails scikit-learn
Last synced: 05 Apr 2026
https://github.com/merekat/ml-shortcut-library
A Visual Studio Code shortcut library designed to simplify and accelerate machine learning development.
cnn coding data-science deep-learning efficiency extension extensions fnn machine-learning machinelearning nlp numpy pandas python scikit-learn shortcut shortcuts tensorflow visual-studio visual-studio-code
Last synced: 05 Apr 2026
https://github.com/emilyfelker/ieee_cis_fraud_detection
Which online transactions are fraudulent? Program that uses various machine learning algorithms to detect fraud.
decision-trees kaggle logistic-regression machine-learning neural-network pandas poetry pytest python scikit-learn sklearn tensorflow xgboost
Last synced: 05 Apr 2026
https://github.com/oadultradeepfield/galaxy10-anomaly-detection
A public API and experimental PyTorch pipeline for anomaly detection in the Galaxy10 DECals dataset using ResNet50, autoencoders, and clustering techniques
flask google-cloud-run kaggle pytorch scikit-learn
Last synced: 05 Apr 2026
https://github.com/lexxai/goit_python_ds_hw_04
Модуль 4. Класифікація та оцінка роботи моделі. Лінійна регресія: перенавчання та регуляризація
lasso-regression linear-regression numpy pandas python red regression ridge-regression scikit-learn
Last synced: 05 Apr 2026
https://github.com/nowon1/insurance-claim-prediction_version
This project aims to predict the insurance claim amounts based on various customer attributes using machine learning techniques. The project involves data preprocessing, exploratory data analysis, feature engineering, and model training and evaluation.
data-preprocessing data-science data-visualization exploratory-data-analysis feature-engineering insurance jupyter-notebook machine-learning numpy pandas predictive-modeling python random-forest regression-analysis scikit-learn
Last synced: 05 Apr 2026
https://github.com/perpendicooler/elementary-research-for-steamboat-willie-s-store-in-poland
An elementary research for a company to opening store in a city using gurobi and pulp optimization.
christofides-algorithm gurobipy numpy pandas pulp python3 scikit-learn travelling-salesman-problem
Last synced: 05 Apr 2026
https://github.com/billy0402/python-machine-learning
A learning project from NTUB machine learning course.
ai course jupyter-notebook python scikit-learn tensorflow
Last synced: 05 Apr 2026
https://github.com/lorenzorottigni/ml-movies
Machine Learning python bootcamp: Recommender Systems on movies dataset
ipynb machine-learning numpy pandas python recommender-system scikit-learn seaborn
Last synced: 05 Apr 2026
https://github.com/amiegirl/sentiment_analyzer_app
Sentiment Analysis and Deployment of Real-time Flipkart Product Reviews
aws aws-ec2 flask logistic-regression machine-learning naive-bayes-classifier pandas python random-forest-classifier scikit-learn wordcloud
Last synced: 05 Apr 2026
https://github.com/kenatsf/basic_data_analysis
Basic data science project: ETL, forecast and data visualization.
analysis data data-analysis data-science logistic-regression matplotlib matplotlib-pyplot numpy pandas powerbi python scikit-learn time-series time-series-analysis time-series-forecasting
Last synced: 05 Apr 2026
https://github.com/manojpatra1991/machine-learning-engineer-nanodegree
Machine Learning Engineer Nanodegree Projects - My Submissions
adaboost csv-files decision-tree html jupyter-notebook linear-regression machine-learning machine-learning-algorithms machine-learning-nanodegree naive-bayes-classifier python3 scikit-learn support-vector-machine
Last synced: 18 Apr 2026
https://github.com/mbarbetti/mediastinal-lymphoma-classification
Machine-learning-based classification of bulky mediastinal lymphomas using radiomic features
diagnosis-prediction lymphoma-classification machine-learning personalized-treatment precision-medicine radiomics-analysis scikit-learn texture-analysis
Last synced: 18 Apr 2026
https://github.com/thekartikeyamishra/ai-customer-feedback-summarizer
The AI Customer Feedback Summarizer is a Python-based application that processes customer feedback, extracts insights, and summarizes reviews. This basic version uses extractive summarization techniques, and the advanced version integrates advanced sentiment analysis, visualization, and industry-specific fine-tuning.
ai chatbot gpt machine-learning matplotlib nltk pandas python scikit-learn streamlit
Last synced: 18 Apr 2026
https://github.com/nabilshadman/python-classification-and-generative-models
Applications of classification and generative models with Python
classification data-science data-visualization generative-model machine-learning matplotlib numpy pandas scikit-learn
Last synced: 19 Apr 2026
https://github.com/vijaykumarr1452/black_friday_sales_analysis
Black Friday Sales Analysis python machine learning project using pandas and scikit-learn for data preprocessing, model training, and performance evaluation.
confusion-matrix jupyter-notebook machine-learning pandas python random-forest-classifier sales-analysis scikit-learn
Last synced: 19 Apr 2026
https://github.com/kheriberto/linear_regression_ecommerce
Simple project showcasing crafting a linear regression model with SciKit Learn
data-analysis jupyter-notebook linear-regression pandas python scikit-learn seaborn
Last synced: 19 Apr 2026
https://github.com/syzygianinfern0/ml-tutorial
Basics of ML to Help Beginners made in Jupyter Notebook
jupyter-notebook machine-learning machine-learning-algorithms machine-learning-coursera scikit-learn
Last synced: 19 Apr 2026
https://github.com/yassin522/heartbeat-categorization
This project is aimed at developing a machine learning model that can accurately classify heartbeats as either normal or abnormal. The model is trained on a dataset of ECG (electrocardiogram) signals, which were collected from patients and labeled by medical professionals.
cnn deep-learning keras machine-learning scikit-learn tensorflow
Last synced: 20 Apr 2026
https://github.com/kaladabrio2020/machine-learning-with-pytorch-and-scikit-learn
Progress on the book machine learning with pytorch and scikit-learn
deep-learning implementation machine-learning python3 pytorch scikit-learn
Last synced: 20 Apr 2026
https://github.com/vyjayanthipolapragada/car_mileage_prediction
Predicting the mileage of car using the linear regression model with Scikit-learn
kaggle-titanic linear-regression machine-learning numpy pandas predictive-modeling python scikit-learn
Last synced: 20 Apr 2026
https://github.com/prahaladhchandrahasan/housingprices_adavanced_regression
A machine learning model for "House Prices: Advanced Regression Techniques" kaggle competition.
machine-learning-algorithms matplotlib-pyplot numpy pandas python3 scikit-learn
Last synced: 20 Apr 2026
https://github.com/namratha2301/carprice_analysisandprediction
This project analyzes factors influencing vehicle prices using a dataset of various attributes, including Engine capacity, Power, Mileage, and Seating capacity.
data-analysis data-visualization exploratory-data-analysis machine-learning pandas predictive-modeling random-forest-classifier regression scikit-learn seaborn
Last synced: 20 Apr 2026
https://github.com/zawadi-wanjiru/house-prices-prediction-group-project
Predicting House Prices Using Regression Analysis
datacleaning datavisualization descriptive-statistics exploratory-data-analysis jupyter-notebook matplotlib modelling pandas-library predictive-analysis python regression-analysis scikit-learn seaborn-python
Last synced: 20 Apr 2026
https://github.com/bruceunx/ai-simulator
aiplayground 人工智能学习乐园
ai maching-learning scikit-learn
Last synced: 20 Apr 2026
https://github.com/dahsie/spam_classification
Ce fut mon prémier projet NLP où j'ai réalisé la détection de spam en utilisant les algorithmes d'embedding pour encorder mes textes. J'ai utilisé Random Forest et Milti-Layres Perceptrons pour la phase de classification. Ce qui a pemit l'obtension des précisions respective de 97% et 98%. J'ai aussi appris à documenter mes codes via sphinx
doc2vec fasttext-embeddings gensim glove-embeddings python scikit-learn sphinx-doc word2vec-algorithm
Last synced: 20 Apr 2026
https://github.com/grandechowhiskey/harvard-cs50-ai-projects
This project contains a collection of programming assignments from CS50’s Introduction to Artificial Intelligence with Python course.
html python scikit-learn tensorflow
Last synced: 20 Apr 2026
https://github.com/himasnhu-at/freecodecamp--ml
ML Models I built for my freeCodeCamp's Machine Learning with Python certification
freecodecamp freecodecamp-project machine-learning machine-learning-algorithms matplotlib pandas python scikit-learn
Last synced: 20 Apr 2026
https://github.com/tr-3n/-ai-powered-resume-analyzer-multi-source-job-matcher
AI-Powered Resume Analyzer & Multi-Source Job Matcher, is a web application built using Python and Streamlit that helps job seekers find the best job opportunities based on their resume. The app extracts text from uploaded resumes, matches it with job listings from multiple sources, and displays the most relevant jobs.
ai api html-css job job-recommendation job-search jobmatching natural-language-processing pandas pypdf2 python resume-analyzer scikit-learn streamlit web-development
Last synced: 20 Apr 2026
https://github.com/tryomar/data-miner
DataMiner is an interactive web application for data mining and machine learning. It helps users upload, clean, transform, and analyze datasets while building predictive models — all through a simple and powerful Streamlit interface.
data-cleaning data-mining data-preprocessing data-science data-visualization interactive-dashboards pandas python scikit-learn streamlit
Last synced: 20 Apr 2026
https://github.com/alphacrypto246/customer-churn
This project predicts customer churn using machine learning. It includes data preprocessing, exploratory analysis, model training, and evaluation to identify key factors driving churn and provide actionable insights for retention.
knn-classification machine-learning machine-learning-algorithms python scikit-learn scikitlearn-machine-learning
Last synced: 20 Apr 2026
https://github.com/abdel-17/facial-recognition
Facial recognition using Machine Learning in Python
machine-learning pca python scikit-learn
Last synced: 20 Apr 2026
https://github.com/simhayn/binary-classification
Alzheimer's disease detection using XGBoost and other prediction models.
alzheimer-disease-prediction binary-classification exploratory-data-analysis mental-health prediction-model python scikit-learn xgboost
Last synced: 20 Apr 2026
https://github.com/chdl17/lead-score-case-study
Lead scoring is the process of assigning a numerical value or score to each lead, based on factors such as demographics and behavior, to determine their potential value as customers.
machine-learning-algorithms matplotlib-pyplot python scikit-learn
Last synced: 20 Apr 2026
https://github.com/kimaya012/fake-news-detection
This project detects whether a news is fake or not using machine learning.
decision-tree-classifier fake-news-detection gradient-descent logistic-regression machine-learning python random-forest-classifier scikit-learn sklearn
Last synced: 20 Apr 2026
https://github.com/ghufranbarcha/linear-regression-training-app
This project is a Streamlit application that allows users to upload a CSV file, select variables, and train a linear regression model. The app provides an easy-to-use interface for selecting dependent and independent variables, scaling data, applying polynomial regression, and evaluating model performance.
data-science machine-learning python scikit-learn streamlit
Last synced: 20 Apr 2026
https://github.com/kerushani/sign-language-detection
A sign language detector.
mediapipe opencv python scikit-learn
Last synced: 20 Apr 2026
https://github.com/adityapradhan202/binge-trend
Media and entertainment recommendation website with AI powered recommendation system.
datascience-machinelearning natural-language-processing python scikit-learn spacy-nlp
Last synced: 21 Apr 2026
https://github.com/yogeshsinghkatoch9/advanced_nyc_housing_price_prediction
A robust ensemble learning framework for advanced NYC housing price prediction, leveraging global, clustered, and local ensembles with hyperparameter tuning.
data-science ensemble-learning housing-prices machine-learning new-york python scikit-learn
Last synced: 21 Apr 2026
https://github.com/sayan-mondal2022/mlops-assignment
A project for validating the Machine learning models
machine-learning scikit-learn streamlit
Last synced: 22 Apr 2026
https://github.com/h-sarhan/hate-speech-classifier
Automatic Detection of Hate Speech and Offensive Content
Last synced: 22 Apr 2026
https://github.com/5hraddha/megaline-plan-recommendations
Megaline is a telecom operator and it offers its clients two prepaid plans, Surf and Ultimate.Megaline has found out that many of their subscribers use legacy plans. They want to develop a model that would analyze subscribers' behavior and recommend one of Megaline's newer plans: Smart or Ultra.
decision-tree-classifier logistic-regression random-forest-classifier scikit-learn supervised-learning
Last synced: 22 Apr 2026
https://github.com/waikato-datamining/spectral-data-converter-sklearn
Scikit-learn plugins for the spectral-data-converter library.
kasperl scikit-learn sdc seppl spectral-data
Last synced: 24 Apr 2026
https://github.com/sabin74/movie_recommendation_system
A Python-based movie recommendation engine built using the MovieLens Dataset that supports:
collaborative-filtering content-based-filtering cosine-similarity movie-lens movie-recomendation-system pyhton3 scikit-learn tf-idf-vectorizer
Last synced: 24 Apr 2026
https://github.com/hoccyy/house-price-prediction
Machine learning model built with Scikit-learn to predict house prices based on various features.
linear-regression machine-learning ml pickle prediction-model scikit-learn scikitlearn-machine-learning
Last synced: 24 Apr 2026
https://github.com/mcp-tool-shop-org/runforge-vscode
RunForge VS Code Extension - Push-button ML training with presets
deterministic developer-tools machine-learning mcp python scikit-learn training typescript vscode vscode-extension
Last synced: 25 Apr 2026
https://github.com/jawwad-fida/data-science-salary-estimator
A tool that estimates data science salaries (MAE ~ $ 11K) to help data scientists negotiate their income when they get a job.
data-science machine-learning project scikit-learn
Last synced: 25 Apr 2026
https://github.com/capsuleismail/parkinsons-telemonitoring-dataset
Dataset used to predict Parkinson’s disease severity based on biomedical voice measurements.
data-science jupyter-notebook machinelearning-python scikit-learn
Last synced: 25 Apr 2026
https://github.com/sarangs1621/weather-prediction
Weather Prediction Using Machine Learning is a project that leverages machine learning algorithms to predict weather conditions based on historical data. It evaluates three popular ML models (Decision Tree, KNN, and Logistic Regression) and provides performance insights through metrics and visualizations.
data-analysis decision-tree jupyter-notebook knn logistic-regression machine-learning predictive-modeling python scikit-learn weather-prediction
Last synced: 25 Apr 2026
https://github.com/bp0609/decision-tree-implementation-from-scratch
This repo contains the decision tree implementation from scratch for all possible cases i) discrete features, discrete output; ii) discrete features, real output; iii) real features, discrete output; iv) real features, real output.
decision-tree-classifier decision-tree-regressor scikit-learn
Last synced: 26 Apr 2026
https://github.com/deliprofesor/cinematic-data-analytics-and-recommendation-platform
This project analyzes a movie dataset using machine learning algorithms to predict success, explore revenue-popularity relationships, and develop recommendation systems. It employs techniques like K-Means, DBSCAN, GMM, decision trees, PCA, and NLP for insights and personalized suggestions.
clustering content-based-recommendation data-analysis data-visualization decision-tree gmm k-means machine-learning natural-language-processing nlp pca predictive-modeling python recommendation-system scikit-learn user-based-recommendation
Last synced: 26 Apr 2026
https://github.com/a-n-i-t-t-a/credit_card_fraud_detection
Fraudulent transactions are a growing concern in the financial sector, and leveraging machine learning can help detect anomalies in real-time. I built a Credit Card Fraud Detection System using the K-Nearest Neighbors (KNN) algorithm, trained on a dataset with key transaction patterns.
flask knn-classifier machine-learning pandas python scikit-learn
Last synced: 27 Apr 2026
https://github.com/leolion3/smartnanotubes-smellinspector-companion
Companion software for the SmellInspector Devices from SmartNanoTubes. Allows specifying substances, connecting multiple devices, collecting data and performing machine learning.
docker machine-learning python3 reactjs scikit-learn smartnanotubes smellinspector
Last synced: 27 Apr 2026
https://github.com/mihirmakwana03/ci7521-cw1-notebook
Multi-class classification on imbalanced data — 8 sklearn classifiers + SMOTE + ROC-AUC benchmarking. Kingston CI7521 CW1.
classification hyperparameter-tuning imbalanced-data machine-learning scikit-learn smote
Last synced: 27 Apr 2026
https://github.com/toscdom/spam_detection
This repository contains a project focused on analyzing and classifying emails to detect SPAM. It includes: Training a machine learning classifier for SPAM detection. Identifying key topics in SPAM emails using NLP techniques. Calculating semantic distances to evaluate topic similarity. Tools used include Python libraries like nlp frameworks
classifier nlp nltk scikit-learn semantic-analysis spam-detection
Last synced: 27 Apr 2026
https://github.com/sundanc/movierecommendation
Movie recommendation system based on user input. Built with Streamlit
movie-recommendation-app python scikit-learn scikitlearn-machine-learning streamlib
Last synced: 27 Apr 2026
https://github.com/davidrpugh/kaust-dsa-201
Course materials for KAUST DSA 201
deep-learning machine-learning pytorch scikit-learn
Last synced: 27 Apr 2026
https://github.com/capsuleismail/spambase
Classifying Email as Spam or Non-Spam with RandomForestClassifier
datascience jupyter-notebook machinelearning-python scikit-learn
Last synced: 28 Apr 2026
https://github.com/tillscode/personal-finance-ml-analysis
Machine learning analysis of personal financial data with predictive modeling and interactive dashboard
dashboard data-analysis finance machine-learning python scikit-learn
Last synced: 28 Apr 2026
https://github.com/renoyegon/customer_segmentation_using_kmeans_clustering
This project applies KMeans clustering to segment customers in the Online Retail II dataset. Using powerful Python libraries such as pandas, scikit-learn, matplotlib, and seaborn, we uncover meaningful customer behavior patterns
kmeans-clustering matplotlib scikit-learn seaborn
Last synced: 28 Apr 2026