scikit-learn
scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.
- GitHub: https://github.com/topics/scikit-learn
- Wikipedia: https://en.wikipedia.org/wiki/Scikit-learn
- Repo: https://github.com/scikit-learn/scikit-learn
- Created by: David Cournapeau
- Released: January 05, 2010
- Related Topics: scikit, python
- Aliases: sklearn
- Last updated: 2026-06-25 00:23:58 UTC
https://github.com/mgesteban/analyzing_car_prices
A comprehensive data science project analyzing factors that drive used car prices to provide actionable insights for used car dealerships.
crisp-dm data-science lasso-regression linear-regression machine-learning one-hot-encoding pandas ridge-regression scikit-learn
Last synced: 15 Feb 2026
https://github.com/quran-yeamen/serverlifecycleml
Predictive modeling of server lifecycle stages using synthetic data and machine learning.
data-science machine-learning predictive-modeling python scikit-learn synthetic-data
Last synced: 15 Feb 2026
https://github.com/ramezze/virtualmouse-handtracking
A Python application to control mouse functions with hand gestures, using MediaPipe, OpenCV, and TensorFlow.
computer-vision handtracking handtrackingmodule kivy machine-learning mediapipe mediapipe-hands mouse multilayer-perceptron neural-network python scikit-learn scipy tensorflow virtualmouse webcam webcam-capture
Last synced: 15 Feb 2026
https://github.com/grandechowhiskey/fcc-machine_learning-boilerplates
A collection of projects completed as part of the FreeCodeCamp "Machine Learning with Python" certification. These projects focus on implementing machine learning models, data preprocessing, and predictive analysis using libraries like scikit-learn and TensorFlow.
ai ml python3 scikit-learn tensorflow
Last synced: 06 May 2026
https://github.com/sridharyadav07/machine-learning-project-combined-cycle-power-plant-
Multiple machine learning models, including Linear Regression, Decision Tree Regression, and Random Forest Regression, were implemented to predict the target variable and evaluated using metrics such as RMSE, MAE, and R-squared. The performance of these models was compared, and the Random Forest Regressor was found.
data-processing decisiontreeregressor linear-regression metrics-evaluation python random-forest-regressor scikit-learn
Last synced: 16 Apr 2026
https://github.com/pksvv/machinelearning_svm
Various implementations of the Support Vector Machine algorithm
machine-learning python scikit-learn support-vector-machine
Last synced: 04 May 2026
https://github.com/tansudasli/tensorflow-sandbox
all about machine learning
apache-arrow keras machine-learning numpy pandas scikit-learn tensorflow2
Last synced: 16 Apr 2026
https://github.com/ksasi/smartcab
machine-learning numpy pandas python reinforcement-learning scikit-learn
Last synced: 16 Apr 2026
https://github.com/lorenzorottigni/ml-advertising
Machine learning Python bootcamp: logistic regression on an advertising dataset
ipynb logistic-regression machine-learning numpy pandas python scikit-learn seaborn
Last synced: 16 Apr 2026
https://github.com/eljandoubi/predict-customer-churn-with-clean-code
Clean Code Principles
autopep8 logging numpy pandas pylint pytest scikit-learn
Last synced: 16 Apr 2026
https://github.com/piotrwnuczek/cloudprediction
Predicting cloud task execution time using AI/ML
matplotlib pandas python scikit-learn
Last synced: 16 Apr 2026
https://github.com/silky-x0/spam-detector
A machine learning algorithm to detect spam emails and similar messages.
jupyter-notebook nltk-python pandas python3 scikit-learn
Last synced: 16 Apr 2026
https://github.com/thinker84/real-time-stock-price-prediction-and-market-analysis-using-machine-learning
Real-time stock price prediction app using LSTM, Streamlit, and historical data (2010–2023). Forecasts next 10 days & visualizes trends.
data-science django lstm machine-learning numpy pandas pandas-datareader scikit-learn stock-market stock-price-prediction stooq streamlit yahoo-finance yahoo-finance-api
Last synced: 13 Jul 2025
https://github.com/otuemre/emailphishingdetection
A real-time phishing email detection system using Machine Learning (SVM, Logistic Regression, Naive Bayes) with FastAPI backend and custom domain deployment.
cybersecurity fastapi huggingface machine-learning nlp real-time scikit-learn spam-detection svm-classifier tfidf-vectorizer
Last synced: 13 Apr 2026
https://github.com/pierluigi-failla/pipesnake
a pandas sklearn-inspired pipeline data processor
feature-engineering feature-extraction pandas pipeline pipeline-data-processor python scikit-learn
Last synced: 03 Mar 2026
https://github.com/sadmansakib93/mental-resilience-analysis-using-machine-learning
Uses supervised and unsupervised ML techniques to analyze the mental health and resilience levels of medical students [Project completed in December 2019]
artificial-intelligence classification clustering correlation linear-regression machine-learning machine-learning-algorithms mental-health python regression resilience scikit-learn statistical-analysis
Last synced: 06 May 2026
https://github.com/eshansugeesh/fico-score-loan-default-modeling-project
Credit risk assessment using FICO score segmentation, loan default modeling, discretization techniques, and log-likelihood evaluation for predictive analytics in financial services.
bucketing classification credit-risk customer-segmentation data-science discretization fico-score financial-analytics loan-analysis loan-default log-likelihood machine-learning numpy pandas predictive-modeling risk-modeling scikit-learn segmentation statistical-modelling
Last synced: 06 May 2026
https://github.com/thekartikeyamishra/customer-retention-predictor
The Customer Retention Predictor is a Python-based tool designed to help businesses predict customer churn using historical data. This project is particularly beneficial for small businesses and MSMEs in India, allowing them to identify customers at risk of leaving and take proactive measures to retain them.
joblib machine-learning numpy pandas python scikit-learn tinker
Last synced: 16 Apr 2026
https://github.com/intscription/machine-learning
Machine learning and its advanced concepts
adaboost numpy pandas pca-analysis pipeline random-forest scikit-learn svm
Last synced: 28 Apr 2026
https://github.com/archish27/pythontutorial
Python programming tutorial for newcomers who want to learn Python from scratch and apply it to various applications
matplotlib numpy pandas pygame python python-2 python-3 scikit-learn soup
Last synced: 01 Apr 2026
https://github.com/pranav-nagpure/dog-breed-prediction-nb
IPython Notebook to build the model for Dog Breed Prediction
ipython-notebook keras-tensorflow machine-learning scikit-learn transfer-learning xception-net
Last synced: 03 Mar 2026
https://github.com/sahiltiwariiii/dssp
Predicting student math scores! This project uses advanced machine learning techniques and MLOps tools such as DVC and MLflow to predict a student's math score based on factors such as gender, race/ethnicity, parental level of education, lunch type, test preparation course, writing, etc.
docker dotenv dvc flask machine-learning mlflow mlops mysql mysql-connector-python numpy pandas pymysql python python-dotenv scikit-learn seaborn sklearn-library statistics streamlit
Last synced: 27 Mar 2026
https://github.com/shreeparab1890/indian-cricketer-classifier
This notebook builds a model that predicts an Indian cricketer from a given image. The project covers 8 Indian cricketers and builds a model to classify a given image among these 8 cricketers.
image-classification matplotlib numpy opencv pandas python random-forest-classifier scikit-learn sklearn streamlit
Last synced: 01 Apr 2026
https://github.com/ifte-13/early-stage-brain-stroke-detection
Predictive analysis and early detection of brain stroke using machine learning algorithms
decision-tree imbalanced-learn knn matplotlib numpy pandas random-forest scikit-learn seaborn
Last synced: 06 Jul 2025
https://github.com/leftcoastnerdgirl/supervised_learning
This project demonstrates supervised machine learning using scikit-learn.
classification-reports confusion-matrix jupyter-notebook numpy pandas-python pathlib scikit-learn sklearn
Last synced: 16 Apr 2026
https://github.com/sanikamal/deep-learning-atoz
A collection of deep learning architectures, models, code snippets, tips, and mini projects.
computer-vision deep-learning nlp scikit-learn skimage tensorflow
Last synced: 16 Apr 2026
https://github.com/bkamapantula/discover
Code search utility to assist developer workflows via code discovery. Currently uses TF-IDF estimator.
developer-tools python scikit-learn tf-idf
Last synced: 16 Apr 2026
https://github.com/grupoguerreroherrera/ethical-ai-recruitment-audit
Bias audit toolkit reproducing the recruitment AI case from Activity 6 — Unidad 3, Electiva II Inteligencia Artificial Avanzada. Empirical analysis with reweighing mitigation, Model Card documentation, and APA 7 references.
academic-project algorithmic-auditing artificial-intelligence bias-mitigation disparate-impact ethical-ai fairness machine-learning model-card python random-forest recruitment-bias reweighing scikit-learn unesco-ai-ethics
Last synced: 03 Jun 2026
https://github.com/danishtalpur/sentiview-website
SentiView is a sentiment analysis tool designed to analyze and interpret the emotions behind tweets on Twitter. The platform processes textual data from user-generated tweets to determine the sentiment behind them—whether they are positive, negative, or neutral.
css flask html java naive-bayes-classifier scikit-learn twitter-sentiment-analysis
Last synced: 16 Apr 2026
https://github.com/kalpthakkar/jobpilot-ai
JobPilot AI is a next-generation, AI-powered job application and management platform that automates the end-to-end process of job searching, intelligent application submission, and workflow analytics. It combines state-of-the-art AI, ML, NLP, and cloud technologies to deliver a seamless, highly customizable, and extensible solution for job seekers.
artificial-intelligence automation beautifulsoup chromadb fastapi gmail-api jobs langchain llm lxml nlp nltk ollama python3 pywinauto rag scikit-learn selenium sqlite-database
Last synced: 08 Apr 2026
https://github.com/priyanshul28/ml_eda_regression_energyconsumptionforecasting
An EDA and Machine Learning Time-Series Regression Forecasting exercise on the PMJE Energy Consumption dataset demonstrating time-series analysis and the use of Time-Series Split, XGBoost, etc. The model is optimized using hyperparameter tuning through GridSearchCV. A Rob Mulla guided exercise.
forcasting machine-learning numpy pandas scikit-learn time-series-analysis
Last synced: 17 Apr 2026
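The combination of a time-series split and grid search mentioned in the entry above can be sketched roughly as follows. This is a minimal illustration, not the repository's code: the data is synthetic, the parameter grid is made up, and scikit-learn's GradientBoostingRegressor is swapped in for XGBoost so the example needs no extra dependency.

```python
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.model_selection import GridSearchCV, TimeSeriesSplit

# Synthetic hourly series with a daily cycle (assumption: stand-in for real energy data).
rng = np.random.default_rng(0)
t = np.arange(300)
X = np.column_stack([t % 24, t % 7])  # hypothetical calendar-style features
y = 10 + np.sin(2 * np.pi * (t % 24) / 24) + rng.normal(scale=0.1, size=t.size)

# TimeSeriesSplit always trains on earlier rows and validates on later ones,
# so no future information leaks into training.
cv = TimeSeriesSplit(n_splits=4)

# Illustrative grid only; GradientBoostingRegressor stands in for XGBoost here.
search = GridSearchCV(
    GradientBoostingRegressor(random_state=0),
    param_grid={"n_estimators": [50, 100], "max_depth": [2, 3]},
    cv=cv,
    scoring="neg_mean_absolute_error",
)
search.fit(X, y)
print(search.best_params_)
```

The rows must already be in chronological order for the split to respect time.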
https://github.com/ejw-data/proj-food-inspections
Analyzing Chicago Food Inspection data for interesting insights by combining multiple data resources and performing feature engineering.
decision-trees pandas preprocessing python scikit-learn
Last synced: 17 Apr 2026
https://github.com/archismwanchatterjee/football-match-prediction
Football Match Predictor for top leagues
ensemble-machine-learning machine-learning pandas random-forest-classifier rmse-score scikit-learn streamlit
Last synced: 17 Apr 2026
https://github.com/abhishek-k-git/soulmegle
expressjs flask html nodejs numpy python3 reactjs scikit-learn socket tailwindcss
Last synced: 01 Apr 2026
https://github.com/tnleite/loan-approval-prediction
This repository presents a predictive model for loan approval, focused on minimizing default risk. Using EDA and machine learning algorithms (Random Forest, XGBoost), we adjust the threshold to maximize recall for defaulters, contributing to efficient risk management.
classification-algorithm data-science exploratory-data-analysis machine-learning-algorithms machine-learning-models matplotlib numpy scikit-learn scipy seaborn xgboost-classifier
Last synced: 13 Apr 2026
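Adjusting the decision threshold to favor recall, as described in the entry above, might look something like this. It is a sketch under stated assumptions: the data is synthetic, the 0.3 cutoff is an arbitrary illustrative value, and only a Random Forest is used.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import recall_score
from sklearn.model_selection import train_test_split

# Synthetic imbalanced data (assumption: class 1 plays the role of defaulters).
X, y = make_classification(n_samples=600, weights=[0.85, 0.15], random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

model = RandomForestClassifier(n_estimators=100, random_state=0).fit(X_train, y_train)
proba = model.predict_proba(X_test)[:, 1]  # predicted probability of class 1

# Lowering the cutoff flags more cases as positive, so recall cannot decrease
# (precision usually pays for it).
recall_default = recall_score(y_test, (proba >= 0.5).astype(int))
recall_lowered = recall_score(y_test, (proba >= 0.3).astype(int))
print(recall_default, recall_lowered)
```

In practice the cutoff would be chosen on a separate validation set rather than on the test data.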
https://github.com/erikglz/coap-mtd
Repository for an IoT security project implementing Moving Target Defense (MTD) through CoAP protocol randomization to mitigate spoofing attacks and enhance adaptive security.
coap-protocol cybersecurity iot machine-learning python scikit-learn spoofing
Last synced: 17 Apr 2026
https://github.com/vaishnavis03/finlatics_ml_program
This repository contains the .ipynb files for 3 datasets, along with a PPT for each. The datasets included are Facebook Marketplace Data, Sales Prediction Data, and Wine Quality data.
correlation data-analysis data-science data-visualization knn linear-regression machine-learning matplotlib numpy pandas random-forest-classifier scikit-learn
Last synced: 17 Apr 2026
https://github.com/danicc097/python-ml-app
Various [arguably useless] Machine Learning services with gRPC and OpenTelemetry for demo purposes
grpc-python opentelemetry scikit-learn
Last synced: 17 Apr 2026
https://github.com/sankoktas/bhi360-fall-detection
Fall detection system using Bosch BHI360 sensor data with time-series labeling, feature extraction, and machine learning (LOSO CV + Gradient Boosting).
accelerometer bhi360 bosch-sensors data-augmentation fall-detection feature-extraction gradient-boosting gyroscope human-activity-recognition label-studio loso-cross-validation machine-learning python scikit-learn sensor-data smote time-series
Last synced: 07 May 2026
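Leave-one-subject-out (LOSO) cross-validation with gradient boosting, as named in the entry above, can be approximated in scikit-learn with LeaveOneGroupOut. The sketch below uses random synthetic features and hypothetical subject IDs; it is not the repository's pipeline.

```python
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import LeaveOneGroupOut, cross_val_score

# Synthetic windows of sensor features (assumption: 4 subjects, 30 windows each).
rng = np.random.default_rng(0)
n_subjects, windows_per_subject = 4, 30
X = rng.normal(size=(n_subjects * windows_per_subject, 6))
y = (X[:, 0] + 0.5 * X[:, 1] > 0).astype(int)  # made-up binary label
groups = np.repeat(np.arange(n_subjects), windows_per_subject)  # subject ID per window

# Each fold holds out every window from one subject, so the model is always
# scored on a person it has never seen.
scores = cross_val_score(
    GradientBoostingClassifier(n_estimators=50, random_state=0),
    X, y, groups=groups, cv=LeaveOneGroupOut(),
)
print(scores)
```

One score is produced per held-out subject.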
https://github.com/iamwatchdogs/cardiovascular-risk-prediction
This mini-project uses machine learning algorithms to predict possible risks of heart disease by analyzing given data.
jupyter-notebook machine-learning-algorithms matplotlib numpy pandas python scikit-learn seaborn
Last synced: 02 Apr 2026
https://github.com/akshitvats026/heart_disease_prediction
An ML-based Heart Disease Prediction System that predicts the likelihood of heart disease based on user health parameters. Built using Python, Pandas, and Scikit-learn, the system performs data preprocessing, trains a predictive model, and provides real-time predictions with high accuracy.
accuracy-score logistic-regression machine-learning matplotlib-pyplot numpy pandas python scikit-learn
Last synced: 02 Apr 2026
https://github.com/satyas567/weatherdataanalysis
Comprehensive Weather Data Analysis with Python: Explore trends, visualize patterns, detect outliers, and predict temperature using humidity and wind speed
jupyter-notebook linear-regression matplotlib numpy pandas python scikit-learn seaborn
Last synced: 02 Apr 2026
https://github.com/emanuel-poblano/stock-market-predictor
An end-to-end Python stock price prediction project that pulls real market data, performs feature engineering, trains a machine learning model, and predicts the next-day closing price of a stock.
matplotlib pandas python scikit-learn yfinance
Last synced: 13 Apr 2026
https://github.com/srosalino/prediction_of_seoul_bikes_demand
The objective of this project is to predict the number of bicycles needed to be made available each hour in order to make the service as efficient as possible
cross-validation data-exploration-and-preprocessing hyperparameter-tuning machine-learning regularization-methods scikit-learn
Last synced: 13 May 2026
https://github.com/anu-gtb/campusplacement
Link to Streamlit app :
binary-classification campus-placements data-science data-visualization machine-learning matplotlib ml-algorithms predictive-modeling python3 scikit-learn seaborn statistics streamlit
Last synced: 06 May 2026
https://github.com/prashver/end-to-end-model-deployment-on-aws
Student Performance Analysis with Machine Learning analyzes factors impacting student outcomes using a robust machine learning pipeline. Achieving an impressive R2 score, it predicts student performance effectively. With extensive data preprocessing and deployment on AWS Elastic Beanstalk, it ensures scalability and high availability.
amazon-web-services aws-elastic-beanstalk end-to-end-deployment flask machine-learning-algorithms matplotlib numpy pandas scikit-learn seaborn
Last synced: 02 Apr 2026
https://github.com/soroush-04/apple-stock-prediction
CNN-LSTM model for stock price prediction
convolutional-neural-networks keras machine-learning python scikit-learn tensorflow
Last synced: 02 Apr 2026
https://github.com/felixamaladhas/amazon-reviews-sentiment-analysis
This is a sentiment analysis project that classifies Amazon product reviews as positive or negative using machine learning techniques.
matplotlib numpy pandas python scikit-learn
Last synced: 02 Apr 2026
https://github.com/mayankyadav23/shipment-pricing-prediction
Shipment Pricing Prediction 📦🔍 is a machine learning project that forecasts shipment prices based on various supply chain factors. Using advanced regression models, it provides valuable insights 📊 to optimize pricing strategies in the supply chain analytics domain.
data-visulization flask ineuron-ai machine-learning python scikit-learn shipment-and-pricing
Last synced: 02 Apr 2026
https://github.com/otuemre/obesity-classification
Machine learning project to classify obesity levels based on health metrics like age, sex, height, weight, and BMI.
classification data-science healthcare machine-learning obesity-classification scikit-learn
Last synced: 17 Apr 2026
https://github.com/nikhilgugwad/sentiment-analysis
Sentiment analysis for the Kannada language to classify Kannada sentences into different emotions.
Last synced: 17 Apr 2026
https://github.com/ngangawairimu/linear-regression-
This project builds a linear regression model in Python to predict outcomes and derive insights from feature data. It covers data cleaning, feature analysis, and model evaluation, showcasing predictive modeling techniques using scikit-learn, pandas, and visualization libraries.
data-analysis linear-regression machine-learning predictive-modeling python scikit-learn
Last synced: 17 Apr 2026
https://github.com/rohansardar/speechflowguard
A machine learning web API that detects toxic language in user comments using classical ML
docker logistic-regression machine-learning python3 scikit-learn tf-idf tfidf-text-analysis tfidf-vectorizer
Last synced: 17 Apr 2026
https://github.com/mangesh-balkawade/pythonautomationsscripts
This repository contains the Python automation scripts, machine learning case studies, and Python projects that I have written to learn automation and ML using Python.
automation data-science machine-learning-algorithms matplotlib mongodb pandas python3 scikit-learn seaborn webscraping
Last synced: 13 Apr 2026
https://github.com/mnitin-reddy/content-based-recommendation-system-using-deep-learning
A content-based movie recommendation system using deep learning to predict user ratings by leveraging user and movie features. The system integrates neural networks for feature extraction, utility scripts for data processing, and supports both new and existing user recommendations.
deep-learning keras neural-networks numpy pandas python scikit-learn tensorflow
Last synced: 03 Apr 2026
https://github.com/sabin74/boston_house_prediction
This project aims to predict the median value of owner-occupied homes in Boston suburbs using various machine learning regression models. Multiple regression techniques were applied, including Linear Regression, Decision Tree, Random Forest, Gradient Boosting and dimensionality reduction with PCA. Hyperparameter tuning was performed.
boston-housing-price-prediction hyperparameter-tuning kaggle-dataset pca-analysis python3 regression-models scikit-learn
Last synced: 06 May 2026
https://github.com/deshwalx/diabetes-prediction-svm
My first ML project using SVM to predict diabetes
beginner-project classification diabetes machine-learning python scikit-learn svm svm-classifier
Last synced: 06 May 2026
https://github.com/belzebu013/prever_nivel_colesterol
AI project using a multiple linear regression algorithm to predict an individual's cholesterol level.
ia jupiter-notebook pandas python regressao-linear-multipla scikit-learn
Last synced: 17 Apr 2026
https://github.com/msikorski93/seed-clustering
Performing basic clustering on a seeds dataset.
agglomerative clustering dbscan gaussian-mixture-model gmm mini-batch-kmeans scikit-learn seeds
Last synced: 13 May 2026
https://github.com/mryutaro/spla3clip
spla3clip: a tool for Splatoon 3 that automatically analyzes the times at which kills and deaths occurred
fastapi python react scikit-learn typescript
Last synced: 04 Apr 2026
https://github.com/rickyarians/ai-ml-nlp
Directory of Machine Learning, Deep Learning, Artificial Intelligence, and Natural Language Processing projects
deep-learning machine-learning modeling python scikit-learn tensorflow
Last synced: 04 Apr 2026
https://github.com/snikumbh/archr
archR: Identifying promoter sequence architectures de novo using NMF
archr discovery nmf non-negative-matrix-factorization promoter-sequence-architectures r r-package scikit-learn sequence-architectures unsupervised-machine-learning
Last synced: 18 Apr 2026
https://github.com/mnj-tothetop/english-handwritten-characters-recognizer
A handwritten English character recognizer [0-9, A-Z, a-z] built using a dataset of 3409 images. TensorFlow, Keras, scikit-learn, and OpenCV were used to implement the Convolutional Neural Network (CNN). Matplotlib and Seaborn were used to visualize the data.
artificial-intelligence convolutional-neural-networks keras matplotlib opencv-python scikit-learn seaborn tensorflow
Last synced: 18 Apr 2026
https://github.com/bjpcjp/scikit-learn
Updates in progress. Jupyter workbooks will be added as time allows.
Last synced: 18 Apr 2026
https://github.com/justsecret123/nba-players-stats-analysis
A quick interactive Notebook to visualize some NBA players stats (points, assists, steals, blocks...) and totals, rankings and comparisons. Feel free to add any player in the .csv data files. 🏀
csv ipython-notebook ipywidgets jupyter-notebook jupyterlab matplotlib pandas python scikit-learn seaborn
Last synced: 18 Apr 2026
https://github.com/gattsu001/telecom-churn-predictor
Predicts which telecom customers are likely to churn with 95% accuracy using engineered features from usage, billing, and support data. Implements Sturges-based binning, one-hot encoding, stratified 80/20 train-test split, and a two-level ensemble pipeline with soft voting. Achieves 94.60% accuracy, 0.8968 AUC, 0.8675 precision, 0.7423 recall.
churn-prediction classification classification-algorithm customer-retention data-science data-visualization feature-engineering joblib jupyter-notebook machine-learning pandas scikit-learn supervised-learning svm
Last synced: 18 Apr 2026
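The steps named in the entry above (Sturges-based bin count, stratified 80/20 split, soft-voting ensemble) could be sketched as follows. This is an illustration on synthetic data with two arbitrarily chosen base models; it does not reproduce the repository's two-level pipeline or its reported metrics.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier, VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Synthetic, imbalanced stand-in for churn data (assumption, not the real dataset).
X, y = make_classification(n_samples=500, n_features=8, weights=[0.8, 0.2], random_state=0)

# Sturges' rule: number of bins = ceil(log2(n)) + 1.
n_bins = int(np.ceil(np.log2(len(X)))) + 1

# Stratified 80/20 split keeps the churn rate similar in train and test.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

# Soft voting averages the predicted class probabilities of the base models.
ensemble = VotingClassifier(
    estimators=[
        ("lr", LogisticRegression(max_iter=1000)),
        ("rf", RandomForestClassifier(n_estimators=50, random_state=0)),
    ],
    voting="soft",
)
ensemble.fit(X_train, y_train)
proba = ensemble.predict_proba(X_test)
print(n_bins, proba.shape)
```

Soft voting requires every base estimator to expose `predict_proba`, which both models here do.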
https://github.com/zvdy/movie_recommendation
Movie Recommendation Search Engine using Jupyter Notebooks, pandas, NumPy, scikit-learn, and IPyWidgets
data-science jupyter-notebook machine-learning numpy pandas python scikit-learn
Last synced: 13 Apr 2026
https://github.com/tanim-mishkat/data-science-prediction-model-pds-course-
Diabetes Progression Prediction Using Regression Analysis: This project uses regression analysis in Python to predict diabetes progression based on medical and physiological data. Includes data preprocessing, model training, evaluation, and visualizations.
data-science machine-learning python regression scikit-learn
Last synced: 19 Apr 2026
https://github.com/pedroteixeiraw/variational_quantum_circuit_binary_classification
This project focuses on developing a Variational Quantum Circuit capable of performing Binary Classification between two classes: red wine and white wine, based on their characteristics using machine learning.
binary-classification cost-function json machine-learning matplotlib numpy pandas qiskit qiskit-machine-learning quantum-machine-learning scikit-learn training-data variational-circuit
Last synced: 04 Apr 2026
https://github.com/abdul-rafay19/california-housing-price-prediction
This project predicts California housing prices using machine learning regression models, including Random Forests and Decision Trees. It covers data preprocessing, exploratory analysis, model training, and hyperparameter tuning to optimize performance.
decision-trees gridsearchcv linear-regression matplotlib numpy pandas python random-forest randomsearch-cv scikit-learn scipy seaborn
Last synced: 04 Apr 2026
https://github.com/adhadse/hands-on-machine-learning-book-notes-and-practice
This repo holds the Jupyter notebooks and datasets containing notes/comments on things I learned from this book. Feel free to use and learn from them.
data-science deep-learning jupyter-notebooks keras machine-learning python scikit-learn tensorflow
Last synced: 04 Apr 2026
https://github.com/kaladabrio2020/livro-ml-with-pytorch-and-sk
Progress in each chapter
jupyter-notebook matplotlib-pyplot nump pandas python3 pytorch scikit-learn
Last synced: 04 Apr 2026
https://github.com/yashsonaar/machine-learning-tasks
This repository contains machine learning tasks, including classification, a recommendation system, and a fraud detection system
classification jupyter-notebook machine-learning numpy pandas prediction python scikit-learn testing
Last synced: 04 Apr 2026
https://github.com/chengetanaim/high-school-alcoholism-and-academic-performance
Student Alcoholism and Academic Performance Data Analysis
Last synced: 18 Apr 2026
https://github.com/eugen-goebel/predictive-analytics-agent
Automated ML pipeline — data profiling, preprocessing, model training, and evaluation report generation
automation data-science docker machine-learning predictive-analytics python scikit-learn streamlit
Last synced: 05 Apr 2026
https://github.com/akhundmuzzammil/energyconsumptionprediction
This repository contains code and resources for training a linear regression model to predict energy consumption based on various building parameters.
data-analysis energy-consumption linear-regression machine-learning python scikit-learn streamlit visualization
Last synced: 18 Apr 2026
https://github.com/alezoon/movie-revenue-prediction
Sk-learn practice using Linear Regression, ML workflow practice.
jupyter machine-learning matplotlib-pyplot numpy pandas python scikit-learn
Last synced: 05 Apr 2026
https://github.com/ksasi/dog-breed-classifier
Dog Breed Classifier
cnn cnn-classification computer-vision deep-learning deep-neural-networks keras keras-neural-networks machine-learning numpy pandas python scikit-learn
Last synced: 05 Apr 2026
https://github.com/robson-python/academic-performance
Project to evaluate students' academic performance.
csv-import data data-analysis data-science jupyter-notebook machine-learning matplotlib pandas python scikit-learn seaborn vscode
Last synced: 12 Apr 2026
https://github.com/ricardorobledo/next_level_data_science
matplotlib numpy pandas python3 scikit-learn
Last synced: 05 Apr 2026
https://github.com/barbarpotato/applied-data-science-with-python-specialization
This skills-based specialization is intended for learners who have a basic python or programming background, and want to apply statistical, machine learning, information visualization, text analysis, and social network.
data-science matplotlib pandas scikit-learn
Last synced: 06 May 2026
https://github.com/taqsblaze/hush
Hush: A lightweight, context-aware text toxicity classifier. Leveraging NLP and Random Forest ensemble learning to detect and mitigate harmful language in real-time. Built for efficiency, safety, and cleaner digital communication.
content-moderation machine-learning nlp random-forest safety-tools scikit-learn text-classification toxicity-detection
Last synced: 05 Apr 2026
https://github.com/deliprofesor/game-search-volume-prediction-machine-learning-models-and-forecasting
This repository uses machine learning models like Random Forest, XGBoost, LightGBM, and time-series forecasting with Prophet to predict game search volumes. Additionally, Grid Search is applied for hyperparameter tuning of the LightGBM model.
data-cleaning data-science data-visualization feature-selection forecasting-models game-search grid-search hyperparameter-tuning lightgbm machine-learning pandas prophet python random-forest scikit-learn time-series-analysis time-series-forecasting xgboost
Last synced: 18 Apr 2026
https://github.com/busradeveci/student-performance-prediction
A machine learning project to predict student exam performance based on academic, social, and personal features. Built with Python and scikit-learn.
data-analysis kaggle linear-regression machine-learning predictive-modeling python scikit-learn student-performance
Last synced: 25 Apr 2025
https://github.com/manalisbhavsar/mall-customers-clustering
Applies K-Means clustering to mall customer data, segmenting customers by annual income and spending score to identify patterns and group customers for targeted marketing.
data-analysis data-visualization matplotlib numpy pandas python scikit-learn
Last synced: 18 Apr 2026
https://github.com/adarshpheonix2810/resume-screening
This project is designed to classify resumes into different job categories based on their content. Users can upload resumes in PDF, DOCX, or TXT format, and the application will predict the category using a pre-trained machine learning model. It aims to enhance the job application process for candidates and recruiters alike.
data-science machine-learning natural-language-processing python resume-classification scikit-learn streamlit
Last synced: 13 Apr 2026
https://github.com/jeffandyalltogether/mlrecommendationsystem
Project code for a recommendation system for Amazon using collaborative filtering, ranking, and matrix factorization to enhance customer satisfaction and product discovery.
eda matplotlib pandas python scikit-learn seaborn tensorflow
Last synced: 05 Apr 2026
https://github.com/barek2k2/ml_ruby
A Ruby gem that uses machine learning (ML) techniques to make predictions and classifications, powered by Python 3 under the hood.
artificial-intelligence data-science machine-learning pandas prediction python3 ruby ruby-on-rails scikit-learn
Last synced: 05 Apr 2026
https://github.com/emilyfelker/ieee_cis_fraud_detection
Which online transactions are fraudulent? Program that uses various machine learning algorithms to detect fraud.
decision-trees kaggle logistic-regression machine-learning neural-network pandas poetry pytest python scikit-learn sklearn tensorflow xgboost
Last synced: 05 Apr 2026
https://github.com/lexxai/goit_python_ds_hw_04
Module 4. Classification and model evaluation. Linear regression: overfitting and regularization
lasso-regression linear-regression numpy pandas python red regression ridge-regression scikit-learn
Last synced: 05 Apr 2026
https://github.com/perpendicooler/elementary-research-for-steamboat-willie-s-store-in-poland
Elementary research for a company opening a store in a city, using Gurobi and PuLP optimization.
christofides-algorithm gurobipy numpy pandas pulp python3 scikit-learn travelling-salesman-problem
Last synced: 05 Apr 2026
https://github.com/oenm176/hmeq-loan-analysis
Extracting insights from the Home Equity (HMEQ) dataset. This project builds a classification model to detect bad loans, featuring complete data preprocessing, normalization, and tree visualization using Python.
classification-model credit-risk-analysis data-mining data-science data-visualization decision-tree hmeq-dataset machine-learning python scikit-learn student-project
Last synced: 13 Apr 2026
https://github.com/josepablodmg/python--linear-regression-advertising
A linear regression analysis to predict sales based on advertising spending across TV, radio, and newspaper channels. The project includes exploratory data analysis, model training, coefficient visualization, and residual analysis.
advertising data-analysis exploratory-data-analysis linear-regression machine-learning python regression scikit-learn visualization
Last synced: 06 May 2026
https://github.com/kenatsf/basic_data_analysis
Basic data science project: ETL, forecast and data visualization.
analysis data data-analysis data-science logistic-regression matplotlib matplotlib-pyplot numpy pandas powerbi python scikit-learn time-series time-series-analysis time-series-forecasting
Last synced: 05 Apr 2026
https://github.com/manojpatra1991/machine-learning-engineer-nanodegree
Machine Learning Engineer Nanodegree Projects - My Submissions
adaboost csv-files decision-tree html jupyter-notebook linear-regression machine-learning machine-learning-algorithms machine-learning-nanodegree naive-bayes-classifier python3 scikit-learn support-vector-machine
Last synced: 18 Apr 2026
https://github.com/mbarbetti/mediastinal-lymphoma-classification
Machine-learning-based classification of bulky mediastinal lymphomas using radiomic features
diagnosis-prediction lymphoma-classification machine-learning personalized-treatment precision-medicine radiomics-analysis scikit-learn texture-analysis
Last synced: 18 Apr 2026
https://github.com/rafay-imraan/recommendation-system
A machine learning model that outputs personalized similar movie recommendations for people based on the ones they have rated positively.
machine-learning pandas python scikit-learn
Last synced: 06 May 2026
https://github.com/pratyush905/farecast-nyc-taxifare-predictor
Machine learning models to predict NYC taxi fares based on a given dataset
jupiter-notebook kaggle machine-learning matplotlib numpy python regression-models scikit-learn
Last synced: 13 Apr 2026
https://github.com/avtorgenii/ml-playground
A repository for exploring and experimenting with datasets, building machine learning models, and testing various techniques in data preprocessing, feature engineering, and model evaluation.
matplotlib ml pandas scikit-learn
Last synced: 06 May 2026