scikit-learn
scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.
- GitHub: https://github.com/topics/scikit-learn
- Wikipedia: https://en.wikipedia.org/wiki/Scikit-learn
- Repo: https://github.com/scikit-learn/scikit-learn
- Created by: David Cournapeau
- Released: January 05, 2010
- Related Topics: scikit, python,
- Aliases: sklearn,
- Last updated: 2026-07-02 00:27:34 UTC
- JSON Representation
https://github.com/icepanorama/internship-visualizations-and-demonstrations
A collection of some of the programs that I've written over the course of my internship.
artificial-intelligence machine-learning matplotlib numpy pandas python3 pytorch scikit-learn
Last synced: 14 Apr 2026
https://github.com/mahdi-meyghani/movie-recommendation-system
A Python-based movie recommendation system utilizing popularity-based, content-based, and collaborative filtering models with data science and machine learning techniques.
data-analysis data-science machine-learning recommendation-system scikit-learn scikitlearn-machine-learning
Last synced: 23 Jan 2026
https://github.com/djdhairya/rooftop-solar-detection
data-processing data-science deep-learning eda machine-learning pandas scikit-learn tif
Last synced: 27 Jan 2026
https://github.com/juzershakir/student-intervention
Predicting the likelihood that a given student will pass or not!
adaboost classification f1-score gridsearchcv logistic-regression machine-learning python2 random-forest scikit-learn student-intervention supervised-learning
Last synced: 22 Oct 2025
https://github.com/shubhamsoni98/prediction-with-binomial-logistic-regression
To predict client subscription to term deposits and optimize marketing strategies by identifying potential subscribers.
binomial data data-science eda machine-learning matplotlib pipeline python scikit-learn seaborn sklearn sql visualization
Last synced: 06 Feb 2026
https://github.com/trilokida/named-entity-recognition-and-classification
Given a string statement, the aim is to identify the B-Protein entity in the statement.
bagging-ensemble classification dictvectorizer ensemble-learning machine-learning naive-bayes-classifier named-entity-recognition ner random-forest scikit-learn
Last synced: 17 May 2026
https://github.com/haseeeb21/machine-learning-models
Machine Learning Models trained on Scikit-learn datasets. This repository contains the code files and saved models trained on Toy datasets (Classification & Regression), and Real World dataset.
anaconda classification classification-models jupyter-notebook knn knn-classification machine-learning machine-learning-algorithms python3 regression regression-models scikit-learn scikit-learn-python scikitlearn-machine-learning svm svm-classifier vscode
Last synced: 07 May 2026
https://github.com/messierandromeda/sentiment-analysis
Sentiment analysis with the IMDB movie review dataset.
imdb-dataset python scikit-learn sentiment-analysis
Last synced: 28 Jan 2026
https://github.com/bilgenurbekar/turkishcyberbullying
Contains fine-tuned BERT models and results in the text classification category using Turkish social media data
bert-fine-tuning huggingface-transformers matplotlib numpy pandas python pytorch scikit-learn transformers
Last synced: 07 Mar 2026
https://github.com/alexliap/sk_serve
Deployment of a Scikit-Learn model and it's column transformations made easy.
machine-learning mlops model-deployment scikit-learn
Last synced: 24 Oct 2025
https://github.com/jofaval/sonar
Binary Classification of Sonar Signals of Rocks and Metal cylinders in 1987
data-analysis data-science data-visualization machine-learning python scikit-learn sonar uci
Last synced: 09 Apr 2026
https://github.com/santiagoenriquega/ez-animate
A Python package for creating Matplotlib animations with minimal code. Built to quickly visualize model behavior.
animation machine-learning matplotlib python scikit-learn
Last synced: 15 Mar 2026
https://github.com/luliatuccu/weather_analysis
This project highlights a combination of data science techniques and Python programming to explore real-world weather data.
data-preprocessing eda feature-engineering machine-learning matplotlib numpy pandas regex scikit-learn seab seaborn weather weather-patterns
Last synced: 02 Apr 2026
https://github.com/smahala02/svm-machine-learning
This repository provides an in-depth tutorial and practical implementation of Support Vector Machines (SVM) for classification tasks, using Python and popular data science libraries.
classification data-science machine-learning python scikit-learn svm
Last synced: 30 Jan 2026
https://github.com/naufal-yafi/knn-regression.model
K-Nearest Neighbors
data-science knn-regression machine-learning scikit-learn
Last synced: 17 May 2026
https://github.com/anmolian/distracted_driver_classification
Image Classification
computer-vision deep-learning machine-learning scikit-learn tensorflow
Last synced: 02 May 2026
https://github.com/pradeep31747/smartsuggest-personalized_product_recommendations
This project implements a personalized product recommendation system using machine learning techniques to enhance user experience and drive engagement.
jupyter-notebook keras numpy pandas pyhton scikit-learn sql tensorflow vscode
Last synced: 28 Jan 2026
https://github.com/adriantomin/bulldozer-price-prediction
Predicting the Sale Price of Bulldozers Using Machine Learning ππ° This project uses machine learning to predict bulldozer sale prices based on historical data from the Kaggle Bluebook for Bulldozers competition. The goal is to minimize the RMSLE between actual and predicted prices.
data-science jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn
Last synced: 23 Jan 2026
https://github.com/mohammad95labbaf/churn-prediction
This project aims to predict customer churn using machine learning algorithms. The project includes data preprocessing, feature engineering, and model evaluation.
adaboost bagging churn churn-analysis churn-prediction decisiontree ensemble-learning knn randomforest scikit-learn sklearn svm voting
Last synced: 23 Jan 2026
https://github.com/elprofesoriqo/chrome-extension-gmail-spam-filter
Chrome extension that automatically identifies and moves emails marked as spam to the spam folder in Gmail.
api-client chrome-extension firebase-database javascript machine-learning python scikit-learn
Last synced: 09 Apr 2026
https://github.com/sonaligill/olympics-analysis
The outcome of this project is an interactive streamlit web application that visualizes the analysis of Olympic data while rendering different aspects of Olympic history, compare country performances, and gain insights into athlete demographics.
numpy plotly python scikit-learn scipy streamlit
Last synced: 28 Jan 2026
https://github.com/rahul-120/crop_recom
This project is a Machine Learning based Crop Recommendation System built using Flask. It helps farmers or users decide the most suitable crop to grow based on soil nutrients and environmental conditions.
crop-recommendation-system flask flask-application machine-learning python3 scikit-learn
Last synced: 02 May 2026
https://github.com/ficaan/ml-dl-projects
A collection of Machine Learning and Deep Learning projects implemented with frameworks including PyTorch, TensorFlow and scikit-learn.
deep-learning deep-learning-projects machine-learning machine-learning-projects pytorch scikit-learn tensorflow
Last synced: 27 Oct 2025
https://github.com/phonhay103/goodreads-300k-dataset
kaggle linear-regression pandas scikit-learn
Last synced: 08 May 2026
https://github.com/asut00/Machine-Learning-Program_42AI
Comprehensive Machine Learning path by 42AI: hands-on modules on regression, gradient descent, and real-world ML applications.
linear-regression machine-learning matplotlib numpy pandas python scikit-learn
Last synced: 27 Oct 2025
https://github.com/raulmaulidhino-dev/ml_modelling_regression
There are many factors that influence the grades/scores of students. One of the factors is study hours. In this mini analysis project, there are 3 models that will learn and predict the relation between study hours of students and their scores in an exam/test. This project will result the best ML model to solve the problem.
data data-analysis-python data-science eda machine-learning scikit-learn
Last synced: 28 Jan 2026
https://github.com/nordszamora/ds-ml-projects
My repository for Data Science & Machine Learning projects.
data-analysis data-science data-visualization jupyter-notebook kaggle machine-learning matplotlib numpy pandas python scikit-learn seaborn
Last synced: 15 Apr 2026
https://github.com/anuranjanjain/video-upscaler
A WebAPP designed for upscalling video to HD Resolution using custom Denoise filter and OpenCV
artificial-intelligence opencv python scikit-learn tailwindcss
Last synced: 24 Jan 2026
https://github.com/sahraiidle/email-spam-detector
Email/SMS spam detector with a Flask UI/API, tuned ML models (TFβIDF + SVM/LogReg/NB), and a ready-to-run web form plus JSON endpoint for predictions.
data machine-learning numpy pandas python randomforest scikit-learn spam-classifier spam-detection svm
Last synced: 24 Jan 2026
https://github.com/dilkushsingh/iris-species-classifier
Used Random Forest Classifier Algorithm
huggingface-spaces numpy pandas python random-forest-classifier scikit-learn streamlit
Last synced: 15 Apr 2026
https://github.com/snigdho8869/numerical-data-analysis-projects
Exploring numerical data analysis with credit card churn, fraud detection, health predictions and more.
adaboost cnn data-analysis deep-learning dnn ensemble-learning exploratory-data-analysis gradient-boosting-classifier keras logistic-regression machine-learning ml numeric numerical-analysis pandas python3 random-forest scikit-learn support-vector-machines tensorflow
Last synced: 15 Apr 2026
https://github.com/djdhairya/parkinson-s-disease-detection
datapreprocessing machine-learning modeling numpy pandas scikit-learn svm
Last synced: 15 Apr 2026
https://github.com/pyzit/recommandation-engine-in-drf-sk-learn
Full Stack Movie Recommendation System Project made in Django REST Framework and React JS
api django django-rest-framework movies reactjs recommender-system scikit-learn
Last synced: 28 Jan 2026
https://github.com/glennx1/heartdrive
ML-powered heart disease predictor using Streamlit, featuring data preprocessing, visualization, and user input interface.
matplotlib pandas python scikit-learn seaborn streamlit
Last synced: 29 Apr 2026
https://github.com/engineertolulope/us_states_living_ranking_analysis
Python script for analyzing and ranking U.S. states based on factors like cost of living, tax burden, diversity, crime rates, and climate. Uses weighted criteria to identify the best states to live in according to these metrics. Ideal for decision-making on relocation.
data-analysis data-science linear-regression machine-learning python scikit-learn
Last synced: 29 Jan 2026
https://github.com/allwin107/loan-prediction-web-app
A Flask-based loan prediction web app using a Random Forest model to predict loan approval based on user input. It includes a clean, responsive UI, form validation, and real-time prediction display.
classification data-processing deployment flask loan-prediction machine-learning python random-forest-classifier scikit-learn web-application
Last synced: 15 Apr 2026
https://github.com/samjoesilvano/airline_ticket_fare_prediction
Airline Fare Prediction using Machine Learning focuses on developing a Random Forest model to predict flight prices, achieving an RΒ² score of 0.804. The project includes hyperparameter tuning using RandomizedSearchCV, alongside extensive data preprocessing and feature engineering to ensure robust model performance.
airline-fare-prediction data-preprocessing data-visualization feature-engineering feature-selection hyperparameter-tuning machine-learning pandas python random-forest randomizedsearchcv regression-analysis scikit-learn
Last synced: 15 Apr 2026
https://github.com/shahaba83/airplane-ticket-cancellation
In this project, we try to predict the possibility of canceling the plane ticket by the buyer
datatime numpy pandas python scikit-learn seaborn
Last synced: 25 Feb 2026
https://github.com/asherk7/house-price-prediction
House Prices - Advanced Regression Techniques - Predict sales prices and practice feature engineering, RFs, and gradient boosting
data-science numpy pandas regression scikit-learn
Last synced: 15 Apr 2026
https://github.com/chengetanaim/beatrecommendersystembackend
A system for music producers and rappers/singers. I was trying to implement the product recommendation feature for music uploaded by producers. I used the collaborative filtering algorithm to be able to recommend songs to users.
fastapi scikit-learn sqlalchemy unsupervised-learning
Last synced: 06 Feb 2026
https://github.com/beatrizandradeds/sistema-recomendacao-filmes
π¬ Sistema de RecomendaΓ§Γ£o de Filmes usando ML | VetorizaΓ§Γ£o de texto, cosine similarity e NLP com Python
content-based-filtering cosine-similarity data-science data-science-projects machine-learning natural-language-processing nlp portfolio python recommendation-system scikit-learn
Last synced: 29 Apr 2026
https://github.com/lau1944/coronavirus-world-prediction
Coronavirus Case Confirmed Trend Around The World
coronavirus pandas python scikit-learn
Last synced: 15 Apr 2026
https://github.com/ramy-badr-ahmed/higgs-dataset-training
Training Higgs Dataset with Keras - https://doi.org/10.5281/zenodo.13133945
binary-classification cuda-toolkit cupy dask dask-dataframes higgs-boson keras keras-tensorflow matplotlib matplotlib-python numpy pandas pandas-dataframe scikit-learn uci-dataset uci-machine-learning
Last synced: 30 Jan 2026
https://github.com/jaypanchal9/fraud-detection-case-study
A comprehensive case study applying machine learning techniques to detect fraudulent transactions effectively.
machine-learning matplotlib numpy pandas python3 scikit-learn seaborn xgboost
Last synced: 15 Apr 2026
https://github.com/linggarm/poultry-meat-freshness-classification-with-transfer-learning-using-resnet
The utilization of the ResNet architecture and Transfer Learning methods to classify Poultry Meat Freshness images into two categories, namely Fresh and Rotten.
artificial-intelligence binary-classification cnn colab-notebooks computer-vision deep-learning fine-tuning imagenet machine-learning meat-classification numpy pandas poultry python residual-networks resnet scikit-learn supervised-learning tensorflow transfer-learning
Last synced: 15 Apr 2026
https://github.com/bangaji313/recommender-system-movielens
Proyek Sistem Rekomendasi Film dengan Content-Based & Collaborative Filtering. Submission untuk modul Machine Learning Terapan di Coding Camp 2025.
collaborative-filtering content-based-filtering data-science deep-learning dicoding jupyter-notebook keras movie-recommendation movielens pandas python recommender-system scikit-learn tensorflow
Last synced: 15 Apr 2026
https://github.com/nathan-lindstedt/student_risk
Student Success Model (SSM)
fairlearn machine-learning scikit-learn shap student-risk xgboost
Last synced: 30 Jan 2026
https://github.com/itssahilwhat/ai-fundamentals
A curated collection of fundamental AI concepts, algorithms, and code implementations β including Machine Learning, Deep Learning, and Computer Vision β built from scratch and with practical examples.
computer-vision deep-learning machine-learning numpy pandas python pytorch scikit-learn
Last synced: 15 Apr 2026
https://github.com/tamk-kol/project_orbital_data_analysis
The goal of this project is to develop an automatic method to detect orbital maneuvers using machine learning.
matplotlib numpy pandas scikit-learn
Last synced: 30 Jan 2026
https://github.com/umasivakumar14/f1imagerecognition
Predicts the F1 driver given in an image of their face.
crossvalidation flask gridsearchcv haar-cascade image-recognition machine-learning python scikit-learn seaborn wavelet-transform
Last synced: 25 Feb 2026
https://github.com/diiblo/la-poste-predictive-flux
PrΓ©diction journaliΓ¨re du flux de colis dans les centres de tri de La Poste. Pipeline complet : gΓ©nΓ©ration de donnΓ©es, modΓ©lisation LightGBM, orchestration via Airflow (Docker), stockage PostgreSQL et dashboard interactif Streamlit. Projet rΓ©alisΓ© en MastΓ¨re 2 Data Engineering Γ lβECE Paris.
airflow docker postgresql scikit-learn streamlit
Last synced: 31 Jan 2026
https://github.com/gunjangyl/iris-detection
The Iris Detection Project classifies different species of Iris flowers using machine learning techniques. It analyzes four key featuresβsepal length, sepal width, petal length, and petal widthβto predict one of three classes: Setosa, Versicolor, or Virginica. The project uses algorithms like KNN, Decision Trees, or SVM for classification. Model pe
knn-classification matplotlib python scikit-learn seaborn
Last synced: 15 Apr 2026
https://github.com/arraypd/airflow-etl-elt
airflow duckdb elt etl matplotlib metabase ml numpy pandas postgresql python scikit-learn
Last synced: 15 Apr 2026
https://github.com/manu-karenite/medical-insurance-cost-predictor
Medical Insurance Cost Generator is a Linear Regression based Predictor which is used to estimate and predict the Cost a person has to pay while Buying a Medical Insurance.
kaggle-dataset linear-regression machine-learning matplotlib numpy pandas python3 reactjs scikit-learn
Last synced: 15 Apr 2026
https://github.com/emv271828/diabetes_cdc_uci_machine_learning
Segunda avaliaΓ§Γ£o para a disciplina de InteligΓͺncia Artificial da Universidade Federal Fluminense.
jupyter-notebook machine-learning pandas python scikit-learn
Last synced: 15 Apr 2026
https://github.com/jofaval/titanic-disaster
Data Analysis of the famous Titanic Disaster in 1912 with Machine Learning
classification data-analysis data-science data-visualization google-colab kaggle machine-learning python scikit-learn
Last synced: 15 Apr 2026
https://github.com/sarmad426/ai
AI basic to advanced featuring Machine Learning, Deep Learning and Data Science.
ai data-science deep-learning hugging-face machine-learning numpy pandas python scikit-learn
Last synced: 15 Apr 2026
https://github.com/christiansandovalgarcia01-creator/megaline-plan-classifier
Modelo de clasificaciΓ³n para recomendar plan Smart vs Ultra (Megaline). Split 60/20/20, RandomForest ganador, accuracy TEST β₯ 0.75. Incluye matriz de confusiΓ³n y classification report. Stack: Python, Pandas, scikit-learn, Jupyter.
classification data-science jupyter-notebook machine-learning python random-forest scikit-learn telecom
Last synced: 15 Apr 2026
https://github.com/samiyaalizaidi/nn-ml-homeworks
Homework solutions for CPE-4903: Neural Networks & Machine Learning at Kennesaw State University.
machine-learning machine-learning-workflow neural-networks numpy scikit-learn
Last synced: 15 Apr 2026
https://github.com/as1467/canada-per-capita-income-prediction
This project is a simple machine learning exercise to predict Canada's per capita income based on historical data. The dataset used in this project was sourced from the CodeBasics GitHub repository and is used here to practice linear regression as part of my machine learning learning process.
machine-learning matplotlib-pyplot pandas python scikit-learn
Last synced: 15 Apr 2026
https://github.com/moustafamohamed01/breast-cancer-prediction
A machine learning model built with PyTorch to predict if a tumor is malignant or benign using the Breast Cancer Dataset. The model uses a neural network to classify the data and shows how to train, evaluate, and visualize results.
ai data-science deep-learning machine-learning neural-network python pytorch scikit-learn
Last synced: 15 Apr 2026
https://github.com/rooshmica/concrete-compressive-strength-prediction
Concrete Dataset for Regression Model training
linear-regression machine-learning polynomial-regression python rmse-score scikit-learn
Last synced: 15 Apr 2026
https://github.com/nikitalpopov/evotor_champ
solution for evotor data challenge
data-analysis data-science python scikit-learn
Last synced: 15 Apr 2026
https://github.com/idaraabasiudoh/telco-churn-logistic-regression
A predictive model using logistic regression to identify customers likely to churn from a telecommunications company.
logistic-regression machine-learning python3 scikit-learn
Last synced: 01 Feb 2026
https://github.com/nits2612/data-science-projects
Portfolio of data science projects completed by me during PGP AI/ML, self learning, and hobby purposes.
data data-science dataanalysis deep deep-learning keras machine-learning matplotlib numpy opencv pandas python scikit-learn seaborn surprise-python tensorflow transfer-learning
Last synced: 01 Feb 2026
https://github.com/tanzim-prog/sentiment_analysis_ml_stringdata
The motive of this project is to find out the customer satisfaction of some residential hotels of Dhaka.
accuracy-score classification confusion-matrix f1-score logistic-regression machine-learning model-evaluation modelbuilding nlp nltk numpy pandas precision-score recall-score scikit-learn sentiment-analysis
Last synced: 01 Feb 2026
https://github.com/udacity-machinelearning-internship/more-spam-classifying
Implementing more spam classifying using Ensemble Methods in python
classification jupyter jupyter-notebook machine-learning pandas python scikit-learn sklearn
Last synced: 15 Apr 2026
https://github.com/legendarym4x/data_science
Data Science Course
jupyter-notebook keras matplotlib nltk numpy pandas scikit-learn spacy tensorflow
Last synced: 02 Feb 2026
https://github.com/marcusjhang/nvda-timeseriesanalysis
NVDA stock price predictor (WIP)
keras lstm pandas scikit-learn time-series-analysis
Last synced: 15 Apr 2026
https://github.com/khanovico/python-stock-analyzer
This is a Webapp implemented by python and several data science frameworks, enabling online stock trend analyzing.
amcharts-js-charts data-analysis data-visualization flask javascript pandas python scikit-learn
Last synced: 02 Feb 2026
https://github.com/sarowarahmed/advertising-sales-app
π Advertising Sales Predictor: A web app powered by a Machine Learning model, built with Numpy, Pandas, Scikit-learn, and Streamlit, to forecast sales based on TV, Newspaper, and Online Advertising. Deployed on Streamlit Cloud for real-time, easy-to-use predictions.
advertising app machine-learning multiple-linear-regression numpy pandas sales scikit-learn streamlit
Last synced: 07 Feb 2026
https://github.com/vladimiracunadev-create/python-data-science-program
Python Data Science Program β 197 clases en 9 partes. Pauta avanzada derivada de GΓ©ron, VanderPlas, Huyen, ISLP y Barocas/Hardt/Narayanan. Recurso personal de aprendizaje, enseΓ±anza y mejora continua.
bootcamp data-analysis data-science education jupyter machine-learning matplotlib numpy pandas python scikit-learn
Last synced: 01 Jun 2026
https://github.com/sharkb8t/credit-risk-classification
Demonstrates my abilities to use Jupyter Notebook with scikit-learn to train and evaluate a machine learning model.
jupyter-notebook numpy pandas pathlib python scikit-learn
Last synced: 15 Apr 2026
https://github.com/subratamondal1/quora-automated-text-classification
Text Classification with Bag of Words - Natural Language Processing
bag-of-words machine-learning matplotlib natural-language-processing nlp nltk numpy pandas python3 pytorch scikit-learn
Last synced: 15 Apr 2026
https://github.com/aerojam95/math70076-data-science-cw2
This repository presents the second coursework for the MATH70076 Data Science module at Imperial College London, where the project showcases different machine and deep learning models for image classification
data-science deep-learning machine-learning python3 pytorch scikit-learn
Last synced: 15 Apr 2026
https://github.com/danicaalana/wine-dataset-decision-tree
This project is developed as part of Digital Skill Fair (DSF) 35.0 - Data Science by Dibimbing. I am using Wine Recognition Dataset from scikit-learn, which is the results of a chemical analysis of wines grown in the same region in Italy by three different cultivators.
data data-analysis-python data-science decision-tree-classification machine-learning python scikit-learn wine-dataset
Last synced: 18 Apr 2026
https://github.com/tungct/tngtclassifer
bayes-classifier python2 scikit-learn
Last synced: 15 Apr 2026
https://github.com/max00358/sign_language_detection
A sign language detector that recognizes ASL(American Sign Language) alphabet
Last synced: 09 Feb 2026
https://github.com/sarowarahmed/predicting-kolkata-house-price
π Predicting Kolkata House Price: A web app powered by a Machine Learning model, built with Numpy, Pandas, Scikit-learn, and Streamlit, to predict house prices in Kolkata. Deployed on Streamlit Cloud for easy access and real-time predictions.
app kolkata linear-regression machine-learning numpy pandas scikit-learn streamlit
Last synced: 26 Feb 2026
https://github.com/codealphaprojects/spotify-music-recommender
CodeAlpha Machine Learning Task-1
jupyter-notebook machine-learning matplotlib numpy pandas pycharm-ide python scikit-learn spotify spotify-api streamlit vscode
Last synced: 09 Feb 2026
https://github.com/brossend/automl_bank_project
Automated ML pipeline for the UCI Bank Marketing dataset: ETL, Optuna-based AutoML, model evaluation, MLflow logging, pytest tests, Docker, and CI/CD.
automl bank-marketing binary-classification ci-cd classification data-science docker docker-compose etl github-actions gitlab-ci machine-learning ml-pipeline mlflow model-monitoring optuna pytest python scikit-learn uci-dataset
Last synced: 02 Jun 2026
https://github.com/sachinh123/cognitive-customer-insights-with-watson-ai
This project analyzes customer data to provide insights for personalized services, behavior prediction, and improved support.
flask ibm-cloud ibm-watson-assistant ibm-watson-nlu nltk python scikit-learn
Last synced: 10 Feb 2026
https://github.com/djdhairya/medicine-recommendation-system
csv deep-learning f1-score machine-learning pickle random-forest scikit-learn svc svm
Last synced: 10 Feb 2026
https://github.com/0eix/ibm-ds-spacex-falcon9
IBM Professional data science certificate Final Project Notebooks
data-science data-visualization exploratory-data-analysis ibm poetry scikit-learn shap
Last synced: 11 Feb 2026
https://github.com/nurulashraf/predictive-maintenance-analysis-for-machine-failure-prevention
Predictive maintenance analysis for machine failure prevention using sensor data and ML. Built a Random Forest model and Gradio dashboard to identify high-risk machines for proactive maintenance.
data-science failure-prediction gradio industrial-iot machine-learning power-bi predictive-maintenance python scikit-learn
Last synced: 16 Apr 2026
https://github.com/sabin74/fake_news_detection
This project implements a Fake News Detection system using Python, Natural Language Processing (NLP), and machine learning. It classifies news articles as Real or Fake based on their textual content.
fake-news-detection kaggle-dataset multinomial-naive-bayes passive-aggressive-classifier python3 regex scikit-learn
Last synced: 16 Apr 2026
https://github.com/arseniistratiuk/decision-tree-classifier-lab
A student lab project focused on tree algorithms and a Decision Tree classifier.
bellman-ford decision-tree-classifier floyd-warshall graph-algorithms kruskal-algorithm machine-learning networkx prim-algorithm scikit-learn
Last synced: 27 Feb 2026
https://github.com/sanjiv856/machine_learning_scikit-learn
Repository for machine learning in Python using Scikit-learn.
pipelines python scikit-learn sklearn titanic-kaggle titanic-survival-prediction
Last synced: 27 Feb 2026
https://github.com/codedby-mozz/habits_vs_academic_performance
This repository contains a Jupyter Notebook that explores the relationship between student lifestyle habits and academic performance. It demonstrates the process of data loading, exploratory data analysis (EDA), correlation analysis, and the development of a predictive model using linear regression to predict exam scores based on daily habits.
linear-regression python scikit-learn
Last synced: 16 Apr 2026
https://github.com/cego669/dirtycategoriesencoding
Repository containing two classes (StringAgglomerativeEncoder and StringDistanceEncoder) useful for grouping or visualizing the distance between dirty categorical variables. They are compatible with the scikit-learn API.
category clustering dimensionality-reduction dirty hierarchical-clustering machine-learning scikit-learn singular-value-decomposition svd
Last synced: 11 Feb 2026
https://github.com/c2ramel/autonomous-semantic-discovery
An unsupervised machine learning engine that utilizes Non-negative Matrix Factorization (NMF) to autonomously extract and visualize latent semantic topics from the 20 Newsgroups dataset.
data-visualization machine-learning nlp nmf python scikit-learn unsupervised-learning
Last synced: 16 Apr 2026
https://github.com/mindkerchief/baselineml
A collection of machine learning task performed during my studies in computer science major in intelligent system.
decision-tree dummy gaussian-mixture-models kmeans-clustering linear-regression logistic-regression machine-learning matplotlib numpy pandas random-forest scikit-learn seaborn tensorflow
Last synced: 16 Apr 2026
https://github.com/selcia25/iris-dataset-classification
βThis repository contains a Python script for classifying the Iris dataset using the Random Forest algorithm.
data-processing iris-classification pandas random-forest-classifier scikit-learn
Last synced: 16 Apr 2026
https://github.com/prajakta1321/seariseiq
Hybrid-Forecasting-and-Trend-Detection-of-Sea-Level-Rise
analysis arima-forecasting colab-notebook exploratory-data-analysis linear-regression logistic-regression machine-learning matplotlib ml pandas python3 qsvm scikit-learn seaborn
Last synced: 16 Apr 2026
https://github.com/arshc0der/n.o.v.a-geospatial-ozone-predictor
An AI-powered geospatial intelligence dashboard for predicting atmospheric ozone levels using 27 years of NASA data. Features 3D climate mapping and live satellite tracking.
atmospheric-science climate-tech dashboard-ui data-visualization desktop-app geospatial-analysis gis machine-learning matplotlib ozone-prediction pandas python random-forest-regressor satellite-tracking scikit-learn tkinter windows-executable
Last synced: 01 Mar 2026
https://github.com/s0fft/airline-passenger-satisfaction
Airline-Customer-Model β Machine Learning Project on: Scikit-learn / Pandas / Matplotlib / Seaborn
jupyter-notebook mashine-learning matplotlib pandas python3 scikit-learn seaborn
Last synced: 12 Feb 2026
https://github.com/zsailer/skspline
A Scikit-learn interface on Scipy's spline.
Last synced: 16 Apr 2026