scikit-learn
scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.
- GitHub: https://github.com/topics/scikit-learn
- Wikipedia: https://en.wikipedia.org/wiki/Scikit-learn
- Repo: https://github.com/scikit-learn/scikit-learn
- Created by: David Cournapeau
- Released: January 05, 2010
- Related Topics: scikit, python,
- Aliases: sklearn,
- Last updated: 2026-06-25 00:23:58 UTC
- JSON Representation
https://github.com/lorenzorottigni/ml-iris-svm
Machine Learning python bootcamp: Support Vector Machines on iris flower dataset
ipynb machine-learning numpy pandas python scikit-learn seaborn support-vector-machines
Last synced: 10 Apr 2026
https://github.com/jpcadena/hackathon-kin
Hackathon Kin: Customers churn (Finance and Risk).
algorithms ann churn-prediction classification data-science keras machine-learning python scikit-learn supervised-learning tensorflow
Last synced: 07 Apr 2026
https://github.com/kanika300393/loan_prediction
This project implements a Loan Prediction system using Support Vector Machine (SVM). It includes data preprocessing, visualization of features like income and education, and model evaluation. The goal is to predict loan approval based on the dataset. Clone the repo to explore the code and improve the model.
data-science machine-learning numpy pandas python scikit-learn svm-classifier
Last synced: 09 Apr 2026
https://github.com/oneapi-src/customer-churn-prediction
AI Starter Kit for customer churn prediction using Intel® Extension for Scikit-learn*
Last synced: 04 Apr 2025
https://github.com/akshaypatra/cardiovascular_disease_detection
AI-driven ECG classification model that detects cardiovascular abnormalities such as arrhythmia and atrial fibrillation using a hybrid CNN-LSTM deep learning approach.
keras matplotlib numpy pandas python3 scikit-learn seaborn tensorflow wfdb
Last synced: 14 Apr 2026
https://github.com/moritzkoerber/tune_preprocessing_algos
Files for this blogpost https://moritzkoerber.github.io/python/tutorial/2019/11/18/blogpost/
cross-validation hyperparameter-tuning machine-learning python scikit-learn
Last synced: 30 Apr 2026
https://github.com/tinaland101/credit-risk-classification
The purpose of this project is to build a credit risk classification model using machine learning techniques. This model helps identify the creditworthiness of borrowers based on historical lending data. Specifically, it uses a logistic regression model to predict whether a loan is healthy (0) or high-risk (1).
numpy pandas pathlib scikit-learn
Last synced: 30 Apr 2026
https://github.com/samuelpillai/machine-learning-classification-regression-nlp
A curated collection of machine learning mini-projects covering classification, regression, and natural language processing (NLP). This project demonstrates model training, evaluation, feature engineering, and pipeline integration using real-world datasets and Python tools like Scikit-learn, pandas, and NLTK.
classification data-analysis data-science data-visualization feature-engineering jupyter-notebook machine-learning ml-pipeline model-evaluation nlp python regression-models scikit-learn supervised-learning text-mining
Last synced: 30 Apr 2026
https://github.com/abhivur/connections-ai
Contributors: Meet Gamdha, Gaurav Nimmagadda
bert python scikit-learn word2vec
Last synced: 30 Apr 2026
https://github.com/mukeshthenraj/video-engagement-prediction
ML project to predict video engagement using Random Forest & XGBoost
gridsearchcv matplotlib numpy pandas random-forest-classifier roc-auc-curve scikit-learn video-analytics xgboost
Last synced: 30 Apr 2026
https://github.com/fbarffmann/credit-risk-classification
Classified 19,000+ loans as high-risk or healthy using logistic regression. Achieved 100% precision for healthy loans and 84% precision for high-risk loans.
classification credit-risk data-analysis logistic-regression machine-learning model-evaluation pandas python scikit-learn
Last synced: 30 Apr 2026
https://github.com/rajukoushik/machine-learning
machine-learning
machine-learning python scikit-learn
Last synced: 01 May 2026
https://github.com/fadlani-aditya/iris-plant-classification
This project focuses on classifying different species of Iris flowers using the Random Forest algorithm. The dataset, sourced from Scikit-learn, contains four key features: sepal length, sepal width, petal length, and petal width, which are used to predict the flower species (Setosa, Versicolor, and Virginica).
agriculture data-science iris-dataset machine-learning python scikit-learn supervised-learning
Last synced: 01 May 2026
https://github.com/arittra-bag/disease-predictor
Disease Predictor!
disease-prediction gradio logistic-regression machine-learning scikit-learn
Last synced: 01 May 2026
https://github.com/pavi2410/trafficsignrecognition
ML classification model using SVM (90% accuracy)
ai artificial-intelligence classification jupyter-notebook machine-learning ml pandas python scikit-learn support-vector-machines svm
Last synced: 01 May 2026
https://github.com/afondiel/intro-to-artificial-intelligence-free-course-le-wagon
Intro to Artificial Intelligence Free Course from @LeWagon
ai ai-courses artificial-intelligence data-science lewagon machine-learning numpy pandas prophet-facebook regression-models scikit-learn sklearn
Last synced: 01 May 2026
https://github.com/arturovaine/n8n-nodes-sklearn
Custom n8n nodes for integrating scikit-learn machine learning algorithms into your n8n workflows.
machine-learning n8n n8n-nodes scikit-learn sklearn
Last synced: 08 Jun 2026
https://github.com/vansh-khaneja/spam-email-detection
This is a spam email detection model
machine-learning naive-bayes-classifier scikit-learn spam-detection
Last synced: 01 May 2026
https://github.com/clinton-mwachia/machine-learning-with-python
machine learning with python
machine-learning python regression scikit-learn
Last synced: 01 May 2026
https://github.com/danishzulfiqar/language-detection-nlp-model
This machine learning model is designed to accurately detect and classify text in 18 languages using NLP
fastapi jupyter-notebook machine-learning natural-language-processing scikit-learn
Last synced: 01 May 2026
https://github.com/anastasiaschmidt1/sqli-detection-ml
UNI-PROJEKT: Erkennung von SQL-Injection-Angriffen durch maschinelles Lernen (SVM-Modell)
bht-berlin machine-learning scikit-learn sqli svm
Last synced: 02 May 2026
https://github.com/maxwelllzh/linearizer
Linearizing parameters for linear regression
data-analysis machine-learning scikit-learn
Last synced: 02 May 2026
https://github.com/tetsumichiumada/customer_segments
Identify customers by clustering them
machine-learning python scikit-learn unsupervised-learning
Last synced: 02 May 2026
https://github.com/luizassimoes/sklearn-kaggle-titanic
This repository was created to store all the code for tackling the Titanic challenge on Kaggle.
kaggle machine-learning scikit-learn
Last synced: 02 May 2026
https://github.com/vutsalsinghal/churn-prediction
Predict if a customer will stay or not
data-science jupyter-notebook machine-learning pandas python scikit-learn
Last synced: 02 May 2026
https://github.com/bishopce16/cryptocurrencies
An analysis on cryptocurrencies dataset using unsupervised machine learning, PCA algorithm, and K-means clustering.
hvplot jupyter-notebook pandas plotly python scikit-learn unsupervised-machine-learning visual-studio-code
Last synced: 02 May 2026
https://github.com/gperdrizet/recommender-system-streamlit
Streamlit deployment of K-nearest neighbors movie recommendation system to Render.
ci-cd data-science github-actions github-workflows k-nearest-neighbors machine-learning natural-language-processing pytest python recommender-system render scikit-learn tf-idf
Last synced: 03 May 2026
https://github.com/jofaval/red-wine-quality
Data Analysis of the Red Portuguese's Wine's Quality in 2009
classification data-analysis data-science data-visualization google-colab kaggle logistic-regression machine-learning python scikit-learn wine-quality xgboost
Last synced: 03 May 2026
https://github.com/moritzkoerber/data_science_posts
This repository hosts the code for my data science related blog posts.
hyperparameter-tuning machine-learning pipeline python scikit-learn
Last synced: 03 May 2026
https://github.com/insane-group/scikit-learn-template
Generic template to bootstrap your scikit-learn project
Last synced: 03 May 2026
https://github.com/viniciusds2020/ml_pycaret_classificacao
Sistema de preprocessamento e treinamento de modelos de machine learning utilizando PyCaret. Uma metodologia low-code para processos de MLops
machine-learning mlops preprocessing pycaret python scikit-learn
Last synced: 03 May 2026
https://github.com/fandredev/ml-my-guide
my own annotations about ML/DS using pandas, matplotlib, numpy, scikit learn
anaconda matplotlib numpy pandas plotly scikit-learn seaborn
Last synced: 03 May 2026
https://github.com/bahailime/breast-cancer-ml
🦠 Breast cancer survival prediction (notebook + streamlit)
breast-cancer-prediction csv gaussian-processes ipywidgets jupyter-notebook lightgbm linear-discriminant-analysis machine-learning ml python random-forest scikit-learn shap streamlit svc
Last synced: 03 May 2026
https://github.com/zhenglinlei/zdmp
Industry 4.0 Optimization with Machine Learning AI
industry-4 knn-classification machine-learning pandas python scikit-learn
Last synced: 03 May 2026
https://github.com/srisaihariharan/mic_sentiment_analysis_v
Sentiment analysis of IMDb movie reviews using Python, Scikit-learn, and TF-IDF.
machine-learning natural-language-processing nlp python scikit-learn sentiment-analysis sentiment-classification
Last synced: 03 May 2026
https://github.com/furk4nbulut/uygulamalarla-makine-ogrenmesi-ve-derin-ogrenme-atolyesi
Bu repository, Manisa'da gerçekleştirilen BTK Akademi Uygulamalı Makine Öğrenmesi ve Derin Öğrenme Atölyesi'ne ait eğitim sürecini kapsamaktadır. Atölyede katılımcılar, ileri düzey makine öğrenmesi ve derin öğrenme teknikleriyle ilgili teorik ve pratik bilgiler edinmektedir.
matplotlib numpy pandas scikit-learn seaborn
Last synced: 03 May 2026
https://github.com/kaustavmodak/business-aided-customer-feedback-assessment-system
A Streamlit-based sentiment analysis app that classifies customer reviews into Positive, Neutral, or Negative using a pre-trained ML mode
framework machine-learning matplotlib nlp nltk numpy pandas pickle regex scikit-learn seaborn sentiment-analysis streamlt tfidf-vectorizer
Last synced: 03 May 2026
https://github.com/jonad/finding_donors
Predicting income with UCI Census Income Dataset using supervised machine learning algorithms
numpy pandas scikit-learn scikitlearn-machine-learning
Last synced: 03 May 2026
https://github.com/arnavk-09/phishing-detection
🎣 Detect Phishing URLs with Data Pre-fitted... API & Web UI
csv data fastapi flask python scikit-learn
Last synced: 03 May 2026
https://github.com/srilaasya/breast-cancer-classifier
Used several Python libraries to make a K-Nearest Neighbor classifier that is trained to predict whether a patient has breast cancer
knearest-neighbor-classifier python scikit-learn
Last synced: 03 May 2026
https://github.com/martn2023/housing-prices-ml-supervised-learning
My first end-to-end ML/supervised learning project: housing price estimation
data-science data-visualization decision-tree end-to-end-machine-learning linear-regression machine-learning matplotlib pandas predictive-analytics random-forest scikit-learn supervised-learning
Last synced: 03 May 2026
https://github.com/stefan-500/ml-network-traffic-classification-unsw-nb15
Network traffic classification using Machine Learning
classification machine-learning network-traffic-classification python scikit-learn support-vector-machines svc-model unsw-nb15-dataset
Last synced: 03 May 2026
https://github.com/ceodaniyal/telecom_customer_churn_prediction
A machine learning project that predicts whether a telecom customer will churn (leave the service) using customer demographics, account information, and service usage. The repository includes data preprocessing, model training (with logistic regression), feature scaling, and example predictions.
classification customer-churn-prediction data-science logistic-regression machine-learning ml-project pandas prediction python scikit-learn streamlit telecom
Last synced: 04 May 2026
https://github.com/abdullahalzubaer/feature-selection-ranking
In-depth analysis regarding feature selection and ranking.
feature-ranking feature-selection random scikit-learn
Last synced: 04 May 2026
https://github.com/danielwohlr/delivery_time_series
Time series forecasting of food delivery service data
forecasting-time-series python scikit-learn
Last synced: 04 May 2026
https://github.com/abhivur/graduate-income-forecaster
Contributors: Abdussalam Raheem, Chiara Su, and Joseph Botros
matplotlib numpy pandas python scikit-learn seaborn
Last synced: 04 May 2026
https://github.com/marionchaff/real-estate-price-prediction-france
Real estate price prediction using French public database DVF
data-analysis dvf-data machine-learning price-prediction python real-estate scikit-learn
Last synced: 04 May 2026
https://github.com/mariiasam/stroke-prediction
A model for predicting the risk of stroke in a patient
balanced-random-forest-classifier decission-tree-classifier gradient-boosting imbalanced-learning joblib logistic-regression matplotlib numpy random-forest-classifier scikit-learn seaborn streamlit
Last synced: 04 May 2026
https://github.com/joel-beck/airbnb-oslo
Price Prediction Models for Airbnb Apartments in Oslo | Winter Term 2021/22
prediction python pytorch scikit-learn
Last synced: 04 May 2026
https://github.com/keven-rdr/rio-airbnb-predictor
Estudo de IA, utilizando modelos de previsão como o regressor para determinar valor de imóvel
airbnb ia kaggle php price regression-models scikit-learn
Last synced: 04 May 2026
https://github.com/dakii24/credit-card-fraud-detection
This repository contains a machine learning project focused on detecting fraudulent credit card transactions. The project includes data preprocessing, model training, and evaluation to identify and prevent fraudulent activities.
capstone-project class-imbalance classification-algorithm credit-card credit-card-fraud data-science decision-trees fraud machine-learning open-data python scikit-learn svm svm-classifier
Last synced: 04 May 2026
https://github.com/themusharraf/ml-app
machine-learning ml pandas scikit-learn
Last synced: 04 May 2026
https://github.com/drod75/nyc-arrests-analysis
This is a simple Data Science Project made to analyze and display data and trends found within the NYC Arrests Year to Date Dataset.
data-analysis data-visualization folium jupyter-notebook matplotlib-pyplot nyc-opendata nypd python scikit-learn seaborn
Last synced: 04 May 2026
https://github.com/chathumiamarasinghe/nn-training-model
A comprehensive project for training neural networks to solve real-world problems. This repository includes customizable code for building, training, and evaluating neural network architectures using popular deep learning frameworks.
jupyter-notebook matplotlib numpy phyton scikit-learn
Last synced: 04 May 2026
https://github.com/aqueeqazam/machine-learning-using-scikit
This repository contains all of the algorithms used to train the machine learning models using the Scikit library.
Last synced: 04 May 2026
https://github.com/pierrealexandre78/deathpredict
Predict Hospital mortality rate using Machine Learning for patients admitted in ICU (Intensive Care Unit)
healthcare hospital machine-learning predictions python random-forest-classifier scikit-learn xgboost-classifier
Last synced: 05 May 2026
https://github.com/jcm-ai/personal-data-science-projects
This page contains all of my personal data science projects. 📊📈📉👨💻
data-analysis data-visualization exploratory-data-analysis jupyter-notebooks machine-learning-algorithms matplotlib-pyplot numpy-library pandas-python personal-project predictive-modeling programming python3 scikit-learn scipy seaborn statistical-analysis
Last synced: 05 May 2026
https://github.com/simpl1fy/spam-classifier-project
A web application to classify spam texts or emails.
multinomial-naive-bayes nltk python render scikit-learn text-classification
Last synced: 05 May 2026
https://github.com/s-matke/eco-forecast
Machine learning model used for predicting European country with most green surplus energy generated
data-science green-energy machine-learning scikit-learn supervised-learning
Last synced: 05 May 2026
https://github.com/anu-gtb/sleep_disorder_pred
Link to DagsHub repository :
dagshub data-science dvc-pipeline hyperparameter-tuning machine-learning mlflow mlflow-ui mlops predictive-modeling python3 random-forest-classifier scikit-learn sleep-disorders
Last synced: 05 May 2026
https://github.com/celineboutinon/bookworms
OpenClassrooms Data Analyst 2022-2023 - Projet 6
apriori-algorithm data-analysis data-analytics data-visualisation dataframes matplotlib-pyplot mlxtend numpy pandas python scikit-learn scikit-posthocs scikitlearn seaborn statsmodels
Last synced: 05 May 2026
https://github.com/marconicivitavecchia/stazione-monitoraggio-ambientale
Codice in MicroPython per ESP32 per il corso tenuto dalla nostra scuola rivolto ai docenti sulla creazione di una stazione di monitoraggio ambientale che copre gli argomenti di Python, IoT ed Intelligenza Artificiale.
ai esp32 micropython micropython-esp32 python school-project scikit-learn
Last synced: 05 May 2026
https://github.com/hitthecodelabs/petalanalyticsstreamlit
Web application developed with Streamlit that predicts the Iris flower type based on its physical features
matplotlib model numpy pickle python scikit-learn sklearn streamlit
Last synced: 05 May 2026
https://github.com/monish-nallagondalla/universal-bank
Credit Card Ownership Prediction A machine learning project that predicts credit card ownership using features like age and income, balancing class distributions for improved accuracy.
classification-models credit-card-prediction data-analysis data-classification decision-tree-classifier imbalanced-datasets machine-learning model-evaluation python scikit-learn
Last synced: 05 May 2026
https://github.com/smaddanki/pattern-pursuit-challenge
A personal challenge to build a production-ready trading signal system for S&P 500 stocks using deep learning. This project progresses from basic ML models to a complete trading infrastructure, focusing on 5-day forward return prediction and signal generation.
deep-learning machine-learning pytorch quantative-trading quantitative-finance quantitative-research scikit-learn
Last synced: 05 May 2026
https://github.com/akash-47-tank/personalized-e-commerce-review-summarizer
Personalized E-commerce Product Review Summarizer: A Streamlit app that summarizes product reviews (e.g., from a CSV) using T5-small and tailors summaries to user preferences (price, durability, etc.) with NLP and lightweight ML.
data-analysis e-commerce machine-learning nlp personalization portfolio python scikit-learn sentiment-analysis streamlit t5 transformers web-app
Last synced: 05 May 2026
https://github.com/teja-1403/coursera-machine-learning-with-python-honors
This project involves building a classifier to predict rainfall for the next day based on weather data from the Australian Government's Bureau of Meteorology. Various machine learning techniques such as Linear Regression, KNN, Decision Trees, Logistic Regression, and SVM were implemented and evaluated.
classification hierarchical-clustering machine-learning regression scikit-learn scipy
Last synced: 05 May 2026
https://github.com/zuhairzia/customer-segmentation
📖 About Customer Segmentation using KMeans clustering to analyze demographics, income, and spending. Helps businesses with targeted marketing and customer insights.
joblib matplotlib numpy pandas scikit-learn seaborn
Last synced: 05 May 2026
https://github.com/rohra-mehak/sciencesync
System for Personalized Google Scholar Alerts Processing and Data Management, and provision of ML based clustering analysis
agglomerative-clustering clustering crossref-api customtkinter google-api google-scholar graph-api machine-learning numpy pandas python3 scientific-article-analysis scikit-learn sqlite3
Last synced: 05 May 2026
https://github.com/rohansardar/iris_flower
A basic ML project on the iris flower classification
data-science iris-classification iris-dataset ml python scikit-learn
Last synced: 05 May 2026
https://github.com/gbourniq/cnn-multiclass-classification-gear
Using Machine Learning and Deep Learning to predict the category of outdoor equipment
image-classification keras-tensorflow multiclass-classification python scikit-learn tensorboard-visualizations
Last synced: 05 May 2026
https://github.com/aryar-06/linear-regression
A Python project demonstrating basic linear regression with gradient descent and matrix operations, alongside scikit-learn comparison.
data-analysis data-preprocessing educational-project gradient-descent linear-regression machine-learning python regression-algorithms scikit-learn
Last synced: 05 May 2026
https://github.com/antoniskl/un-general-debate-corpus-classification
The aim of this project is to classify UNGDC speeches with regards to climate change. As a secondary objective, a correlation is being examined between these speeches, the forestation and the happiness index of the countries.
classification data-science jupyter-notebook machine-learning nlp python regression scikit-learn text-classification text-preprocessing
Last synced: 05 May 2026
https://github.com/debangshu9183/spam-mail-detection
To detect spam mail
logistic-regression machine-learning numpy pandas scikit-learn seaborn
Last synced: 05 May 2026
https://github.com/vanilladucky/housing-prediction
This is a data analytics and machine learning project that I undertook using a housing dataset on Kaggle in order to put my machine learning knowledge to practice and some practical application.
data-science machine-learning python scikit-learn
Last synced: 05 May 2026
https://github.com/zenitsu272/fault-detection-ml
Machine Learning based Fault Detection in machines using sensor data
artificial-intelligence decsion-tree machine-learning pandas pandas-dataframe pandas-python scikit-learn
Last synced: 05 May 2026
https://github.com/pjj11005/ml_with_pytorch_study
[머신 러닝 교과서: 파이토치 편] -> 학습한 코드 저장소
deep-learning graph-neural-networks machine-learning neural-networks pytorch scikit-learn transformer
Last synced: 06 May 2026
https://github.com/grandechowhiskey/fcc-machine_learning-boilerplates
A collection of projects completed as part of the FreeCodeCamp "Machine Learning with Python" certification. These projects focus on implementing machine learning models, data preprocessing, and predictive analysis using libraries like scikit-learn and TensorFlow.
ai ml python3 scikit-learn tensorflow
Last synced: 06 May 2026
https://github.com/fahrettinsolak/ai-based-salary-scale-calculation-project
This project demonstrates a Polynomial Regression model using a dataset related to experience and salary. The model is built using Python with the pandas, matplotlib, and sklearn libraries. The dataset includes information on years of experience and corresponding salary.
artificial-intelligence deep-learning jupyter-notebook machine-learning matplotlib pandas pyhton scikit-learn
Last synced: 05 May 2026
https://github.com/sadmansakib93/mental-resilience-analysis-using-machine-learning
Utilized supervised and unsupervised ML techniques to analyze mental health and resilience levels of medical students [Project completed on December, 2019]
artificial-intelligence classification clustering correlation linear-regression machine-learning machine-learning-algorithms mental-health python regression resilience scikit-learn statistical-analysis
Last synced: 06 May 2026
https://github.com/rishisolanke/twitter-sentiment-analysis-using-machine-learning-
A research project that classifies tweets as positive, negative, or neutral using ML algorithms (Logistic Regression, Naïve Bayes, SVM) with NLP preprocessing.
data-science data-visualization logistic-regression machine-learning ml-models naive-bayes natural-language-processing nlp scikit-learn sentiment-analysis svm text-classification twitter-data
Last synced: 06 May 2026
https://github.com/samia35-2973/living-type-classification-from-codon-usage
Machine learning project to classify living types based on codon usage data using Random Forest and XGBoost classifiers.
classification codon-usage data-cleaning data-preprocessing excel exploratory-data-analysis living-type machine-learning python random-forest-classifier scikit-learn supervised-learning xgboost-classifier
Last synced: 06 May 2026
https://github.com/eshansugeesh/fico-score-loan-default-modeling-project
Credit risk assessment using FICO score segmentation, loan default modeling, discretization techniques, and log-likelihood evaluation for predictive analytics in financial services.
bucketing classification credit-risk customer-segmentation data-science discretization fico-score financial-analytics loan-analysis loan-default log-likelihood machine-learning numpy pandas predictive-modeling risk-modeling scikit-learn segmentation statistical-modelling
Last synced: 06 May 2026
https://github.com/radoslawregula/binary-classification-metrics
A model implementing a solution to the binary classification problem along with several accuracy metrics.
binary-classification classification jupyter-notebook machine-learning matplotlib pandas python scikit-learn stochastic-gradient-descent
Last synced: 06 May 2026
https://github.com/billgewrgoulas/recommendation-systems
Algorithms for joke rating prediction using the joke data-set from Kaggle.
algorithm clustering collaborative-filtering machine-learning numpy pandas recommender-system scikit-learn scypi
Last synced: 06 May 2026
https://github.com/erick957/saleprice-prediction-dataset-analysis-and-cleaning-advance-regression
🏠 Predict house prices using advanced regression techniques with this comprehensive analysis and cleaning project, from data loading to model deployment.
data-analysis data-science eda google-colab machine-learning numpy pandas python scikit-learn scikit-learn-python
Last synced: 06 May 2026
https://github.com/anu-gtb/campusplacement
Link to Streamlit app :
binary-classification campus-placements data-science data-visualization machine-learning matplotlib ml-algorithms predictive-modeling python3 scikit-learn seaborn statistics streamlit
Last synced: 06 May 2026
https://github.com/5hraddha/optimize-oil-well-locations
In the quest for harnessing valuable energy resources, the OilyGiant mining company wants to expand its operations by discovering new oil well locations. To achieve this, a data-driven approach is adopted, leveraging geological exploration data from three distinct regions and employing techniques in data analysis and modeling.
linear-regression numpy pandas scikit-learn supervised-learning
Last synced: 06 May 2026
https://github.com/sabin74/boston_house_prediction
This project aims to predict the median value of owner-occupied homes in Boston suburbs using various machine learning regression models. Multiple regression techniques were applied, including Linear Regression, Decision Tree, Random Forest, Gradient Boosting and dimensionality reduction with PCA. Hyperparameter tuning was performed.
boston-housing-price-prediction hyperparameter-tuning kaggle-dataset pca-analysis python3 regression-models scikit-learn
Last synced: 06 May 2026
https://github.com/msikorski93/predicting-prices-on-king-county-housing-dataset
Predicting house prices using different regression analysis models.
catboost eda gradient-boosting king-county lightgbm linear-regression machine-learning neural-network polynomial-regression real-estate regression-models scikit-learn tensorflow xgboost
Last synced: 06 May 2026
https://github.com/deshwalx/diabetes-prediction-svm
My first ML project using SVM to predict diabetes
beginner-project classification diabetes machine-learning python scikit-learn svm svm-classifier
Last synced: 06 May 2026
https://github.com/ejw-data/ml-playground
Testing the limitations, inabilities, and strengths of models with synthetic data
machine-learning python scikit-learn
Last synced: 06 May 2026
https://github.com/pradeep-r04/spam-email-classification
Spam Email Classification Using NLP and Machine Learning involves building a system to identify and categorize emails as either spam or non-spam (ham). This process typically uses Natural Language Processing (NLP) techniques to analyze and preprocess text data and machine learning algorithms to train a model for classification.
artificial-intelligence machine-learning naive-bayes-classifier nlp pkl python scikit-learn streamlit
Last synced: 06 May 2026
https://github.com/cycle-sync-ai/student-score-analysis
A data-driven student performance analysis project using UCI dataset (396 students, 33 features). Implements machine learning models (K-means, PCA, Decision Tree, Random Forest, Linear Regression) to analyze academic patterns and predict student scores based on lifestyle, health, and study habits.
clustering clustering-algorithm decision-trees feature-engineering learning-management-system linear-regression machine-learning machine-learning-algorithms matplotlib numpy pandas pca pickle prediction prediction-algorithm scikit-learn score seaborn student
Last synced: 06 May 2026
https://github.com/lintangwisesa/pdb_mti_ui_lab1_k6
Tugas Lab 1 Pengelolaan Data Besar MTI UI 2023
machine-learning python3 scikit-learn
Last synced: 06 May 2026
https://github.com/barbarpotato/applied-data-science-with-python-specialization
This skills-based specialization is intended for learners who have a basic python or programming background, and want to apply statistical, machine learning, information visualization, text analysis, and social network.
data-science matplotlib pandas scikit-learn
Last synced: 06 May 2026
https://github.com/kartheekdama/salary-prediction
This salary prediction model leverages machine learning techniques, including Random Forest, Decision Tree, and Linear Regression, to estimate salaries based on individual attributes such as age, gender, education level, job title, and years of experience. The Random Forest model outperforms the others, achieving the highest R-squared score.
decision-tree exploratory-data-analysis feature-importance linear-regression machine-learning random-forest scikit-learn
Last synced: 06 May 2026
https://github.com/kumar-o/customer-churn-prediction
🔍 Predict customer churn using a synthetic dataset with advanced models and metrics to enhance business retention strategies and decision-making.
churn-prediction customer-churn-prediction customer-survival-analysis data-science factorization-machines flask-application gridsearchcv libsvm machine-learning model-evaluation numpy pipelines roc-auc scikit-learn seaborn smote survival-analysis xgboost4j
Last synced: 06 May 2026
https://github.com/josepablodmg/python--linear-regression-advertising
A linear regression analysis to predict sales based on advertising spending across TV, radio, and newspaper channels. The project includes exploratory data analysis, model training, coefficient visualization, and residual analysis.
advertising data-analysis exploratory-data-analysis linear-regression machine-learning python regression scikit-learn visualization
Last synced: 06 May 2026