scikit-learn
scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.
- GitHub: https://github.com/topics/scikit-learn
- Wikipedia: https://en.wikipedia.org/wiki/Scikit-learn
- Repo: https://github.com/scikit-learn/scikit-learn
- Created by: David Cournapeau
- Released: January 05, 2010
- Related Topics: scikit, python,
- Aliases: sklearn,
- Last updated: 2026-06-25 00:23:58 UTC
- JSON Representation
https://github.com/zuhairzia/titanic-survival-project
This is a Titanic Survival Prediction Model developed using Python, Pandas, Scikit-learn, and Jupyter Notebook. The model predicts whether a passenger survived the Titanic disaster based on features such as age, gender, and passenger class.
csv-dataset flask jupyter-notebook matplotlib numpy pandas pandas-library python scikit-learn seaborn streamlit
Last synced: 11 Apr 2026
https://github.com/kishanlalchoudhary/te-sem-6
TE SEM 6 Assignments
cpp data-science dsa-cpp matplotlib nltk numpy pandas python salesforce scikit-learn seaborn
Last synced: 11 Apr 2026
https://github.com/karimosman89/resume-screening
Screen resumes to identify the best candidates.Build a machine learning model that screens resumes and ranks candidates based on job descriptions.Streamline the hiring process for HR departments by automating candidate screening.
machine-learning-algorithms nlp-machine-learning nltk-python python scikit-learn spacy text-processing
Last synced: 29 Apr 2026
https://github.com/dane-meister/titanic-survival-prediction
This project applies machine learning to the Titanic dataset to predict whether a passenger survived.
classification data-cleaning data-science exploratory-data-analysis feature-engineering jupyter-notebook kaggle knn logistic-regression machine-learning pandas python scikit-learn svm titanic-dataset titanic-kaggle
Last synced: 11 Apr 2026
https://github.com/djdhairya/crop-recommendation
Crop Recommendation System is a powerful tool for enhancing agricultural decision-making. By leveraging data-driven insights, it empowers farmers to maximize yield and ensure sustainable practices.
adaboostclassifier bagging-classifier csv decision-trees gaussian html knn-classification logistic-regression machine-learning machine-learning-algorithms matplotlib model numpy pandas random-forest random-forest-classifier scikit-learn seaborn svc
Last synced: 11 Apr 2026
https://github.com/shreeparab1890/handwritten-digit-recognition
In this iPython Noetbook we are going to use the MNIST dataset for the implementation of a handwritten digit recognition app using LogisticRegression and SGDClassifier and compare the accuracy and other metrics.
handwritten-digit-recognition image-classification matplotlib mnist-dataset python scikit-learn sklearn
Last synced: 11 Apr 2026
https://github.com/msikorski93/breast-cancer-classifying
Identifying and assigning breast cancer diagnosis using machine learning methods, based on observations in WDBC dataset. All classifiers have been evaluated and performed well for this task.
breast-cancer classification k-nearest-neighbours keras logistic-regression naive-bayes neural-networks scikit-learn tensorflow
Last synced: 30 Apr 2026
https://github.com/kirtipratihar/python_libraries_for_ds
This repository serves as a comprehensive guide to Python programming for Data Science. It covers essential topics like data manipulation, data visualization, machine learning, and statistical analysis using popular libraries such as Pandas, NumPy, Matplotlib, Seaborn, and Scikit-Learn.
artificial-intelligence machine-learning numpy pandas python scikit-learn tensorflow
Last synced: 11 Apr 2026
https://github.com/akimuddinshaikh/machine-learning-project
A comparative study of regression models (Decision Tree, Random Forest, Ridge, Lasso, SVM) for predicting real estate prices in King County, NYC, and California using PCA & Pipeline techniques.
machine-learning pca-analysis python regression-models scikit-learn statsmodels
Last synced: 16 May 2026
https://github.com/eljandoubi/deploy-ml
Deploying a ML Model to Cloud Application Platform with FastAPI
ci-cd fastapi github-actions gunicorn pandas pytest render scikit-learn uvicorn
Last synced: 11 Apr 2026
https://github.com/das-amlan/delay-prediction-in-urban-mobility-networks
Predicting delays in Urban mobility netwrok using different ML algorithms.
delay-prediction gradient-boosting machine-learning python r scikit-learn
Last synced: 05 Apr 2026
https://github.com/mr-ndi/tibebai
Machine learning experiments on student performance prediction. Inspired by tibeb (wisdom) in Amharic, this project explores regression models to understand how study factors influence exam scores.
ai data-science education elevvo google-colab internship kaggle linear-regression machine-learning matplotlib pandas polynomial-regression prediction regression scikit-learn student-performance tibebai-wisdom
Last synced: 11 Apr 2026
https://github.com/timothyjan/intro-machine-learning-classifiers
We will use the scikit-learn library, which is a higher-level machine learning library that will work with NumPy data, and Pandas, a library that makes it easier to manipulate data. We will explore a variety of classification algorithms, and compare their performance on a “real-world” dataset, which will introduce its own set of challenges.
numpy pandas python scikit-learn
Last synced: 11 Apr 2026
https://github.com/vasu7052/spam-classifier
This is a Machine Learning Project to detect whether a given sentence maybe a spam or not using Python and Keras.
keras keras-neural-networks python3 scikit-learn spam-classification tensorflow
Last synced: 11 Apr 2026
https://github.com/jashandeep032kaur-dot/heart-disease-prediction
This is my heart disease prediction project.
css flask-application machine-learning-algorithms pyhton3 scikit-learn
Last synced: 17 May 2026
https://github.com/renato4333/learn-artificial-intelligence
bayesian-inference capsule-network causal-inference convolutional-neural-networks data-structures deep-learning deep-learning-algorithm knn lua matplotlib pandas probabilistic-programming python pytorch question-answering regression-algorithms scikit-learn torch
Last synced: 11 Apr 2026
https://github.com/brej-29/disaster-tweets-nlp-model-benchmarks
Benchmark NLP models on Kaggle “Disaster Tweets”: TF-IDF + Naive Bayes baseline, Keras deep nets (Dense/LSTM/GRU/BiRNN/Conv1D), and TensorFlow Hub Universal Sentence Encoder transfer learning—compared using accuracy, precision, recall, and F1.
bidirectional-rnn cnn conv1d deep-learning disaster-tweets gru kaggle keras lstm machine-learning naive-bayes nlp rnn scikit-learn tensorflow tensorflow-hub text-classification tfidf
Last synced: 11 Apr 2026
https://github.com/priteshramani/movie-recommender
A content-based movie recommendation system using Python, Pandas, and cosine similarity to suggest movies based on their features.
cosine-similarity pandas pickle python scikit-learn streamlit
Last synced: 11 Apr 2026
https://github.com/simranjeet97/spam-classification
Spam Classification Using Natural Language Processing (NLP), Scikit-Learn Library, and Bayesian Method.
data-science emails kaggle kaggle-dataset naive-bayes-classifier nlp-machine-learning nltk-python python scikit-learn spam-classification
Last synced: 11 Apr 2026
https://github.com/vamsi0333/ai-sentiment-cicd-project
End-to-end CI/CD deployment of an AI-powered Sentiment Analysis API using FastAPI, Docker, Kubernetes, Terraform, and GitHub Actions. Demonstrates complete MLOps + DevOps workflow.
ai cicd devops docker fastapi github-actions kubernetes mlops scikit-learn terraform
Last synced: 12 Apr 2026
https://github.com/abrarshahok/electric-vehicle-charging-station-energy-consumption-prediction
With the rapid adoption of electric vehicles, optimizing energy usage at charging stations has become crucial for improving operational efficiency and ensuring customer satisfaction. This tool leverages predictive modeling to forecast energy consumption for charging sessions based on various input features.
matplotlib numpy pandas plotly python3 scikit-learn xgboost
Last synced: 09 Jun 2026
https://github.com/mrktsm/spam-email-recognizer
Long Short-Term Memory (LSTM) network trained to classify emails as spam or non-spam. It processes email content to make accurate predictions and can be integrated into projects for efficient spam detection and email management.
data-preprocessing keras lstm-neural-network model-architecture nltk numpy pandas performance-evaluation scikit-learn spam-classification-model tenserflow training-the-model
Last synced: 09 Apr 2026
https://github.com/abdiasarsene/routerwise-api-predictive-analytics-for-shipments
🧭 RouterWise optimise la logistique d’œuvres d’art grâce à une pipeline MLOps automatisée, prédictive et monitorée, intégrée au backend de PrecisioArt.
bentoml docker fastapi jenkins mlflow prometheus scikit-learn
Last synced: 11 Apr 2026
https://github.com/subratamondal1/heart-attack-prediction
Heart Attack Prediction of patients based on the required data. Data Ingestion - Data Preparation - Exploratory Data Analysis (EDA) - Modelling - Evaluation.
data-analysis data-science data-visualization kaggle-dataset machine-learning matplotlib-pyplot numpy pandas python3 scikit-learn seaborn
Last synced: 09 Apr 2026
https://github.com/alpha597/music_classification_ml
A project which compares different machine learning algorithms' accuracy in music genre classification of a large dataset.
machine-learning pandas python scikit-learn tensorflow
Last synced: 11 Apr 2026
https://github.com/lmizner/grokking_data_science
Coding practice for basic data science interview questions in Python
data-science numpy pandas python scikit-learn
Last synced: 11 Apr 2026
https://github.com/jingjing-jin/purchase-behavior-analysis
Purchase Behavior Analysis for Targeted Customer Segmentation
clustering-algorithm data-mining machine-learning python scikit-learn
Last synced: 20 Jan 2026
https://github.com/christianconchari/bike-sharing-demand
Este repositorio contiene el trabajo práctico final de la materia Aprendizaje de Máquina II de la Especialización en Inteligencia Artificial (CEIA) de la Facultad de Ingeniería de la Universidad de Buenos Aires (FIUBA).
airflow docker fastapi machine-learning mlflow python scikit-learn
Last synced: 20 Jan 2026
https://github.com/jayadavv/dynamic-ml-model-selector
An interactive web application that allows users to upload their datasets and dynamically select, train, and evaluate various machine learning models. The app provides comprehensive performance metrics and visualizations, making it easy for users to analyze their data effectively.
decision-trees linear-regression logistic-regression matplotlib-pyplot plotly python random-forest scikit-learn streamlit
Last synced: 11 Apr 2026
https://github.com/arrhythmia-detection/authorprovidedfeaturescombineddtoptimized
Deploys an optimized Decision Tree for Arrhythmia classification using Chapman ECG dataset on Arduino UNO board
arduino-uno arrhythmia-classification atmega328p chapman-ecg decision-tree-classifier eloquent scikit-learn
Last synced: 17 May 2026
https://github.com/monish-nallagondalla/cement_strength_prediction
The Cement Strength Prediction project uses machine learning to predict the compressive strength of cement based on its components, such as Cement, Fly Ash, Water, Superplasticizer, Coarse Aggregate, Fine Aggregate, and Age. The goal is to forecast compressive strength (MPa) for optimized cement production and quality control.
cement-strength-prediction construction-industry data-analysis data-preprocessing data-science data-visualization feature-engineering machine-learning predictive-modeling python regression-analysis scikit-learn
Last synced: 11 May 2026
https://github.com/blaz-cerpnjak/student-dropout-prediction
Student dropout predictions based on grades and other info. Classification problem with MLPClassifier.
classification machine-learning mlpclassifier neural-networks poetry predicting-student-dropout python scikit-learn scikit-learn-pipelines
Last synced: 17 May 2026
https://github.com/priyanshul28/ml_regression_eda_waiterstip
An EDA and Machine Learning Regression exercise on the Waiter's Tip dataset demonstrating the use of Linear Regression, Neural Network Regressors, Decision Trees, Random Forests, Linear SVR, XGBoost, etc. The models are optimized using hyperparameter tuning through GridSearchCV.
eda machine-learning regression scikit-learn seaborn
Last synced: 17 May 2026
https://github.com/genaray/ml.shopanalytics
A minimalist Python & cloud ML project that trains on Amazon sales & review data to recommend optimal prices/discounts to boost ratings/sales and surface actionable visual insights. Powered end-to-end by AWS CloudFront, S3, ALB & Fargate and Svelte.
ai aws aws-alb aws-cloudfront aws-ecs aws-fargate aws-s3 cicd devops machine-learning python scikit-learn terraform
Last synced: 11 Apr 2026
https://github.com/gregoritsch3/ml_eda_clustering_aidassessment
An EDA and Machine Learning Clustering exercise on the Country Aid Assessment dataset demonstrating the use of PCA, KMeans and DBSCAN clustering, Elbow Methods, etc. The clustering algorithm successfully demarcates countries that are in most dire need of aid based on their GDPP and Child Mortality rate.
anova dbscan kmeans machine-learning matplotlib numpy pandas pca scikit-learn seaborn statistics
Last synced: 16 Apr 2026
https://github.com/kishanlalchoudhary/be-sem-7
BE SEM 7 Assignments
blockchain cpp design-analysis-algorithms machine-learning matplotlib numpy pandas scikit-learn seaborn
Last synced: 11 Apr 2026
https://github.com/felinjob/ibm-applied-data-science-capstone
Este projeto, parte da especialização IBM Data Science Professional Certificate, prevê o sucesso do pouso do Falcon 9 da SpaceX. Usando dados da API da SpaceX e Web Scraping, o projeto inclui análise de dados e Machine Learning para gerar insights sobre os lançamentos.
data-analysis data-science data-visualization ibm jupyter-notebook machine-learning numpy pandas python scikit-learn seaborn sql
Last synced: 11 Apr 2026
https://github.com/dionixius7/titanic-disaster-ml-model
This project predicts the survival of passengers on the Titanic by using Kaggle Titanic Disaster Dataset. The dataset contains information related to passengers, such as age, gender, and class. Different machine learning algorithms have been applied for this predictive model to accomplish an accurate prediction that will define the survival chances
data-analysis data-science data-visualization eda knn-classifier machine-learning neural-network python scikit-learn svm tensorflow titanic-kaggle titanic-survival-prediction
Last synced: 07 Feb 2026
https://github.com/jonad/boston_housing_price
Predicting Boston Housing Prices.
boston-housing-dataset jupyter-notebook matplotlib numpy pandas python3 scikit-learn
Last synced: 11 Apr 2026
https://github.com/surajsanap/technohack_mlinternship
1) Wine Quality Analysis and Classification, 2)Movie Review Sentiment Analysis, 3)Diabetes Prediction Using Machine Learning
deep-learning machine-learning pandas python scikit-learn
Last synced: 08 May 2025
https://github.com/paulinhok14/csgo-datascience-project
📊 Analysis of CS:GO grenade usage patterns and their impact on match outcomes using data science and statistical methods.
matplotlib mlflow numpy python scikit-learn scipy seaborn
Last synced: 30 Dec 2025
https://github.com/sshbuilder/movie-recommendation-system
The primary goal of this project is to provide personalized movie recommendations to users based on their preferences and the characteristics of the movies. This is achieved through a multi-step process involving data preprocessing, text vectorization, and recommendation generation.
anaconda-environment data-science jupyter-notebook machine-learning movie-recommendation movies pandas python3 recommendation-system recommender-system scikit-learn scikitlearn-machine-learning
Last synced: 26 Feb 2025
https://github.com/mitchmedeiros/mlcompare
Quickly compare machine learning models across libraries and datasets.
huggingface-datasets kaggle machine-learning openml pytorch scikit-learn xgboost
Last synced: 02 Feb 2026
https://github.com/eljandoubi/genre_classification
Create an ML pipeline for Genre Classification using MLflow.
hydra machine-learning mlflow numpy pandas pandas-profiling pytest scikit-learn scipy wandb
Last synced: 11 Apr 2026
https://github.com/talapanenivarshithchowdary/asteroid-detection-ml
This project uses Machine Learning to detect and classify asteroids based on trajectory and size, aiding in Near-Earth Object detection and planetary defense.
classification data-science decision-trees jupyter-notebook knn logistic-regression machine-lea matplotlib numpy pandas pillow prediction python3 random-forest scikit-learn
Last synced: 11 Apr 2026
https://github.com/audy21/datacamp
Learning portfolio documenting my progress, while taking Data Analyst & Data Science certifications from DataCamp.
data-analysis data-science machine-learning matplotlib numpy pandas python scikit-learn seaborn
Last synced: 11 Apr 2026
https://github.com/swarnabhaghosh/house-price-prediction-model
Built an end-to-end regression pipeline to predict house prices using Linear Regression with automated preprocessing (PowerTransform, StandardScaling) via Scikit-learn's Pipeline and ColumnTransformer.
column-transformer linear-regression matplotlib-pyplot numpy pandas pipeline python scikit-learn seaborn
Last synced: 11 Apr 2026
https://github.com/heyitsjoealongi/fantasy-football-qbwr-model
Fantasy Football: Quarterback / Wide Receiver - Gaussian Process Regression (GPR) Machine Learning Model
machine-learning matplotlib model numpy python scikit-learn
Last synced: 01 Apr 2025
https://github.com/aksoni07/movie-recommendation
A hybrid movie recommendation system designed to deliver personalized and accurate suggestions by combining user preferences, item attributes, and collaborative patterns, ensuring a seamless and engaging experience.
clustering content-based-filtering data-analysis embeddings jupyter-notebook numpy ollaborative-filtering pandas personalization python recommendation-systems scikit-learn user-item-interactions
Last synced: 11 Apr 2026
https://github.com/rizquuula/sentimentanalyzenaivebayes
Analisis Sentimen menggunakan metode Naive Bayes dengan "One time learning" dan "Continuous Learning"
machine-learning naive-bayes nlp python scikit-learn sentiment-analysis text-classification
Last synced: 17 May 2026
https://github.com/urvee1810/bitcoin-price-forecasting-using-arma
The analysis reveals the challenges of predicting Bitcoin prices during highly volatile periods and demonstrates how traditional time series models perform under different market conditions. The project includes comparative analysis of model performance during stable and volatile market phases.
arima arma augmented-dickey-fuller-test feature-engineering machine-learning matplotlib mplfina numpy pandas python random-forest randomforestregressor scikit-learn seaborn statsmodels time-series-analysis
Last synced: 06 Mar 2026
https://github.com/adirbella37/safety-analytics-project
Final project in Safety Management: analytics and predictive modeling for occupational incidents. Includes EDA, logistic regression, Poisson/Negative Binomial with overdispersion checks, ROC/AUC, and prediction exercises.
classification data-visualization drunk-and-drive eda logistic-regression matplotlib negative-binomial numpy occupational-safety overdispersion pandas poisson-regression python road-safety roc-auc scikit-learn seaborn statmodels
Last synced: 09 Apr 2026
https://github.com/capac/higher-education-students-performance-evaluation
Machine learning project for evaluating higher education student performance
docker evidently grafana mlflow postgresql prefect python scikit-learn xgboost
Last synced: 09 Apr 2026
https://github.com/das-amlan/customer-churn-prediction
Predicting customer churn using machine learning algorithms
customer-churn-prediction imbalanced-data keras-tensorflow machine-learning pandas prediction-model python scikit-learn seaborn tensorflow
Last synced: 11 Apr 2026
https://github.com/ricardorobledo/ml_optimization
matplotlib numpy python scikit-learn xgboost
Last synced: 11 Apr 2026
https://github.com/allanreda/telco-customer-churn-predictor-app
A web-based machine learning application that predicts customer churn using a logistic regression model. Built with Scikit-Learn for model training, Gradio for the user interface, and deployed on Google Cloud App Engine. The app allows users to input customer data and receive predictions on churn risk to support business decision-making.
app-engine data-visualization deployment google-cloud gradio hyperparameter-tuning logistic-regression machine-learning numpy pandas scikit-learn
Last synced: 16 Apr 2026
https://github.com/dastogirrudro/machine-learning-and-deep-learning
This is my thesis project which i have done in varsity.Here i used machine learning and deep learning i used LSTM as deep learning.This can identify aggresive spam message. Here i used pandas scikit-learn and many more framework i used python as a programming language.I used many algorithm for highering the accuracy of my project.
deep-learning lstm machine-learning numpy pandas python scikit-learn
Last synced: 11 Apr 2026
https://github.com/matbesancon/kaggle-digit-recognizer
Some tests with the Kaggle Digit Recognition challenge
image-processing kaggle kaggle-digit-recognizer machine-learning mnist-dataset numpy pandas python scikit-image scikit-learn
Last synced: 11 Apr 2026
https://github.com/duruii/contest-dingtalkcup2-a
2023年第二届“钉钉杯”大学生大数据挑战赛——智能手机用户监测数据分析
data-mining machine-learning pandas scikit-learn xgboost
Last synced: 12 Mar 2025
https://github.com/andrewjmack/credit-risk-classification
Supervised learning model trained and evaluated on loan risk for potential use in the prediction of the creditworthiness of an applicant
banking loan-prediction-analysis machine-learning pandas python scikit-learn supervised-learning
Last synced: 11 Apr 2026
https://github.com/trimoyee-g/adenovirus-disease-prediction
A machine learning project using scikit-learn to compare models for Adenovirus detection, selecting the most effective one based on accuracy, precision, and recall.
machine-learning matplotlib python random-forest-classifier scikit-learn
Last synced: 11 Apr 2026
https://github.com/trimoyee-g/flipkart-reviews-sentiment-analysis
A RandomForestClassifier-based sentiment analysis model for efficient binary categorization of Flipkart reviews.
machine-learning matplotlib python random-forest-classifier scikit-learn seaborn
Last synced: 11 Apr 2026
https://github.com/trimoyee-g/phishing-site-predictor
A phishing site prediction model using scikit-learn's Random Forest Classifier, achieving high accuracy and gaining insights into website characteristics.
data-visualization machine-learning python random-forest-classifier scikit-learn
Last synced: 11 Apr 2026
https://github.com/pradeep-r04/attendiq
AttendIQ is a Face Recognition Attendance System designed to automate and streamline the attendance process with precision and ease. By leveraging real-time face detection and recognition technology, AttendIQ eliminates the need for manual roll calls or ID-based check-ins. The system captures facial data during a quick registration process .
csv cv2 kneighborsclassifier numpy os pandas pickle python scikit-learn streamlit time
Last synced: 02 Apr 2026
https://github.com/ghoumbadji/analyzing-customer-churn-for-a-telecom-company
The project involves utilizing various machine learning techniques, both supervised and unsupervised, to detect customer churn and identify the key factors contributing to it.
churn-analysis churn-prediction kaggle machine-learning pandas random-forest-classifier scikit-learn
Last synced: 03 May 2026
https://github.com/pratishtha-abrol/sentimentanalysis
Logistic Regression: A sentiment analysis case study
logistic-regression nltk-python scikit-learn sentiment-analysis
Last synced: 17 May 2026
https://github.com/alsult/wine_classification
This is a wine classification project based on 13 numerical features of wines grown in the same region in Italy but derived from three different cultivars.
logistic-regression machine-learning matplotlib multiclass-classification pandas python scikit-learn seaborn
Last synced: 11 Apr 2026
https://github.com/blakeziegler/binary-classification-competition
Binary Classification of Insurance Crosselling Kaggle Competition
data-analysis data-science database kaggle kaggle-competition machine-learning python rstudio scikit-learn xgboost
Last synced: 17 Nov 2025
https://github.com/abhipatel35/svm-hyperparameter-optimization-for-breast-cancer
Utilizing SVM for breast cancer classification, this project compares model performance before and after hyperparameter tuning using GridSearchCV. Evaluation metrics like classification report showcase the effectiveness of the optimized model.
breast-cancer cancer-diagnosis classification data-analysis data-science gridsearchcv healthcare hyperparameter-tuning jupyter-notebook machine-learning medical-imaging pycharm python scikit-learn support-vector-machine svm
Last synced: 05 Feb 2026
https://github.com/lijesh010/ml_project_data_preprocessing
The main objective of this project is to design and implement a robust data preprocessing system that addresses common challenges such as missing values, outliers, inconsistent formatting, and noise. By performing effective data preprocessing, the project aims to enhance the quality, reliability, and usefulness of the data for machine learning.
data-cleaning data-exploration data-preprocessing machine-learning numpy pandas-python python scikit-learn
Last synced: 11 Apr 2026
https://github.com/alexsolov28/ml_course
Курс "Технология машинного обучения"
colab-notebooks jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn
Last synced: 05 Apr 2025
https://github.com/pranavgautam29/flight-price-prediction
The Flight Price Prediction project uses machine learning to forecast flight ticket prices based on historical data. Hosted on Streamlit Community Cloud and deployed via Streamlit, this application allows users to input flight details such as departure and arrival airports, travel dates, and class to receive accurate price predictions.
machine-learning prediction-model regression scikit-learn statistical-machine-learning streamlit
Last synced: 21 Feb 2026
https://github.com/hotequil/computer-vision
Study about computer vision.
jupyter-notebook matplotlib numpy python scikit-learn
Last synced: 13 Apr 2026
https://github.com/javi-cc/python-ml-portcanto
Portcanto és un projecte de simulació d'un trajecte en bicicleta. S'ha definit 4 tipus de ciclistes que es diferencien en el temps que tarda a fer el trajecte. L'objectiu és descobrir els 4 patrons amb l'algoritme de clustering KMeans.
clustering docker docker-compose kmeans machine-learning mlfow pydoc pylint python scikit-learn testing venv
Last synced: 13 Apr 2026
https://github.com/alisson-t-bucchi/cost-of-living-ai-ml
Cost of living predictor for some world cities, using AI and ML to scrap and predict cost for each selected city.
artificial-intelligence linear-regression machine-learning matplotlib pandas-dataframe python scikit-learn
Last synced: 18 Jun 2025
https://github.com/kkinzzza/meansalaryprediction
This project focuses on predicting the mean salary for job vacancies from HeadHunter.
catboost classic-ml regression salary-prediction scikit-learn
Last synced: 29 Apr 2026
https://github.com/oceanuz/car-price-regression
A comprehensive ML evaluation and improvement notebook for a car price prediction model. It includes topics such as scoring with r2, cross-validation, overfitting/underfitting diagnosis, and polynomial regression. *Ridge regression* is applied to reduce overfitting, and (GridSearchCV) techniques are used to find the best alpha hyperparameter.
cross-validation data-science grid-search hyperparameter-tuning machine-learning machine-learning-models model-evaluation overfitting python regression ridge-regression scikit-learn
Last synced: 11 Dec 2025
https://github.com/szymonrucinski/pippi-lang
Elegant 📑 text preprocessing pipeline 🚰 available as pip package 🐍 based on scikit-learn pipeline. Combines Transformer and Column Transformer into a single object.
data-cleaning data-science nlp pipeline scikit-learn
Last synced: 30 Apr 2026
https://github.com/scikit-learn/pairwise-distances-reductions-asv-suite
A dedicated asv suite for scikit-learn private PairwiseDistancesReductions
asv benchmarks cython scikit-learn
Last synced: 18 Jan 2026
https://github.com/mahendra077/knn-iris
knn-classifier ml numpy pandas scikit-learn seaborn
Last synced: 29 Apr 2026
https://github.com/lefteris-souflas/the-algorithmic-approach-to-winning-guess-who
This repository provides a systematic approach to winning the "Guess Who?" game through advanced machine learning techniques. It offers a comprehensive methodology for enhancing gameplay strategy and optimizing decision-making processes with meticulous attention to detail.
decision-tree drawio gradient-boosting graphviz-dot lightgbm machine-learning matplotlib numpy pandas python random-forest scikit-learn
Last synced: 09 Apr 2026
https://github.com/mramshaw/intro-to-ml
Intro to Machine Learning - Pattern Recognition for Fun and Profit
machine-learning matplotlib ml numpy pandas pip pip3 python scikit-learn scipy seaborn seaborn-plots sklearn statsmodels tensorflow weka
Last synced: 11 Apr 2026
https://github.com/sudarshanasrao/ee559-machine_learning-usc
USC graduate level Machine Learning course
cnn keras machine-learning neural-networks numpy python scikit-learn scipy tensorflow
Last synced: 11 Apr 2026
https://github.com/abideen-olawuwo/bulldozer-prediction
Predicting the Future Price of Bulldozer
machine-learning matplotlib numpy pandas python random-forest-regressor scikit-learn
Last synced: 11 Apr 2026
https://github.com/abhiagwl/ml_task_nyoffice
ml scikit-learn scipy sparse-matrix svm url-classification web-crawler
Last synced: 16 Jan 2026
https://github.com/ebadshabbir/decision_tree_algorithm
Decision Tree Classifier for Social Network Ads A Python implementation of a Decision Tree Classifier to predict user purchasing behavior based on age and estimated salary. Includes feature scaling, model evaluation (confusion matrix and accuracy), and visualizations of decision boundaries for both training and test sets.
decision-tree-classifier jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn
Last synced: 11 Apr 2026
https://github.com/lohiyah/real-estate-price-forecast
A Python-based app predicting real estate prices using machine learning. Built with Pandas, NumPy, Scikit-learn, Matplotlib, and Seaborn for data processing and visualization, and Flask for the web interface.
flask matplotlib numpy pandas python3 scikit-learn seaborn
Last synced: 11 Apr 2026
https://github.com/pekiiipy/credit-card-fraud-detection
🔍 Detect credit card fraud efficiently using advanced machine learning techniques, achieving high accuracy rates on a large dataset of transactions.
adasyn anomaly-detection class-imbalance credit-card-fraud data-visualization fraud fraud-detection frauddetection kaggle keras logistic-regression plotly-python postgresql random-forest scikit-learn tensorflow tree-model xgboost
Last synced: 11 Apr 2026
https://github.com/akhileshthite/india-population
ML (simple linear regression) model for predicting India's population.
machine-learning numpy pandas python scikit-learn
Last synced: 09 Apr 2026
https://github.com/prajakta1321/authencheck
Amdocs Gen AI Graduate Hackathon 2024-25- A comprehensive fact-checking and misinformation detection system that leverages cutting-edge AI models and multiple news sources to verify information circulating on social media
api bert-fine-tuning flask-application matplotlib ngrok-server nlp nlp-machine-learning numpy pandas python3 scikit-learn seaborn wandb
Last synced: 05 Apr 2026
https://github.com/bhuvan-s-prasad/streamlit-regression
A machine learning project that predicts housing prices in California using regression techniques. This project includes comprehensive exploratory data analysis, feature engineering, linear regression modeling, and an interactive Streamlit web application for making predictions.
california-housing-price-prediction exploratory-data-analysis linear-regression machine-learning matplotlib numpy pandas python scikit-learn seaborn streamlit supervised-learning
Last synced: 11 Apr 2026
https://github.com/nafis2508/mobile-price-predictor
Machine learning project that classifies mobile phones into price ranges (low, medium, high, very high) based on hardware specifications.
classification data-science eda jupyter-notebook kagle knn logistic-regression machine-learning mobile-price-prediction python scikit-learn xgboost
Last synced: 24 Jun 2026
https://github.com/nicolasvauche/vinylexplore_ml
VinyleXplore est un moteur de recommandation de vinyles intelligent basé sur l'humeur et le contexte d'écoute de l'utilisateur. Il utilise FastAPI pour exposer une API REST et scikit-learn pour entraîner un modèle de Machine Learning qui améliore la pertinence des suggestions.
machine-learning python scikit-learn vinyle
Last synced: 17 May 2026
https://github.com/sonnguyen25/hackbeanpot-2025
EarthBeats - An Eco-friendly Pocket Road Trip Companion
css flask googlemaps-api html humeai knn-model mongodb nextjs nodejs numpy pandas python reactjs recommender-system scikit-learn spotify tailwindcss
Last synced: 11 Apr 2026
https://github.com/netcodez/climate-prediction-pipeline
Predicting London's climate using machine learning techniques. This project aims to forecast mean temperature in Celsius (°C) using various regression models and logging experiments with MLflow
huggingface machine-learning mlflow mlflow-tracking mlflow-tracking-server mlops python scikit-learn streamlit
Last synced: 09 Apr 2026
https://github.com/snigdho8869/regression-analysis-projects
Repository showcasing a collection of diverse regression analysis projects including salary prediction and more.
deep-learning deep-learning-regression deeplearning gradient-boosting-regressor keras linear-regression machine-learning machine-learning-algorithms random-forest-regression random-forest-regressor regression regression-algorithms regression-analysis regression-trees scikit-learn tensorflow voting-regressor xgboost-regression
Last synced: 05 May 2026
https://github.com/parbhat-cpp/suicidal-ml
A machine learning/NLP-based system to identify signs of suicidal ideation from user text inputs.
bash cicd classification docker fastapi githubactions jinja2 jupyter-notebook machine-learning natural-language-processing nlp numpy pandas python scikit-learn
Last synced: 11 Apr 2026
https://github.com/pramodyasahan/model-selection
This repository explores and compares different regression models for predicting continuous outcomes. This repository includes implementations and evaluations of five key regression models. The primary goal is to demonstrate how each model works, evaluate their performance using R-squared values, and guide users in selecting the best model.
machine-learning modelselection numpy pandas python regression scikit-learn
Last synced: 08 Mar 2025
https://github.com/djdhairya/pneumonia-detection
https://youtu.be/1SQIrxhMuUs?si=lF2cg8eTnETf-5Qx
cnn cv deep-learning flask gunicorn keras matplotlib opencv pandas pillow scikit-learn seaborn tensorflow vgg19
Last synced: 11 Apr 2026