Projects in Awesome Lists tagged with sckit-learn
A curated list of projects in awesome lists tagged with sckit-learn .
https://github.com/Dan-Boat/PyESD
Python Package for Empirical Statistical Downscaling. pyESD is under active development and all colaborators are welcomed. The purpose of the package is to downscale any climate variables e.g. precipitation and temperature using predictors from reanalysis datasets (eg. ERA5) to point scale. pyESD adopts many ML and AL as the transfer function.
deep-learning downscaling ensemble-machine-learning machine-learning precipitation sckit-learn tensorflow2
Last synced: 20 Jul 2025
https://github.com/ayyucedemirbas/machine_learning_algorithms
Machine learning fundamentals
data-science hacktoberfest keras machine-learning pytorch sckit-learn tensorflow
Last synced: 31 Jul 2025
https://github.com/legolasvzla/django-twitter-spark
Thesis project: topic categorization and sentiment analysis on twitter with Apache Spark
apache-spark apache-zookeeper django django-rest-framework nltk pyspark python3 react react-bootstrap react-hooks sckit-learn sentiment-analysis swagger topic-classification tweepy word-cloud
Last synced: 14 Jul 2025
https://github.com/mint-lab/dl_tutorial
Machine Learning and Deep Learning Tutorial
deep-learning dl machine-learning ml pytorch sckit-learn
Last synced: 28 Oct 2025
https://github.com/gympohnpimol/Last-Mile-Logistics
cplex cplex-optimization-solver cplex-tutorial cvrp delivery-service exact-algorithm k-means-clustering lastmile lastmile-delivery mathematical-programming matplotlib nni numpy optimization pandas python sckit-learn vrp vrptw
Last synced: 05 Apr 2025
https://github.com/gesiscss/sexism_custom_classifier
Custom classifiers to detect sexist language.
bert natural-language-processing nlp sckit-learn sexism-detection
Last synced: 07 May 2025
https://github.com/dataspieler12345/python-for-ds-ml
My Python learning experience 📚🖥📳📴💻🖱✏
anova-analysis anova-test beautifulsoup lasso-regression machine-learning-algorithms matplotlib matplotlib-pyplot monte-carlo-methods nltk-python numpy-library pandas-library polynomial-regression randomforestclassifier randomforestregressor ridge-regression scikitlearn-machine-learning scipy sckit-learn seaborn t-distribution
Last synced: 10 Jun 2025
https://github.com/shanathvemula/framework_for_superwised_prediction
This is the framework for supervised algorithms in mechine learning
bootstrap4 decision-tree-classifier django django-file-upload drop-function k-nearest-neighbours logistic-regression matplotlib media-files neural-network neural-networks numpy-arrays pandas preprocessing python sckit-learn static-files supervised-learning-algorithms support-vector-machines
Last synced: 15 May 2025
https://github.com/kalebu/spam-filter-using-machine-learning
A python code to training your own spam filter in Python
machine-learning machine-learning-algorithms nlp python-projects python3 sckit-learn spam-filter spam-sms
Last synced: 01 Apr 2025
https://github.com/kudzaiprichard/coin-compass
A microservice API for predicting cryptocurrency via machine learning model
binance binance-api cryptocurrency flask jason-web-tokens java machine-learning microservice pandas python sckit-learn seaborn sklearn springboot zipkin zipkin-sleuth
Last synced: 24 Feb 2025
https://github.com/bhattbhavesh91/hummingbird-demo
A small demo which shows how Microsoft's Hummingbird can scale ML Model Inferences using GPU's
demo gpu hummingbird machine-learning neural-networks pytorch sckit-learn tensor-computation
Last synced: 07 Sep 2025
https://github.com/khaymanii/gold-price-detection-model
This model is built using python and Random Forest Regressor algorithm
matplotlib numpy pandas python sckit-learn
Last synced: 21 Jun 2025
https://github.com/parth-jatav/movie-recommendation-project
An ML-based movie recommendation system built using a dataset from Kaggle. This project preprocesses movie data to generate recommendations based on cosine similarity. The system uses Python libraries such as Pandas, NumPy, NLTK, and sklearn for data processing and machine learning. The user interface is developed with Streamlit.
ml movie-recommendation-app sckit-learn
Last synced: 26 Sep 2025
https://github.com/micahondiwa/applied-data-science
A collection of 8 Applied Data Science projects.
ab-testing clustering data-science data-visualization decision-trees ensemble-learning logistic-regression mongodb pymongo regression regression-algorithms regression-models sckit-image sckit-learn sckit-learn-pipeline sqlite3
Last synced: 12 Oct 2025
https://github.com/kareimgazer/classify-cifar-100
classifying CIFAR-100 data set using MCSVM and Deep Conv Net
cifar100 classification convolutional-neural-networks deep-learning feature-extraction image-classification machine-learning neural-network neural-networks pca python sckit-learn tensorflow
Last synced: 29 Mar 2025
https://github.com/khaymanii/fake_news_prediction_model
This model was built using python and logistic regression algorithm
matplotlib numpy pandas python sckit-learn
Last synced: 15 Sep 2025
https://github.com/erenokur/machine-learning-playground
Experiment with machine learning and AI algorithms, write guides, and documents.
hidden-markov-model machine-learning numpy python pytorch sckit-learn tensorflow
Last synced: 31 Mar 2025
https://github.com/rhazra-003/indiebot
A basic chatbot which answers questions based on history of India
chatbot jupyter-notebook nlp nltk numpy python3 sckit-learn
Last synced: 20 Mar 2025
https://github.com/subh888999/calories_nutritions_predictions
A machine learning-based Streamlit app that predicts daily calorie needs and provides a personalized macronutrient and hydration plan based on user lifestyle inputs.
bmi-calculator calorie-prediction data-science fitness healthcare huggingface machine-learning multioutput-regressor nutrition python regression sckit-learn streamlit
Last synced: 01 Jul 2025
https://github.com/hetuvpatel/ml-diabetes-risk-progression-stage
Machine learning project analyzing diabetes risk progression using K-Means and Hierarchical clustering techniques on the Pima Indian Diabetes dataset. 🧠📊
cluster-analysis data-visualization heirarchical-clustering kmap kmeans machine-learning matplotlib sckit-learn seaborn
Last synced: 23 Sep 2025
https://github.com/akankshakusf/ml_bird-strike-cost-prediction-project
This is my Machine Learning Project for the Master Program at USF
keras machine-learning machine-learning-algorithms muma python sckit-learn tenserflow universityofsouthflorida usf
Last synced: 07 Apr 2025
https://github.com/rkarahul/data_science_project_2023
machine-learning matplotlib numpy pandas python sckit-learn seborn
Last synced: 16 Oct 2025
https://github.com/Udacity-MachineLearning-Internship/Titanic-Survival-Model
Applying Titanic Survival Model with decision trees in python
decision-trees machine-learning sckit-learn
Last synced: 17 Jul 2025
https://github.com/khaymanii/wine-quality-prediction-model
This is a model built using Python and Random Forest Classifier which is an ensemble algorithm and also a supervised learning algorithm
matplotlib numpy pandas python sckit-learn
Last synced: 31 Dec 2025
https://github.com/lynk4/ai-ml
Artificial Intelligence (AI) and Machine Learning (ML) ....;)
ai artificial-intelligence huggingface huggingface-transformers machine-learning-algorithms machinelearning numpy pandas prediction-model predictions python python3 sckit-learn sentiment-analysis
Last synced: 21 Feb 2025
https://github.com/md-emon-hasan/7-explore-different-classifier-ml-app
A project exploring various classification algorithms, showcasing their implementation, comparison, and evaluation using Python and scikit-learn.
k-nearest-neighbors knn random-forest sckit-learn streamlit support-vector-machine svm
Last synced: 14 Jun 2025
https://github.com/debasish-dutta/nlp-disaster-prediction
This repo contains my NLP processing of tweets determining whether they are disaster tweets or not of a kaggle open competition.
kaggle-competition nlp-machine-learning sckit-learn
Last synced: 04 Oct 2025
https://github.com/thamirisq/hackday_dengue
This was a Machine Learning challenge made in group focused on directing financial resources and community interventions for dengue control. The project was based on fictitious data provided by the DS Team, who organized the challenge. We were a group of five persons who developed the result.
machinelearning matplotlib-pyplot metrics sckit-learn seaborn sklearn xgboost
Last synced: 07 Oct 2025
https://github.com/aryanyadav-dev/celestial-spectroscopy
Developed a Deep learning model using TensorFlow and Keras to classify synthetic spectral data from celestial objects, including stars and galaxies. Utilizing a Convolutional Neural Network (CNN), the model analyzes spectroscopic features and achieves high accuracy in predicting object classifications.
cnn keras matplotlib python sckit-learn tensorflow
Last synced: 23 Feb 2025
https://github.com/debasish-dutta/spam-email-classifier
Created spam-email classifier models using both sckit-learn modules and through the normal process using probabilities
data-science jupyter-notebook sckit-learn spam-email-classifier webapp
Last synced: 13 Aug 2025
https://github.com/mendez-luisjose/weather-prediction-with-scikit-learn-streamlit-and-deployed-with-flask
Weather Prediction with Scikit Learn, Streamlit and Deployed with Flask
Last synced: 14 Mar 2025
https://github.com/abdul-rafay19/internintelligence_machinelearningintern
A collection of hands-on projects completed during my Machine Learning Virtual Internship at Intern Intelligence. Includes hyperparameter tuning using Scikit-Learn and Optuna, and deep learning model development for image and text data using TensorFlow, Keras, and PyTorch.
ai algorithm algorithms artificial-intelligence intelligence intern-intelligence internship machine-learning machine-learning-algorithms machinelearning programming programming-language python pytorch sckit-learn tenserflow
Last synced: 24 Oct 2025
https://github.com/khaymanii/medical_insurance_cost_prediction-_model
This Model was built using Python and Linear Regression algorithm
matplotlib numpy pandas python sckit-learn seaborn
Last synced: 17 Oct 2025
https://github.com/shreyadhir/classification-penguins
Classification of Penguins using K-Means Clustering developed with Scikit-Learn
kmeans-clustering python sckit-learn
Last synced: 21 Jul 2025
https://github.com/khaymanii/customer_segmentation_model
This model was built using Python and KMeans Clustering algorithm
matplotlib numpy pandas python sckit-learn seaborn
Last synced: 16 Jun 2025
https://github.com/phenomsg/ml-notebook
This project is designed for personal learning and exploration of fundamental machine learning concepts.
decision-trees linear-regression logistic-regression machine-learning model-evaluation-metrics neural-network opencv pandas python3 recommendation-system sckit-learn supervised-machine-learning tensorflow2 unsupervised-machine-learning
Last synced: 24 Mar 2025
https://github.com/khaymanii/heart-disease-prediction-model
This repository contains a model built using python and Logistic Regression algorithm
matplotlib numpy pandas python sckit-learn
Last synced: 14 Oct 2025
https://github.com/rishieeee/spam-email-classifier
A simple machine learning project that classifies emails as spam or ham using TF-IDF and a Multinomial Naive Bayes model. The project covers data cleaning, text preprocessing, feature extraction, model training, and evaluation. A great beginner-friendly introduction to NLP and ML workflows.
multinomial-naive-bayes numpy pandas python sckit-learn tf-idf
Last synced: 19 Nov 2025
https://github.com/4702chahat/rock-vs-mine
This Project is based on Machine Learning which uses Logistic Regression model for predicting whether the object detected by Submarine is Rock or Mine
accuracy-score data-science deep-learning jupyter-notebook logestic-regression machine-learning numpy-arrays pandas-dataframe predicitve predictive-model python rock-vs-mine sckit-learn sklearn-classifier sklearn-library sklearn-metrics
Last synced: 24 Mar 2025
https://github.com/udacity-machinelearning-internship/titanic-survival-model
Applying Titanic Survival Model with decision trees in python
decision-trees machine-learning sckit-learn
Last synced: 18 Mar 2025
https://github.com/debasish-dutta/car-price-prediction
An end to end ML project based on the kaggle dataset of used car price regression data.
data-science machine-learning sckit-learn
Last synced: 12 Mar 2025
https://github.com/priyanshscpp/ECE3491-ML_Spam_Detector-Practice
IIT Delhi COL341 Machine Learning
pandas python regression-models sckit-learn
Last synced: 12 May 2025
https://github.com/yareva/linear-regression-predictor
Linear Regression Predictor Model
matplotlib numpy pandas python sckit-learn
Last synced: 10 Apr 2025
https://github.com/earanda1979/calories_nutritions_predictions
Personalized nutrition and caloric recommendations using machine learning. Optimize your diet for weight loss, muscle gain, or maintenance. 🌟🍽️
bmi-calculator calorie-prediction data-science fitness healthcare huggingface machine-learning multioutput-regressor nutrition python regression sckit-learn streamlit
Last synced: 04 Jul 2025
https://github.com/anupreet02/deep-learning-challenge
The objective of this analysis is to develop a deep learning model capable of predicting whether a charity funded by Alphabet Soup is likely to be successful. The model is built using the charity dataset, which contains various features related to each charity, and is used to classify charities as successful or not based on these features.
numpy pandas sckit-learn tensorflow
Last synced: 16 Mar 2025
https://github.com/alphan26/optimal-logistics-locator
This is a project in which we estimate the biomass avaibility of places due to their index and determine the optimal preprocessing depot and biorafinery in Gujarat, India
numpy pandas python sckit-learn
Last synced: 23 Jun 2025
https://github.com/lhcee3/bc-classification
Breast Cancer classification done using both Machine Learning and Deep Learning.
breast-cancer breast-cancer-classification deep-learning machine-learning neural-networks sckit-learn tensorflow
Last synced: 14 Oct 2025
https://github.com/rkschroeder/portfolio
This repository contains my portfolio of data science projects.
matplotlib numpy pandas sckit-learn seaborn
Last synced: 05 Oct 2025
https://github.com/priyanshscpp/ece3491-ml_spam_detector-practice
IIT Delhi COL341 Machine Learning
pandas python regression-models sckit-learn
Last synced: 09 Oct 2025
https://github.com/norafrn/customer-clustering
Implemented a full K-Means clustering pipeline using Python, scikit-learn, and Pandas to segment customers in the Instacart dataset based on shopping behaviour. Automated preprocessing, feature scaling, and visualization (PCA, heatmaps).
heatmap k-means-clustering pandas pca-analysis sckit-learn
Last synced: 09 Oct 2025
https://github.com/sunilpanda14/polytoxiq
A Polymer Toxicity Prediction Tool using PSMILE Strings
autogluon cosine-similarity dnn molecule polymer sckit-learn sentence-transformers tox21 toxicity-prediction transfer-learning zeroshot-learning
Last synced: 02 Apr 2025
https://github.com/philiptitus/height-prediction
Built an ML model to predict height of a person based on their age.
linear-regression machine-learning machine-learning-algorithms matplotlib numpy sckit-learn supervised-learning
Last synced: 03 Apr 2025
https://github.com/debasish-dutta/heart-disease-project
This contains the notebook of the heart disease prediction ML model.
Last synced: 12 Mar 2025
https://github.com/udacity-machinelearning-internship/feature-scaling
Applying feature scaling with linear regression in python
feature-scaling linear-regression machine-learning sckit-learn
Last synced: 18 Mar 2025
https://github.com/sarah-ribeiro/linear_regression_data_science_ml_ia
This project uses scikit-learn for linear regression analysis. With a dataset, we compare variables using functions like LinearRegression(). Guided by curiosity and machine learning, we seek patterns and correlations, inching closer to unraveling the data's secrets.
artificial-intelligence jupyter-notebook machine-learning matplotlib matplotlib-pyplot pandas python sckit-learn
Last synced: 16 Jun 2025
https://github.com/muhkartal/e-forecast
machine learning-powered energy consumption prediction system that analyzes historical data to forecast future energy usage trends, optimizing efficiency and sustainability.
fastapi joblib matplotlib numpy pandas pydantic pytest sckit-learn seaborn tensorflow tqdm uvicorn xgboost yaml
Last synced: 18 Mar 2025
https://github.com/venkat-0706/titanic-survival-prediction
A machine learning project predicting Titanic passenger survival using data preprocessing, feature engineering, and model optimization with Logistic Regression, Random Forest, and XGBoost.
classification-report confusion-matrix gridsearchcv matplotlib numpy onehot-encoder pandas sckit-learn seaborn train-test-split xgboost
Last synced: 04 Apr 2025
https://github.com/ilijamihajlovic/random-forest-classification
This project demonstrates how to build a Random Forest Classifier to predict music genres using audio feature data from Spotify. The model is trained on a curated subset of the spotify_tracks.csv dataset, focusing on popular genres such as pop, country, hip-hop, rock, latin, edm and more.
ai artificial-intelligence machine-learning machine-learning-algorithms machinelearning pandas python random-forest random-forest-classifier sckiit-learn sckit-learn
Last synced: 18 Jun 2025
https://github.com/shankhadweep/diabetes-prediction-systemv3
This project demonstrates a machine learning solution for predicting diabetes based on user-provided health data. The application uses Streamlit for an interactive web interface and advanced interpretability tools like SHAP and permutation importance to explain model predictions.
jupyter-notebook machine-learning matplotlib numpy pandas plotly randomforestclassifier sckit-learn seaborn streamlit-webapp
Last synced: 11 Sep 2025
https://github.com/udacity-machinelearning-internship/regularization
Implementing regularization using sckit-learn
machine-learning regularization sckit-learn
Last synced: 11 Jul 2025
https://github.com/abdiasarsene/lexemotion-an-intelligent-dashboard
LexEmotion is a cutting-edge NLP dashboard designed for legal professionals, law firms, and investigators. It leverages the latest advances in Natural Language Processing to extract emotions, detect key themes, and summarize incident or legal reports — in multiple languages and formats.
fitz googletrans langdetect matplotlib numpy pandas sckit-learn spacy transformer
Last synced: 23 Jun 2025
https://github.com/amr-yasser226/machine-learning-for-network-intrusion-detection
A complete pipeline for network intrusion detection comparing label encoding and one‑hot encoding, with SMOTE resampling, feature selection, and ensemble modeling using scikit‑learn and XGBoost, also this was phase one of our University's "CSAI 253- Machine Learning" course.
csai-253 cybersecurity cybersecurity-training ensamble-methods feature-engineering imbalanced-learning machine-learning machine-learning-algorithms network-intrusion-detection one-hot-encoding sckit-learn smote tree-based-model xgboost zewailcity
Last synced: 17 Jul 2025
https://github.com/Udacity-MachineLearning-Internship/Regularization
Implementing regularization using sckit-learn
machine-learning regularization sckit-learn
Last synced: 17 Jul 2025
https://github.com/Udacity-MachineLearning-Internship/Feature-Scaling
Applying feature scaling with linear regression in python
feature-scaling linear-regression machine-learning sckit-learn
Last synced: 17 Jul 2025
https://github.com/somyaagar/bengaluru_house_price_prediction
bootstrap flask-application html js jupyter-notebook pandas python sckit-learn
Last synced: 30 Dec 2025
https://github.com/owenl0000/housepricesproject
Kaggle Project
data-analysis data-science data-visualization gridsearchcv kaggle-competition kaggle-dataset linear-regression machine-learning machine-learning-algorithms numpy onehot-encoding ordinal-encoding pandas python random-forest-regression sckit-learn seaborn streamlit xgboost-regressor
Last synced: 30 Dec 2025
https://github.com/dwija12903/ai-lab
A collection of practical implementations from my AI Labs course
keras numpy sckit-learn tensorflow
Last synced: 05 Apr 2025
https://github.com/aranzadata/taxidemandpredictor
Modelo de regresión de series temporales para predecir la demanda de taxis en un aeropuerto de gran afluencia, optimizando la asignación de la flota mediante la incorporación de características temporales y categóricas utilizando Scikit-learn
forecasting scipy sckit-learn seasonality statsmodels time-series-analysis
Last synced: 29 Mar 2025
https://github.com/debasish-dutta/titanic
This is the basic go-to beginner-friendly Titanic Dataset which predicts wheater one survives the Titanic disaster.
Last synced: 22 Jul 2025
https://github.com/ebadshabbir/k-means-clustering
This repository demonstrates the implementation of the K-Means clustering algorithm to segment mall customers based on their annual income and spending behavior. By identifying distinct customer clusters, businesses can gain insights into customer groups and create targeted marketing strategies to improve customer engagement.
clustering jupyter-notebook k kmeans-clustering machine-learning matplotlib-pyplot pandas python sckit-learn
Last synced: 27 Jun 2025
https://github.com/rkarahul/ok.win-big-small-predictor
Predict the next “Big” or “Small” outcome on the OK.Win lottery-style game using OCR + time-series features + ML.
joblib numpy opencv-python paddleocr paddlepaddle paddlepaddle-gpu pandas python sckit-learn
Last synced: 03 Nov 2025
https://github.com/kuennethgroup/polytoxiq
PolyToxiQ: A WebApp for Polymer Toxicity Prediction using Transfer Learning from Tox21 Additives
autogluon dnn molecule polymer sckit-learn sentence-transformers tox21 toxicity-classification transfer-learning
Last synced: 23 Jul 2025
https://github.com/zanuarts/customer-behaviour
Find Customer Behaviour with decision tree.
decision-tree-classifier python sckit-learn
Last synced: 14 Mar 2025
https://github.com/debasish-dutta/bulldozer-price-regression
Contains another of my ML model of kaggle dataset
Last synced: 25 Dec 2025
https://github.com/morsalinislamshapon/diabetes-prediction-systemv3
This repository contains a machine learning model that predicts diabetes using user health data. It features an interactive web interface built with Streamlit and provides insights into model predictions through SHAP and permutation importance. 🐙🌟
jupyter-notebook machine-learning matplotlib numpy pandas plotly randomforestclassifier sckit-learn seaborn streamlit-webapp
Last synced: 29 Jul 2025
https://github.com/rayniel95/erecog
Simple ethnicity classifier using transfer learning with VGG. // unfinished
artificial-intelligence artificial-neural-networks artificialintelligence deep-learning deeplearning-ai ethnicity-classifier fairface-dataset jupyter-notebook keras machine-learning machinelearning machinelearning-python neural-networks notebook-jupyter numpy pandas-python python python3 sckit-learn
Last synced: 31 Jul 2025
https://github.com/yasinefeee/parkspotter_pretest_enviroment
The ParkSpotter project is designed to detect the occupancy status of parking spots in a simulation environment. Using a toy model, a camera system, and a machine learning model, this system identifies whether a parking space is EMPTY or NOT EMPTY in real-time.
ai-systems classifier computer-vision cv2 numpy opencv parking-spot-detection parking-spots python sckit-image sckit-learn simulation-environment svm svm-classifier
Last synced: 08 Aug 2025
https://github.com/abbaszaidi123/music-recommendation-system
mlops nnlp pandas python sckit-learn
Last synced: 25 Dec 2025
https://github.com/muhkartal/xai_dashboard
an interactive AI dashboard for machine learning model analysis and explainability, supports model training, dataset exploration, feature importance analysis, and SHAP-based explanations for both individual predictions and overall model behavior, compare multiple models, visualize insights, and export results seamlessly
joblib numpy pandas python sckit-learn shap streamlit xgboost
Last synced: 07 Oct 2025
https://github.com/devesh8423/machine_learning
Machine Learning practice projects, Jupyter notebooks, and datasets for learning regression, classification, and data analysis.
classification data-analysis data-science data-visualization jupyter-notebook machine-learning matplotlib ml-project numpy-library pandas python regression sckit-learn seaborn
Last synced: 19 Aug 2025
https://github.com/ubeydgur/iris-flower-classifier
Classification of iris flowers according to leaf characteristics.
classification machine-learning matplotlib pandas sckit-learn seaborn sklearn
Last synced: 24 Aug 2025
https://github.com/khaymanii/credit-card-fraud-detection-model
This model was built using python and Logistic Regression Machine Learning algorithm
matplotlib numpy pandas python sckit-learn
Last synced: 14 Mar 2025
https://github.com/pietrapaz/oficina_cd_dados
Arquivos da oficina de Ciência de Dados ✅
colab-notebook powerbi python r rlanguage sckit-learn sql
Last synced: 29 Jun 2025
https://github.com/alevp-dev/saber11-analytics
Initial data analysis for an artificial intelligence bootcamp project
knnimputer linear-regression matplotlib pandas python sckit-learn seaborn
Last synced: 30 Jun 2025