An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with sckit-learn

A curated list of projects in awesome lists tagged with sckit-learn.

https://github.com/Dan-Boat/PyESD

Python package for Empirical Statistical Downscaling. pyESD is under active development and all collaborators are welcome. The purpose of the package is to downscale any climate variable, e.g. precipitation or temperature, to point scale using predictors from reanalysis datasets (e.g. ERA5). pyESD adopts many ML and AI methods as the transfer function.

deep-learning downscaling ensemble-machine-learning machine-learning precipitation sckit-learn tensorflow2

Last synced: 20 Jul 2025

https://github.com/mint-lab/dl_tutorial

Machine Learning and Deep Learning Tutorial

deep-learning dl machine-learning ml pytorch sckit-learn

Last synced: 28 Oct 2025

https://github.com/gesiscss/sexism_custom_classifier

Custom classifiers to detect sexist language.

bert natural-language-processing nlp sckit-learn sexism-detection

Last synced: 07 May 2025

https://github.com/bhattbhavesh91/hummingbird-demo

A small demo that shows how Microsoft's Hummingbird can scale ML model inference using GPUs

demo gpu hummingbird machine-learning neural-networks pytorch sckit-learn tensor-computation

Last synced: 07 Sep 2025

https://github.com/khaymanii/gold-price-detection-model

This model is built using Python and the Random Forest Regressor algorithm

matplotlib numpy pandas python sckit-learn

Last synced: 21 Jun 2025

https://github.com/parth-jatav/movie-recommendation-project

An ML-based movie recommendation system built using a dataset from Kaggle. This project preprocesses movie data to generate recommendations based on cosine similarity. The system uses Python libraries such as Pandas, NumPy, NLTK, and sklearn for data processing and machine learning. The user interface is developed with Streamlit.

ml movie-recommendation-app sckit-learn

Last synced: 26 Sep 2025

https://github.com/khaymanii/fake_news_prediction_model

This model was built using Python and the logistic regression algorithm

matplotlib numpy pandas python sckit-learn

Last synced: 15 Sep 2025

https://github.com/erenokur/machine-learning-playground

Experiment with machine learning and AI algorithms, write guides, and documents.

hidden-markov-model machine-learning numpy python pytorch sckit-learn tensorflow

Last synced: 31 Mar 2025

https://github.com/rhazra-003/indiebot

A basic chatbot that answers questions about the history of India

chatbot jupyter-notebook nlp nltk numpy python3 sckit-learn

Last synced: 20 Mar 2025

https://github.com/subh888999/calories_nutritions_predictions

A machine learning-based Streamlit app that predicts daily calorie needs and provides a personalized macronutrient and hydration plan based on user lifestyle inputs.

bmi-calculator calorie-prediction data-science fitness healthcare huggingface machine-learning multioutput-regressor nutrition python regression sckit-learn streamlit

Last synced: 01 Jul 2025

https://github.com/hetuvpatel/ml-diabetes-risk-progression-stage

Machine learning project analyzing diabetes risk progression using K-Means and Hierarchical clustering techniques on the Pima Indian Diabetes dataset. 🧠📊

cluster-analysis data-visualization heirarchical-clustering kmap kmeans machine-learning matplotlib sckit-learn seaborn

Last synced: 23 Sep 2025

https://github.com/Udacity-MachineLearning-Internship/Titanic-Survival-Model

Applying a Titanic survival model with decision trees in Python

decision-trees machine-learning sckit-learn

Last synced: 17 Jul 2025

https://github.com/khaymanii/wine-quality-prediction-model

This is a model built using Python and a Random Forest Classifier, which is a supervised ensemble learning algorithm

matplotlib numpy pandas python sckit-learn

Last synced: 31 Dec 2025

https://github.com/md-emon-hasan/7-explore-different-classifier-ml-app

A project exploring various classification algorithms, showcasing their implementation, comparison, and evaluation using Python and scikit-learn.

k-nearest-neighbors knn random-forest sckit-learn streamlit support-vector-machine svm

Last synced: 14 Jun 2025

https://github.com/debasish-dutta/nlp-disaster-prediction

This repo contains my NLP processing of tweets to determine whether they are disaster tweets, done for an open Kaggle competition.

kaggle-competition nlp-machine-learning sckit-learn

Last synced: 04 Oct 2025

https://github.com/thamirisq/hackday_dengue

This was a group machine learning challenge focused on directing financial resources and community interventions for dengue control. The project was based on fictitious data provided by the DS Team, who organized the challenge. Our group of five people developed the result.

machinelearning matplotlib-pyplot metrics sckit-learn seaborn sklearn xgboost

Last synced: 07 Oct 2025

https://github.com/aryanyadav-dev/celestial-spectroscopy

Developed a Deep learning model using TensorFlow and Keras to classify synthetic spectral data from celestial objects, including stars and galaxies. Utilizing a Convolutional Neural Network (CNN), the model analyzes spectroscopic features and achieves high accuracy in predicting object classifications.

cnn keras matplotlib python sckit-learn tensorflow

Last synced: 23 Feb 2025

https://github.com/debasish-dutta/spam-email-classifier

Created spam-email classifier models both with scikit-learn modules and by hand using probabilities

data-science jupyter-notebook sckit-learn spam-email-classifier webapp

Last synced: 13 Aug 2025

https://github.com/mendez-luisjose/weather-prediction-with-scikit-learn-streamlit-and-deployed-with-flask

Weather Prediction with Scikit Learn, Streamlit and Deployed with Flask

sckit-learn streamlit

Last synced: 14 Mar 2025

https://github.com/abdul-rafay19/internintelligence_machinelearningintern

A collection of hands-on projects completed during my Machine Learning Virtual Internship at Intern Intelligence. Includes hyperparameter tuning using Scikit-Learn and Optuna, and deep learning model development for image and text data using TensorFlow, Keras, and PyTorch.

ai algorithm algorithms artificial-intelligence intelligence intern-intelligence internship machine-learning machine-learning-algorithms machinelearning programming programming-language python pytorch sckit-learn tenserflow

Last synced: 24 Oct 2025

https://github.com/khaymanii/medical_insurance_cost_prediction-_model

This model was built using Python and the Linear Regression algorithm

matplotlib numpy pandas python sckit-learn seaborn

Last synced: 17 Oct 2025

https://github.com/shreyadhir/classification-penguins

Classification of Penguins using K-Means Clustering developed with Scikit-Learn

kmeans-clustering python sckit-learn

Last synced: 21 Jul 2025

https://github.com/khaymanii/customer_segmentation_model

This model was built using Python and the KMeans clustering algorithm

matplotlib numpy pandas python sckit-learn seaborn

Last synced: 16 Jun 2025

https://github.com/khaymanii/heart-disease-prediction-model

This repository contains a model built using Python and the Logistic Regression algorithm

matplotlib numpy pandas python sckit-learn

Last synced: 14 Oct 2025

https://github.com/rishieeee/spam-email-classifier

A simple machine learning project that classifies emails as spam or ham using TF-IDF and a Multinomial Naive Bayes model. The project covers data cleaning, text preprocessing, feature extraction, model training, and evaluation. A great beginner-friendly introduction to NLP and ML workflows.

multinomial-naive-bayes numpy pandas python sckit-learn tf-idf

Last synced: 19 Nov 2025

https://github.com/4702chahat/rock-vs-mine

This machine learning project uses a Logistic Regression model to predict whether an object detected by a submarine is a rock or a mine

accuracy-score data-science deep-learning jupyter-notebook logestic-regression machine-learning numpy-arrays pandas-dataframe predicitve predictive-model python rock-vs-mine sckit-learn sklearn-classifier sklearn-library sklearn-metrics

Last synced: 24 Mar 2025

https://github.com/udacity-machinelearning-internship/titanic-survival-model

Applying a Titanic survival model with decision trees in Python

decision-trees machine-learning sckit-learn

Last synced: 18 Mar 2025

https://github.com/debasish-dutta/car-price-prediction

An end-to-end ML project based on the Kaggle dataset of used car price regression data.

data-science machine-learning sckit-learn

Last synced: 12 Mar 2025

https://github.com/yareva/linear-regression-predictor

Linear Regression Predictor Model

matplotlib numpy pandas python sckit-learn

Last synced: 10 Apr 2025

https://github.com/earanda1979/calories_nutritions_predictions

Personalized nutrition and caloric recommendations using machine learning. Optimize your diet for weight loss, muscle gain, or maintenance. 🌟🍽️

bmi-calculator calorie-prediction data-science fitness healthcare huggingface machine-learning multioutput-regressor nutrition python regression sckit-learn streamlit

Last synced: 04 Jul 2025

https://github.com/anupreet02/deep-learning-challenge

The objective of this analysis is to develop a deep learning model capable of predicting whether a charity funded by Alphabet Soup is likely to be successful. The model is built using the charity dataset, which contains various features related to each charity, and is used to classify charities as successful or not based on these features.

numpy pandas sckit-learn tensorflow

Last synced: 16 Mar 2025

https://github.com/alphan26/optimal-logistics-locator

This is a project in which we estimate the biomass availability of places based on their index and determine the optimal preprocessing depot and biorefinery in Gujarat, India

numpy pandas python sckit-learn

Last synced: 23 Jun 2025

https://github.com/lhcee3/bc-classification

Breast Cancer classification done using both Machine Learning and Deep Learning.

breast-cancer breast-cancer-classification deep-learning machine-learning neural-networks sckit-learn tensorflow

Last synced: 14 Oct 2025

https://github.com/rkschroeder/portfolio

This repository contains my portfolio of data science projects.

matplotlib numpy pandas sckit-learn seaborn

Last synced: 05 Oct 2025

https://github.com/norafrn/customer-clustering

Implemented a full K-Means clustering pipeline using Python, scikit-learn, and Pandas to segment customers in the Instacart dataset based on shopping behaviour. Automated preprocessing, feature scaling, and visualization (PCA, heatmaps).

heatmap k-means-clustering pandas pca-analysis sckit-learn

Last synced: 09 Oct 2025

https://github.com/debasish-dutta/heart-disease-project

This contains the notebook of the heart disease prediction ML model.

data-science sckit-learn

Last synced: 12 Mar 2025

https://github.com/udacity-machinelearning-internship/feature-scaling

Applying feature scaling with linear regression in Python

feature-scaling linear-regression machine-learning sckit-learn

Last synced: 18 Mar 2025

https://github.com/sarah-ribeiro/linear_regression_data_science_ml_ia

This project uses scikit-learn for linear regression analysis. With a dataset, we compare variables using functions like LinearRegression(). Guided by curiosity and machine learning, we seek patterns and correlations, inching closer to unraveling the data's secrets.

artificial-intelligence jupyter-notebook machine-learning matplotlib matplotlib-pyplot pandas python sckit-learn

Last synced: 16 Jun 2025

https://github.com/muhkartal/e-forecast

Machine learning-powered energy consumption prediction system that analyzes historical data to forecast future energy usage trends, optimizing efficiency and sustainability.

fastapi joblib matplotlib numpy pandas pydantic pytest sckit-learn seaborn tensorflow tqdm uvicorn xgboost yaml

Last synced: 18 Mar 2025

https://github.com/venkat-0706/titanic-survival-prediction

A machine learning project predicting Titanic passenger survival using data preprocessing, feature engineering, and model optimization with Logistic Regression, Random Forest, and XGBoost.

classification-report confusion-matrix gridsearchcv matplotlib numpy onehot-encoder pandas sckit-learn seaborn train-test-split xgboost

Last synced: 04 Apr 2025

https://github.com/ilijamihajlovic/random-forest-classification

This project demonstrates how to build a Random Forest Classifier to predict music genres using audio feature data from Spotify. The model is trained on a curated subset of the spotify_tracks.csv dataset, focusing on popular genres such as pop, country, hip-hop, rock, latin, edm and more.

ai artificial-intelligence machine-learning machine-learning-algorithms machinelearning pandas python random-forest random-forest-classifier sckiit-learn sckit-learn

Last synced: 18 Jun 2025

https://github.com/shankhadweep/diabetes-prediction-systemv3

This project demonstrates a machine learning solution for predicting diabetes based on user-provided health data. The application uses Streamlit for an interactive web interface and advanced interpretability tools like SHAP and permutation importance to explain model predictions.

jupyter-notebook machine-learning matplotlib numpy pandas plotly randomforestclassifier sckit-learn seaborn streamlit-webapp

Last synced: 11 Sep 2025

https://github.com/udacity-machinelearning-internship/regularization

Implementing regularization using scikit-learn

machine-learning regularization sckit-learn

Last synced: 11 Jul 2025

https://github.com/abdiasarsene/lexemotion-an-intelligent-dashboard

LexEmotion is a cutting-edge NLP dashboard designed for legal professionals, law firms, and investigators. It leverages the latest advances in Natural Language Processing to extract emotions, detect key themes, and summarize incident or legal reports — in multiple languages and formats.

fitz googletrans langdetect matplotlib numpy pandas sckit-learn spacy transformer

Last synced: 23 Jun 2025

https://github.com/amr-yasser226/machine-learning-for-network-intrusion-detection

A complete pipeline for network intrusion detection comparing label encoding and one-hot encoding, with SMOTE resampling, feature selection, and ensemble modeling using scikit-learn and XGBoost. This was phase one of our university's "CSAI 253 - Machine Learning" course.

csai-253 cybersecurity cybersecurity-training ensamble-methods feature-engineering imbalanced-learning machine-learning machine-learning-algorithms network-intrusion-detection one-hot-encoding sckit-learn smote tree-based-model xgboost zewailcity

Last synced: 17 Jul 2025

https://github.com/Udacity-MachineLearning-Internship/Regularization

Implementing regularization using scikit-learn

machine-learning regularization sckit-learn

Last synced: 17 Jul 2025

https://github.com/Udacity-MachineLearning-Internship/Feature-Scaling

Applying feature scaling with linear regression in Python

feature-scaling linear-regression machine-learning sckit-learn

Last synced: 17 Jul 2025

https://github.com/dwija12903/ai-lab

A collection of practical implementations from my AI Labs course

keras numpy sckit-learn tensorflow

Last synced: 05 Apr 2025

https://github.com/aranzadata/taxidemandpredictor

Time-series regression model to predict taxi demand at a high-traffic airport, optimizing fleet allocation by incorporating temporal and categorical features using Scikit-learn

forecasting scipy sckit-learn seasonality statsmodels time-series-analysis

Last synced: 29 Mar 2025

https://github.com/debasish-dutta/titanic

This is the basic, go-to, beginner-friendly Titanic dataset project, which predicts whether a passenger survives the Titanic disaster.

data-science sckit-learn

Last synced: 22 Jul 2025

https://github.com/ebadshabbir/k-means-clustering

This repository demonstrates the implementation of the K-Means clustering algorithm to segment mall customers based on their annual income and spending behavior. By identifying distinct customer clusters, businesses can gain insights into customer groups and create targeted marketing strategies to improve customer engagement.

clustering jupyter-notebook k kmeans-clustering machine-learning matplotlib-pyplot pandas python sckit-learn

Last synced: 27 Jun 2025

https://github.com/rkarahul/ok.win-big-small-predictor

Predict the next “Big” or “Small” outcome on the OK.Win lottery-style game using OCR + time-series features + ML.

joblib numpy opencv-python paddleocr paddlepaddle paddlepaddle-gpu pandas python sckit-learn

Last synced: 03 Nov 2025

https://github.com/kuennethgroup/polytoxiq

PolyToxiQ: A WebApp for Polymer Toxicity Prediction using Transfer Learning from Tox21 Additives

autogluon dnn molecule polymer sckit-learn sentence-transformers tox21 toxicity-classification transfer-learning

Last synced: 23 Jul 2025

https://github.com/zanuarts/customer-behaviour

Find customer behaviour with a decision tree.

decision-tree-classifier python sckit-learn

Last synced: 14 Mar 2025

https://github.com/debasish-dutta/bulldozer-price-regression

Contains another of my ML models, built on a Kaggle dataset

data-science sckit-learn

Last synced: 25 Dec 2025

https://github.com/morsalinislamshapon/diabetes-prediction-systemv3

This repository contains a machine learning model that predicts diabetes using user health data. It features an interactive web interface built with Streamlit and provides insights into model predictions through SHAP and permutation importance. 🐙🌟

jupyter-notebook machine-learning matplotlib numpy pandas plotly randomforestclassifier sckit-learn seaborn streamlit-webapp

Last synced: 29 Jul 2025

https://github.com/yasinefeee/parkspotter_pretest_enviroment

The ParkSpotter project is designed to detect the occupancy status of parking spots in a simulation environment. Using a toy model, a camera system, and a machine learning model, this system identifies whether a parking space is EMPTY or NOT EMPTY in real-time.

ai-systems classifier computer-vision cv2 numpy opencv parking-spot-detection parking-spots python sckit-image sckit-learn simulation-environment svm svm-classifier

Last synced: 08 Aug 2025

https://github.com/muhkartal/xai_dashboard

An interactive AI dashboard for machine learning model analysis and explainability. It supports model training, dataset exploration, feature importance analysis, and SHAP-based explanations for both individual predictions and overall model behavior, and lets you compare multiple models, visualize insights, and export results.

joblib numpy pandas python sckit-learn shap streamlit xgboost

Last synced: 07 Oct 2025

https://github.com/devesh8423/machine_learning

Machine Learning practice projects, Jupyter notebooks, and datasets for learning regression, classification, and data analysis.

classification data-analysis data-science data-visualization jupyter-notebook machine-learning matplotlib ml-project numpy-library pandas python regression sckit-learn seaborn

Last synced: 19 Aug 2025

https://github.com/ubeydgur/iris-flower-classifier

Classification of iris flowers according to leaf characteristics.

classification machine-learning matplotlib pandas sckit-learn seaborn sklearn

Last synced: 24 Aug 2025

https://github.com/khaymanii/credit-card-fraud-detection-model

This model was built using Python and the Logistic Regression machine learning algorithm

matplotlib numpy pandas python sckit-learn

Last synced: 14 Mar 2025

https://github.com/pietrapaz/oficina_cd_dados

Files from the Data Science workshop ✅

colab-notebook powerbi python r rlanguage sckit-learn sql

Last synced: 29 Jun 2025

https://github.com/alevp-dev/saber11-analytics

Initial data analysis for an artificial intelligence bootcamp project

knnimputer linear-regression matplotlib pandas python sckit-learn seaborn

Last synced: 30 Jun 2025