scikit-learn
scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.
- GitHub: https://github.com/topics/scikit-learn
- Wikipedia: https://en.wikipedia.org/wiki/Scikit-learn
- Repo: https://github.com/scikit-learn/scikit-learn
- Created by: David Cournapeau
- Released: January 05, 2010
- Related Topics: scikit, python,
- Aliases: sklearn,
- Last updated: 2026-06-23 00:27:46 UTC
- JSON Representation
https://github.com/daniel1kp/diamond-price-predictor
💎Diamond Price Predictor is an user-friendly web application that simplifies the analysis and prediction of diamond prices.
altair pandas python scikit-learn streamlit
Last synced: 13 Apr 2026
https://github.com/andrewquijano/operating_systems_ii
Creating an Intrusion Detection System
ids kdd99 nsl-kdd-dataset scikit-learn
Last synced: 17 Jan 2026
https://github.com/felipeclarindo/energy-predict-api
Api para realizar previsões sobre energia.
api api-development api-rest flask pandas pickle python scikit-learn
Last synced: 13 Apr 2026
https://github.com/mehmoodulhaq570/machine-learning-models
A repository consisting of machine learning models for predicting the future instance. More specifically this repository is a Machine Learning course for those who are interested in learning the basics of machine learning algorithms.
decision-trees gradient-descent gradient-descent-algorithm knn-algorithm linear-regression linear-regression-models logistic-regression-algorithm machine-learning-algorithms machine-learning-models ml naive-bayes-algorithm one-hot-encoding pca python random-forest-classifier scikit-learn svm-model
Last synced: 08 Apr 2025
https://github.com/akhand-pratap-tiwari/sentiment-analysis-on-reviews-using-linear-svc-and-tf-idf
Sentiment Analysis using Linear SVC and TF-IDF.
machine-learning natural-language-processing nltk scikit-learn sentiment-analysis text-classification
Last synced: 28 Apr 2026
https://github.com/machinelearningprodigy/covid-19-detection-system
It is a Flask-based web application that predicts the likelihood of COVID-19 infection based on user symptoms. The app utilizes a K-Nearest Neighbors (KNN) model trained on relevant medical features to assess COVID-19 risk.
algorithms feature-engineering flask machine-learning numpy pandas scikit-learn
Last synced: 20 Apr 2026
https://github.com/aarryasutar/hate_speech_detection
This project aims to detect hate speech on Twitter using advanced NLP and machine learning techniques, exploring feature extraction methods like TF-IDF and sentiment analysis, and evaluating models such as Logistic Regression and SVM.
confusion-matrix doc2vec gensim logistic-regression matplotlib naive-bayes nltk numpy pandas python random-forest scikit-learn seaborn stemming stopwords-removal svm tf-idf-vectorizer tokenization vader word-cloud
Last synced: 09 Apr 2026
https://github.com/flysirin/adstextclassification
Classification of advertisements by topic
docker excel flask pandas python pytorch scikit-learn
Last synced: 02 Jan 2026
https://github.com/dpb24/fake-news-detector
📰 NLP: Fake News Detection using Classical Machine Learning
bag-of-words decision-tree decision-tree-classifier fake-news feature-engineering feature-extraction machine-learning matplotlib natural-language-processing nlp nlp-machine-learning predictive-analytics predictive-modeling scikit-learn text vectorization visual-studio-code xgboost xgboost-classifier xgboost-model
Last synced: 27 Apr 2026
https://github.com/lfenzo/ml-solar-sao-paulo
Implementation of scientific project regarding the use of Machine Learning in Solar Radiation Prediction
forecasting machine-learning python scikit-learn
Last synced: 11 Apr 2026
https://github.com/mayankmittal29/stockvision
Stock price predictor LSTM Sequential Model with Dropout Regularization by which we can analyse any stock tickers, do its fundamental analysis using fundamental ratios and charts visualisations of 100MA and 200MA and can also predict stock price for next 10 days with its trend. Can also view candle stick charts for stock trading and latest news.
keras lstm-neural-networks matplotlib-pyplot mplfinance numpy pandas python scikit-learn streamlit yfinance-api
Last synced: 07 Apr 2026
https://github.com/tansudasli/analytics-sandbox
from Statistical approach to Machine learning
feature-engineering machine-learning matplotlib numpy opencv pandas probability regex scikit-learn seaborn statistics
Last synced: 13 Apr 2026
https://github.com/fadlyhts/klasifikasistuntingbalita
Sistem Klasifikasi Status Stunting menggunakan Machine Learning untuk mendeteksi status gizi balita berdasarkan BB/U, TB/U, dan BB/TB. Dibangun dengan Python, Flask, dan Scikit-learn.
child-health data-science flask healthcare indonesia-healthcare machine-learning medical-informatics python scikit-learn stunting-classification
Last synced: 20 Mar 2025
https://github.com/hokagem/damagedlogginganalyzer
A project about an analyzation of a statistic of damaged logging (wood) in Germany using Python.
analysis csv csv-parser k-fold-cross-validation numpy pandas pandas-dataframe pandas-python polynomial-regression scikit-learn statistics wood
Last synced: 03 May 2026
https://github.com/alisonmitchell/boston-housing
Investigation of the Boston housing dataset to evaluate, train and test a regression model to predict house prices.
data-science machine-learning matplotlib numpy pandas python scikit-learn scipy seaborn
Last synced: 10 Apr 2026
https://github.com/ramadhanabelio/qrcode-generator
Responsive Web for Generate QR Code
flask python python-framework python-library qrcode qrcode-generator scikit-learn tailwindcss web web-app
Last synced: 13 Apr 2026
https://github.com/ate329/ids
An intrusion detection system with a web interface built using Django
ai cyber-security cybersecurity decision-tree django ids intrusion-detection intrusion-detection-system machine-learning network python random-forest random-forest-classifier scapy scikit-learn traffic webapp
Last synced: 17 Mar 2025
https://github.com/mirgis/plucky-playground
A modest collection of machine learning and deep learning algorithms, along with examples implemented in diverse toolkits.
bayes bayesian deep-learning examples ipynb keras machine-learning neural-network pandas playground python3 pytorch scikit-learn sklearn statistics tensorflow
Last synced: 13 Apr 2026
https://github.com/md-emon-hasan/ml-project-car-price-prediction
🚗 End-to-end ML project for predicting car prices based on various features. Includes data preprocessing, model training, and a Flask web for predictions.
car-price-prediction car-price-predictor data-science feature-engineering ml predictive-modeling scikit-learn
Last synced: 10 Mar 2026
https://github.com/srikarveluvali/heart-disease-prediction-ml
This machine learning project aims to predict the presence or absence of heart disease in individuals based on a set of health-related features. By utilizing a dataset containing information about patients, we employ various machine learning techniques and data analysis to build a predictive model.
exploratory-data-analysis machine-learning python scikit-learn
Last synced: 04 May 2026
https://github.com/leabrodyheine/water-pump-status-prediction
This project implements machine learning models to predict the status of water pumps in Tanzania using data from DrivenData's competition. The project includes preprocessing steps, model evaluation using cross-validation, and hyperparameter optimization with Optuna.
argparse cross-validation gradient-boosting-classifier logistic-regression machine-learning multilayer-perceptron numpy optuna pandas random-forest-classifier scikit-learn
Last synced: 11 Apr 2026
https://github.com/devspidr/ml-programs
A collection of foundational machine learning programs covering supervised and unsupervised algorithms, implemented using Python and libraries like scikit-learn, pandas, and matplotlib. Ideal for beginners and students learning core ML concepts through practical coding.
classification machine-learning-algorithms regression scikit-learn supervised-learning unsupervised-learning
Last synced: 30 May 2026
https://github.com/yancotta/anti-aging-epigenetics-ml-app
A thesis MVP for a personalized anti-aging system that analyzes genetic SNPs and lifestyle habits using ML models (Random Forest and Neural Networks) to provide risk assessments and actionable recommendations. Built with FastAPI, React, PostgreSQL, and containerized via Docker for scalability and explainability.
anti-aging bioinformatics docker explainable-ai fastapi genetics healthtech machine-learning mlops personalized-medicine pytorch reactjs scikit-learn synthetic-data thesis-project
Last synced: 16 Sep 2025
https://github.com/alam025/ai-email-guardian
🛡️ AI-Powered Email Guardian: 99.2% accurate spam detection using machine learning. Open-source, privacy-focused email security. ⚡ 50ms detection time.
artificial-intelligence email-filter email-security hishing-detection machine-learning-cybersecurity nlp open-source privacy python scikit-learn security-tools spam-detection tensorflow text-classification
Last synced: 10 Mar 2026
https://github.com/aarryasutar/credit_eda
This project focuses on cleaning and analyzing a loan application dataset to gain insights into the factors influencing loan defaults. Through systematic data cleaning, visualization, and merging with previous application data, it provides a robust foundation for further predictive modeling.
binning boxplot correlation-matrix data-cleaning data-splitting dataframe feature-engineering heatmap jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn
Last synced: 13 Apr 2026
https://github.com/kaleharshavardhan07/spam_mail-_detector_ai_model
This project implements a spam detection system for SMS messages using machine learning techniques.
mathplotlib nltk numpy panda python scikit-learn seaborn
Last synced: 11 Apr 2026
https://github.com/djleamen/music-mood-matcher-demo
An intelligent music recommendation system that analyzes your mood through natural language and suggests personalized playlists using advanced AI and machine learning techniques.
demo machine-learning matplotlib ml natural-language-processing nlp nltk nltk-python numpy openai pandas python scikit-learn seaborn
Last synced: 11 Apr 2026
https://github.com/uea-geral/rna-perceptron-exercise
🤖Disciplina de RNA: treinamento de um neurônio Perceptron.
jupyter-notebook neural-network numpy perceptron python scikit-learn
Last synced: 13 Apr 2026
https://github.com/troublem1/mle
MultiLabel-Transformer(MLE) is an extended version of a LabelEncoder, such that, it encodes multiple categorical columns to numeric in any workflow or pipeline
packages python3 scikit-learn sklearn
Last synced: 05 Jan 2026
https://github.com/michael-insights/portfolio
This repository showcases my projects and skills in Data Analytics, Data Science, and Machine Learning. It includes hands-on work in data analysis, predictive modeling, and machine learning algorithms, aimed at solving real-world problems.
data-analytics data-science data-visualization datapreprocessing jupyter-notebooks machine-learning matplotlib numpy pandas predictive-modeling python scikit-learn sql
Last synced: 11 Apr 2026
https://github.com/rayyan9477/data-driven-house-price-prediction-and-property-recommendation-app
The app leverages algorithms to accurately predict house prices and recommend similar properties based on a saved dataset through content-based filtering. It is tailored for homebuyers seeking their dream house and real estate investors looking for profitable opportunities, providing powerful insights and data-driven decision-making support.
data-science eda html machine-learning numpy pandas python scikit-learn
Last synced: 06 Apr 2026
https://github.com/ondrejhruby/fashion-mnist
Machine Learning Project: Fashion MNIST Classification using Random Forest and PCA for dimensionality reduction.
classification computer-vision data-processing dimensionality-reduction fashion-mnist feature-extraction image-classification machine-learning machine-learning-algorithms pca pca-analysis python random-forest scikit-learn
Last synced: 03 Jan 2026
https://github.com/vidhi1290/zomato-data-analysis
Zomato Data Analysis - Explore the world of Zomato restaurant data through Python and data analysis. Uncover trends and insights using Pandas for data manipulation and Matplotlib for visualization. Join us in this journey to reveal the hidden stories within the data!
data-analysis data-analysis-python data-science data-visualization dataprocessing machine-learning machine-learning-algorithms matplotlib numpy pandas python scikit-learn zomato-data-analysis
Last synced: 11 Apr 2026
https://github.com/palak-463/tablataalrecognitionsystem
Software built using Python which makes use of CNN and FNN to detect the Taals of the Tabla, an Indian classical music instrument. 🎛️
cnn deep-learning flask fnn librosa numpy os pickle python scikit-learn
Last synced: 11 Apr 2026
https://github.com/colinwu0403/heartbpmusic
Music discovery platform that recommends you a song based on your heart's BPM and your mood using Machine Learning.
django neurokit2 scikit-learn spotify-web-api vuejs
Last synced: 05 May 2026
https://github.com/singhkunwardeep/twitter_sentiment_analysis
A machine learning project to classify Twitter sentiment into positive, negative, categories using Logistic Regression and TF-IDF Vectorization. This project involves data preprocessing, feature extraction, model training, and evaluation of the sentiment of tweets. Built with Python, NLTK, and Scikit-learn.
logistic-regression nltk-python pandas-dataframe python3 scikit-learn tfidf-vectorizer
Last synced: 05 May 2026
https://github.com/rahimizadeh/prediction-api-with-flask-and-mlflow
An end-to-end machine learning project demonstrating model lifecycle management with MLflow and production deployment using Flask.
flask machine-learning mlflow mlops-workflow python random-forest-regression rest-api scikit-learn
Last synced: 13 Apr 2026
https://github.com/evanmarshall-dev/evanmarshall-tech
Professional IT services platform featuring serverless AWS infrastructure, ML-powered service recommendations, and automated CI/CD deployment. Built to showcase full-stack development, cloud architecture, and machine learning engineering skills.
api-gateway aws ci-cd cloud-computing cloudfront devops full-stack github-actions infrastructure-as-code lambda machine-learning mlops nextjs portfolio python react s3 scikit-learn serverless terraform
Last synced: 13 Apr 2026
https://github.com/razamehar/naples-diaper-market-geo-analytics-for-potential-estimation
Analyzing Fater company's diaper market potential and enhancing revenue estimation for Naples stores: A Socio-Demographic, Territorial, and Points of Interest Perspective
contingency-table decision-trees diaper-market ensemble-machine-learning feature-importance geo-analytics gridsearchcv hyperparameter-tuning kruskal-wallis-test mann-whitney-u-test market-analysis pareto-analysis python random-forest revenue-estimation scikit-learn
Last synced: 20 May 2026
https://github.com/snikumbh/seqarchr
seqArchR: Identifying (promoter) sequence architectures de novo using NMF
clustering nmf nonnegative-matrix-factorization promoter-sequence-architectures r r-package scikit-learn sequence-analysis sequence-architectures unsupervised-machine-learning
Last synced: 19 Jun 2025
https://github.com/trainingbypackt/machine-learning-fundamentals-elearning
Use Python and scikit-learn to get up and running with the hottest developments in AI
artificial-intelligence clustering decision-tree machine-learning neural-network python scikit-learn supervised-learning unsupervised-learning
Last synced: 10 Apr 2026
https://github.com/stitchsages/implyo
An advanced imputation library compatible with mixed type data with a focus on performance and high accuracy, with advanced imputation algorithms for numeric and categorical variables.
imputation imputation-algorithm imputation-methods knn machine-learning pandas pandas-dataframe pip python python3 random-forest scikit-learn
Last synced: 23 Jun 2026
https://github.com/srilaasya/handwriting-recognition-using-k-means
Used K-means clustering and scikit-learn to cluster images of handwritten digits.
handwriting-recognition k-means python scikit-learn
Last synced: 13 Apr 2026
https://github.com/mnitin-reddy/reducing-review-overhead-with-ml-based-application-screening
A machine learning classification project to filter out low-probability visa applications using historical data. It features an end-to-end implementation with CI/CD on AWS, achieving 93% accuracy with a KNN model optimized through Optuna, alongside integration of MLOps tools like Evidently and MLflow.
aws docker githubactions hypothesistesting machinelearning matplotlib mlflow mlops mongodb numpy optuna pandas python scikit-learn seaborn
Last synced: 10 Apr 2026
https://github.com/nafisalawalidris/logistic-regression-model-for-breast-cancer-recurrence-prediction
Predicting Breast Cancer Recurrence - A logistic regression model using patient attributes to classify recurrence risk. Dataset analysis and model evaluation. Contributions welcome.
breast-cancer classification-model data-analysis data-science healthcare logistic-regression machine-learning python recurrence-prediction scikit-learn
Last synced: 17 May 2026
https://github.com/witch-world/iris-flower-classification-decision-tree-classifier
ai algorithms artificial-intelligence decision-tree decision-tree-classifier machine-learning machine-learning-algorithms machinelearning-python matplotlib numpy numpy-arrays pandas pandas-dataframe pandas-python python python3 scikit-learn
Last synced: 06 Apr 2026
https://github.com/sivatsk26/university-admit-eligibility-predictor
This project is created using Machine Learning and Regression methods- a statistical technique to predict the outcome of event which is to verify the users’ admission eligibility level, considering the universities they have chosen. This is achieved based on the algorithms implemented, when is user feed the application with the required information
html-css-javascript ibm-cloud ibm-watson linear-regression machine-learning matplotlib numpy pandas python python-flask random-forest scikit-learn
Last synced: 13 Apr 2026
https://github.com/rohitpawar001/bone_marrow_surival_prediction
Bone marrow transplants can be life-saving, but predicting patient survival is complex. In this project, I used machine learning to analyze key medical factors and improve survival predictions. I also implemented CI/CD pipelines, used MLflow for model tracking, and deployed the model on an AWS EC2 instance.
aws docker ec2-instance flask machine-learning mlflow python scikit-learn
Last synced: 08 Apr 2026
https://github.com/ahasannn/real-time-expression-detector
A Real Time Expression Detector using Python and Machine Learning
csv expression-detection-from-cam gradient-boosting logistic-regression machine-learning-algorithms mediapipe numpy opencv pandas python random-forest ridge-classifier scikit-learn
Last synced: 07 Apr 2026
https://github.com/samarthmule/chatbot
This project implements a generic chatbot using Natural Language Processing (NLP) and Machine Learning techniques. The chatbot is designed to classify user input into predefined intents and provide context-aware responses. The solution is scalable, interactive, and suitable for various domains.
chatbot internship machine-learning machine-learning-algorithms nlp nltk project-repository python python3 scikit-learn streamlit
Last synced: 13 Apr 2026
https://github.com/moustafamohamed01/moustafamohamed01
Software Engineering Student | Specializing in Machine Learning, Deep Learning, and Generative AI
artificial-intelligence artificial-neural-networks config convolutional-neural-networks data-analysis data-science data-visualization deep-learning keras machine-learning matplotlib natural-language-processing numpy pandas python pytorch recurrent-neural-networks scikit-learn seaborn tensorflow
Last synced: 08 Apr 2026
https://github.com/gokulgowthams/smart-premium
An Interactive Premium Amount Detection for user which accurately predicts the required premium amount for a default loan by using series of questions that satisfies the criteria in Streamlit Application
data-preprocessing feature-engineering git github mlflow model-deployment numpy pandas python scikit-learn streamlit xgboost
Last synced: 11 Apr 2026
https://github.com/strcoder4007/machine-learning-deep-learning-practice
Implementation of Linear/Logistic Reg, K-NN, SVM, Clustering, K-Means, ConvNet, ResNet, MobileNet, RNN, LSTM etc. using Pandas, SciKitLearn, NumPy & TensorFlow 2
convolutional-neural-networks matplotlib scikit-learn tensorflow2
Last synced: 15 May 2026
https://github.com/imswappy/brain-tumor-detection
🧠 Deep learning project for brain tumor classification using MRI images. Built with transfer learning (VGG16 + fine-tuning), TensorFlow/Keras, and deployed via Streamlit. Dataset & model loaded dynamically from KaggleHub. Includes training notebook, evaluation, and interactive web app.
kagglehub keras numpy pandas scikit-learn streamlit tensorflow vgg16-model
Last synced: 13 Apr 2026
https://github.com/supriya811106/healthcare-recommedation-system
A Flask-based web app that predicts diseases based on symptoms and recommends specialized doctors. It uses machine learning for accurate health predictions and location-based doctor searches.
css flask-application healthcare-application html javascript machine-learning numpy pandas recommendation-system scikit-learn
Last synced: 04 Mar 2026
https://github.com/ayushshahh/fespn
A neural network made to predict final exam scores of students
mlp mlp-regressor multilayer-perceptron neural-network prediction-model scikit-learn
Last synced: 02 May 2026
https://github.com/takkii/pylean
Data analysis ( 🐍 💎 📈 )
analayze matplotlib numpy pandas python scikit-learn
Last synced: 09 Sep 2025
https://github.com/takkii/rock_ptarmigan
[PyPi] Approximate value evaluation 📊
approximate keras pca rock-ptarmigan scikit-learn tensorflow
Last synced: 10 May 2026
https://github.com/aswinbarath/ml-classifier
A Machine Learning model which performs classification
classification image-classification jupyter-notebook machine-learning matplotlib-pyplot python3 scikit-learn
Last synced: 13 Apr 2026
https://github.com/edisedis777/pyspark-ml-features
A PySpark implementation of 6 lesser-known Scikit-Learn features optimized for Azure Databricks. This project translates powerful machine learning techniques from Scikit-Learn into PySpark's distributed computing framework.
azure databricks databricks-notebooks large-scale machine-learning pyspark python scikit-learn scikitlearn-machine-learning
Last synced: 13 Apr 2026
https://github.com/nordszamora/sentiment
This ML project is used to predict the text sentiment positive, negative or neutral. Its used a Reddit dataset in kaggle to train the model and make a prediction on new/unseen data.
django joblib kaggle naive-bayes-classifier pandas python redditdata scikit-learn
Last synced: 11 Apr 2026
https://github.com/thananjaya/admission_chance_prediction
Admission Chance Prediction using linear regression, wrapped up using Flask framework
flask linear-regression machine-learning python3 scikit-learn
Last synced: 17 Apr 2026
https://github.com/matsunagalab/tutorial_analyzingmddata
Google colab notebooks for typical MD trajectory analysis routines with Python
mdtraj molecular-dynamics scikit-learn tutorial
Last synced: 20 Apr 2026
https://github.com/anumit-web/ml-masterlist-2024
ML Masterlist, Machine Learning, 2024
image-segmentation kmeans-clustering machine-learning matplotlib numpy pandas reinforcement-learning scikit-learn supervised-learning unsupervised-learning
Last synced: 19 Jan 2026
https://github.com/tharindanimnajith/deep-learning-spam-detection
Deep Learning classifiers to detect spam SMS messages - LSTM Model, DenseNet CNN Models - NLP, Python, Jupyter Notebook, Tensorflow, Keras, Numpy, Pandas, Matplotlib, Scikit-Learn
deep-learning densenet keras lstm nlp python3 scikit-learn tensorflow
Last synced: 05 Apr 2026
https://github.com/ki3mono/naive_bayes_classifier
This project implements Naive Bayes Classifiers for two data types: Multinomial Naive Bayes Classifier and Gaussian Naive Bayes Classifier. Developed as part of the Probability Theory and Statistics course in the second year of Computer Science at AGH University of Krakow.
iris-dataset mushroom-dataset naive-bayes-classifier numpy python scikit-learn
Last synced: 06 May 2026
https://github.com/Zen204/airbnb-availability
A machine learning model that predicts Airbnb listing availability, utilizing feature engineering and supervised learning techniques to improve guest experience and optimize host management.
binary-classification data-analysis data-preprocessing data-visualization feature-engineering machine-learning matplotlib model-evaluation nlp pandas predictive-modeling python scikit-learn seaborn supervised-learning
Last synced: 02 Apr 2025
https://github.com/aryansingla45/cab-fare-prediction
This repository contains a Cab Fare Prediction Web App that uses machine learning to predict fares based on distance, time, and location, with Google Maps API integration for route and fare calculations. It features a user-friendly interface and secure authentication, powered by a Kaggle dataset.
flask-application google-maps-api html-css-javascript machine-learning machine-learning-algorithms pandas-numpy python scikit-learn weather-api
Last synced: 11 May 2026
https://github.com/hopquangdo/minisklearn
✨ A lightweight Python-from-scratch implementation of core Machine Learning algorithms inspired by scikit-learn.
Last synced: 09 Jul 2025
https://github.com/lordmitrii/win-prediction-django
A web application on Django framework. It predicts a winning team based on given sets of dota2 heroes.
django dota2 jupyter-notebook machine-learning python scikit-learn web
Last synced: 13 Apr 2026
https://github.com/abdullah321umar/internee.pk-dataanalytics_internship-assignment4
🌟 Fraud Detection in Application 🌟 Through Isolation Forest and K-Means Clustering, the project detects suspicious patterns like inconsistent income, duplicate entries, and unrealistic employment data. This end-to-end workflow transforms raw data into actionable fraud insights — enhancing trust and accuracy.
anomaly-detection csv-handling data-cleaning data-exporting data-import data-normalization exploratory-data-analysis export interpretation matplotlib model-evaluation pandas pca python reporting scaling scikit-learn seaborn
Last synced: 06 May 2026
https://github.com/0xsolanaceae/neurolog
ML-Powered Log Analysis
isolation-forest machine-learning scikit-learn
Last synced: 24 Apr 2026
https://github.com/rusiruchapana/blood-group-prediction
Creating a machine learning project to predict blood groups from fingerprint patterns
asp-net-web-api keras matplotlib numpy opencv-python pandas pillow scikit-learn tensorflow
Last synced: 11 Apr 2026
https://github.com/rs2416/Detecting_Social_Anxiety
This repository contains the full dataset and code needed to recreate the classification models and reproduce the results within this paper: https://formative.jmir.org/2021/10/e32656/
jupyter-notebook machine-learning python scikit-learn social-anxiety
Last synced: 07 May 2025
https://github.com/finite-sample/stagecoachml
Build two-stage models when your features arrive in two batches at different times.
machine-learning scikit-learn two-stage-models
Last synced: 14 Jan 2026
https://github.com/adrien-1997/bike-forecast-paris-velib
Bike-sharing demand forecasting in Paris (Vélib’). A data science and machine learning project leveraging open urban mobility data to predict bike availability, analyze time series usage patterns, and provide interactive dashboards for visualization.
bike-sharing dashboard data-science duckdb forecasting machine-learning matplotlib open-data pandas paris predictive-modeling python scikit-learn streamlit transportation urban-mobility velib
Last synced: 11 Apr 2026
https://github.com/wkrzywiec/polish-crops
Machine learning project to find out if price of a wheat in Poland depends on a weather.
decision-tree-regression eurostat imgw jupyter-notebook linear-regression machine-learning matplotlib-pyplot numpy pandas pipenv poland polynomial-regression python random-tree regression-models scikit-learn support-vector-regression
Last synced: 11 Apr 2026
https://github.com/andi611/libsvm-classification
Performing classification tasks with the LibSVM toolkit on four different datasets: Iris, News, Abalone, and Income.
abalone abalone-dataset classification classification-algorithm data-mining income income-dataset iris iris-dataset libsvm libsvm-ready news-dataset newsgroups-dataset scikit-learn svm svm-classifier svm-training
Last synced: 30 Aug 2025
https://github.com/uhstray-io/pyrizon
Data Collection, Analysis, Mapping, Pipelining & Transformation, & API using Python
api data-engineering etl numpy pandas plotly python pytorch raw-data scikit-learn seaborne sql sqlite tensorflow
Last synced: 09 Apr 2026
https://github.com/varun-khorgade/churnshield-customer-retention-predictor
Built an ML-based classification model to predict customer churn. Applied data preprocessing, feature engineering, and ensemble algorithms to improve prediction accuracy and help businesses implement retention strategies.
classification-algorithm datapreprocessing f1-score feature-engineering hyperparameter-tuning logistic-regression matplotlib model-evaluation numpy pandas python ran roc-auc scikit-learn seaborn xgboost
Last synced: 07 May 2026
https://github.com/elifirinci/mushrooms-plants-classification
This project features AI models for identifying mushrooms and plants as poisonous or edible using image-based predictions. Both models are tested through an interactive Gradio interface, ensuring user-friendly and accurate identification for foragers and researchers.
classification cnn cnn-classification gradio image-classification machine-learning mushroom-classification plant-classification scikit-learn
Last synced: 17 May 2026
https://github.com/dinhanhx/determination
Scripts to set global random for some machine learning framework
determination deterministic keras pytorch randomness scikit-learn tensorflow2
Last synced: 12 Apr 2026
https://github.com/rakibhhridoy/easywaydiveinto-datascience
Data Science is not as easy as it seems at first. The most problem faced by new learner are lack of resource knowledge as well as confusion in using the various resources. I hope this repository will benefit confusion learner.
algorithms algorithms-implemented bayesian-statistics data-science deep-learning deep-neural-networks linear-algebra machine-learning matplotlib multivariate-calculus numpy optimization pandas python scikit-learn scipy seaborn statistics statsmodels tensorflow
Last synced: 06 Apr 2026
https://github.com/ayisha-mohammed/freecodecamp_data_analysis_projects
freeCodeCamp-Data_Analysis projects using python
matplotlib-pyplot numpy python scikit-learn seaborn-plots
Last synced: 07 May 2026
https://github.com/fohlen/stats-experiment
A tiny stats experiment with GENESIS data
matplotlib python3 scikit-learn
Last synced: 17 May 2026
https://github.com/somjit101/human-activity-recognition
This project is to build a model that predicts the human activities such as Walking, Walking Upstairs, Walking Downstairs, Sitting, Standing or Laying using readings from the sensors on a smartphone carried by the user.
decision-tree-classifier eda feature-engineering gradient-boosting-classifier grid-search human-activity-recognition keras logistic-regression lstm random-forest-classifier rbf-kernel scikit-learn seaborn-plots signal-processing support-vector-classifier support-vector-machine t-sne tensorflow uci-har-dataset uci-machine-learning
Last synced: 23 Feb 2026
https://github.com/udityamerit/curafind-powered-by-ai
CuraFind AI is a web-based application leveraging Natural Language Processing (NLP) to intelligently recommend medicines. Users can search using symptoms, medicine names, or free-text descriptions, and receive suggestions along with brand substitutes for drugs
ai machine-learning nlp numpy pandas scikit-learn
Last synced: 18 Sep 2025
https://github.com/evangks/hierarchical-clustering-mall-customers
A comprehensive machine learning project demonstrating hierarchical clustering for customer segmentation on the Mall Customers dataset. Includes EDA, preprocessing, multiple linkage/distance comparisons, and professional visualizations.
clustering data-science hierarchical-clustering jupyter-notebook machine-learning mall-customers portfolio-project python scikit-learn unsupervised-learning
Last synced: 07 Mar 2026
https://github.com/soroush-04/incrementalsvm-road-accident-prediction
Enhance SVM and incremental SVM machine learning models for road accident severity prediction
incremental-learning machine-learning python scikit-learn svm
Last synced: 09 Apr 2026
https://github.com/tasninanika/heart-disease-analysis
The Heart Disease Analysis project is a comprehensive machine learning study aimed at predicting the presence of heart disease using the Heart Disease UCI Dataset.
knn logistic-regression matplotlib numpy pandas python3 random-forest scikit-learn seaborn
Last synced: 09 Apr 2026
https://github.com/farrajota/kaggle_titanic
My solutions to the "Titanic: Machine Learning from Disaster" kaggle competition
docker docker-compose kaggle kaggle-competition kaggle-titanic notebook pyspark python scikit-learn
Last synced: 11 Apr 2026
https://github.com/skekre98/picture-compressor
A tool for compressing images using unsupervised machine learning
kmeans-clustering scikit-learn
Last synced: 17 May 2026
https://github.com/weisscharlesj/SciCompChem_Experimental
Scientific Computing for Chemists Jupyter Book
book chemistry jupyter jupyter-lab matplot nmrglue numpy pandas python scikit-image scikit-learn scipy seaborn textbook
Last synced: 07 Nov 2025
https://github.com/psavarmattas/stroke-prediction-ml-model
This is just a theoretical Machine Learning Model that will analyze the data and determine where the stroke can occur.
classification decision-tree-classifier knearest-neighbor-classifier logistic-regression machine-learning multi-layer-perceptron python random-forest scikit-learn sklearn stacking-classifier support-vector-classifier
Last synced: 09 May 2026
https://github.com/1ayanabil1/kaggle-competition
An overview of the competitions and links to their respective folders for detailed information and code
kaggle kaggle-competition kaggle-dataset kaggle-house-prices kaggle-solution kaggle-titanic machine-learning machine-learning-algorithms machinelearning python python-lambda python-script python3 pytorch scikit-learn tensorflow
Last synced: 10 Apr 2026
https://github.com/ksatrajit0/heart-disease-prediction-ml
Predicts the risk of heart attack in a patient using their medical record
heart-disease-prediction machine-learning matplotlib numpy pandas scikit-learn seaborn
Last synced: 19 Apr 2026