scikit-learn
scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.
- GitHub: https://github.com/topics/scikit-learn
- Wikipedia: https://en.wikipedia.org/wiki/Scikit-learn
- Repo: https://github.com/scikit-learn/scikit-learn
- Created by: David Cournapeau
- Released: January 05, 2010
- Related Topics: scikit, python,
- Aliases: sklearn,
- Last updated: 2026-06-25 00:23:58 UTC
- JSON Representation
https://github.com/alexsomai/machine-learning-getting-started
Dummy examples and experiments to get started with Machine Learning
artificial-intelligence deep-learning machine-learning python scikit-learn
Last synced: 13 Apr 2026
https://github.com/anudeepjonnada/phishshield-ai
🛡️ PhishShield AI – An intelligent phishing email detector that uses BERT and Machine Learning to identify phishing attempts in real time. Integrated with the Gmail API, powered by Flask, React, and MongoDB for secure full-stack email analysis and threat detection.
bert flask gmail-api mongodb oauth2 python react scikit-learn
Last synced: 13 Apr 2026
https://github.com/felipeclarindo/energy-predict-api
Api para realizar previsões sobre energia.
api api-development api-rest flask pandas pickle python scikit-learn
Last synced: 13 Apr 2026
https://github.com/flysirin/adstextclassification
Classification of advertisements by topic
docker excel flask pandas python pytorch scikit-learn
Last synced: 02 Jan 2026
https://github.com/dpb24/fake-news-detector
📰 NLP: Fake News Detection using Classical Machine Learning
bag-of-words decision-tree decision-tree-classifier fake-news feature-engineering feature-extraction machine-learning matplotlib natural-language-processing nlp nlp-machine-learning predictive-analytics predictive-modeling scikit-learn text vectorization visual-studio-code xgboost xgboost-classifier xgboost-model
Last synced: 27 Apr 2026
https://github.com/tansudasli/analytics-sandbox
from Statistical approach to Machine learning
feature-engineering machine-learning matplotlib numpy opencv pandas probability regex scikit-learn seaborn statistics
Last synced: 13 Apr 2026
https://github.com/ramadhanabelio/qrcode-generator
Responsive Web for Generate QR Code
flask python python-framework python-library qrcode qrcode-generator scikit-learn tailwindcss web web-app
Last synced: 13 Apr 2026
https://github.com/mirgis/plucky-playground
A modest collection of machine learning and deep learning algorithms, along with examples implemented in diverse toolkits.
bayes bayesian deep-learning examples ipynb keras machine-learning neural-network pandas playground python3 pytorch scikit-learn sklearn statistics tensorflow
Last synced: 13 Apr 2026
https://github.com/srikarveluvali/heart-disease-prediction-ml
This machine learning project aims to predict the presence or absence of heart disease in individuals based on a set of health-related features. By utilizing a dataset containing information about patients, we employ various machine learning techniques and data analysis to build a predictive model.
exploratory-data-analysis machine-learning python scikit-learn
Last synced: 04 May 2026
https://github.com/leabrodyheine/water-pump-status-prediction
This project implements machine learning models to predict the status of water pumps in Tanzania using data from DrivenData's competition. The project includes preprocessing steps, model evaluation using cross-validation, and hyperparameter optimization with Optuna.
argparse cross-validation gradient-boosting-classifier logistic-regression machine-learning multilayer-perceptron numpy optuna pandas random-forest-classifier scikit-learn
Last synced: 11 Apr 2026
https://github.com/devspidr/ml-programs
A collection of foundational machine learning programs covering supervised and unsupervised algorithms, implemented using Python and libraries like scikit-learn, pandas, and matplotlib. Ideal for beginners and students learning core ML concepts through practical coding.
classification machine-learning-algorithms regression scikit-learn supervised-learning unsupervised-learning
Last synced: 30 May 2026
https://github.com/yancotta/anti-aging-epigenetics-ml-app
A thesis MVP for a personalized anti-aging system that analyzes genetic SNPs and lifestyle habits using ML models (Random Forest and Neural Networks) to provide risk assessments and actionable recommendations. Built with FastAPI, React, PostgreSQL, and containerized via Docker for scalability and explainability.
anti-aging bioinformatics docker explainable-ai fastapi genetics healthtech machine-learning mlops personalized-medicine pytorch reactjs scikit-learn synthetic-data thesis-project
Last synced: 16 Sep 2025
https://github.com/aarryasutar/credit_eda
This project focuses on cleaning and analyzing a loan application dataset to gain insights into the factors influencing loan defaults. Through systematic data cleaning, visualization, and merging with previous application data, it provides a robust foundation for further predictive modeling.
binning boxplot correlation-matrix data-cleaning data-splitting dataframe feature-engineering heatmap jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn
Last synced: 13 Apr 2026
https://github.com/uea-geral/rna-perceptron-exercise
🤖Disciplina de RNA: treinamento de um neurônio Perceptron.
jupyter-notebook neural-network numpy perceptron python scikit-learn
Last synced: 13 Apr 2026
https://github.com/ondrejhruby/fashion-mnist
Machine Learning Project: Fashion MNIST Classification using Random Forest and PCA for dimensionality reduction.
classification computer-vision data-processing dimensionality-reduction fashion-mnist feature-extraction image-classification machine-learning machine-learning-algorithms pca pca-analysis python random-forest scikit-learn
Last synced: 03 Jan 2026
https://github.com/colinwu0403/heartbpmusic
Music discovery platform that recommends you a song based on your heart's BPM and your mood using Machine Learning.
django neurokit2 scikit-learn spotify-web-api vuejs
Last synced: 05 May 2026
https://github.com/singhkunwardeep/twitter_sentiment_analysis
A machine learning project to classify Twitter sentiment into positive, negative, categories using Logistic Regression and TF-IDF Vectorization. This project involves data preprocessing, feature extraction, model training, and evaluation of the sentiment of tweets. Built with Python, NLTK, and Scikit-learn.
logistic-regression nltk-python pandas-dataframe python3 scikit-learn tfidf-vectorizer
Last synced: 05 May 2026
https://github.com/rahimizadeh/prediction-api-with-flask-and-mlflow
An end-to-end machine learning project demonstrating model lifecycle management with MLflow and production deployment using Flask.
flask machine-learning mlflow mlops-workflow python random-forest-regression rest-api scikit-learn
Last synced: 13 Apr 2026
https://github.com/snikumbh/seqarchr
seqArchR: Identifying (promoter) sequence architectures de novo using NMF
clustering nmf nonnegative-matrix-factorization promoter-sequence-architectures r r-package scikit-learn sequence-analysis sequence-architectures unsupervised-machine-learning
Last synced: 19 Jun 2025
https://github.com/evanmarshall-dev/evanmarshall-tech
Professional IT services platform featuring serverless AWS infrastructure, ML-powered service recommendations, and automated CI/CD deployment. Built to showcase full-stack development, cloud architecture, and machine learning engineering skills.
api-gateway aws ci-cd cloud-computing cloudfront devops full-stack github-actions infrastructure-as-code lambda machine-learning mlops nextjs portfolio python react s3 scikit-learn serverless terraform
Last synced: 13 Apr 2026
https://github.com/razamehar/naples-diaper-market-geo-analytics-for-potential-estimation
Analyzing Fater company's diaper market potential and enhancing revenue estimation for Naples stores: A Socio-Demographic, Territorial, and Points of Interest Perspective
contingency-table decision-trees diaper-market ensemble-machine-learning feature-importance geo-analytics gridsearchcv hyperparameter-tuning kruskal-wallis-test mann-whitney-u-test market-analysis pareto-analysis python random-forest revenue-estimation scikit-learn
Last synced: 20 May 2026
https://github.com/ahasannn/real-time-expression-detector
A Real Time Expression Detector using Python and Machine Learning
csv expression-detection-from-cam gradient-boosting logistic-regression machine-learning-algorithms mediapipe numpy opencv pandas python random-forest ridge-classifier scikit-learn
Last synced: 07 Apr 2026
https://github.com/srilaasya/handwriting-recognition-using-k-means
Used K-means clustering and scikit-learn to cluster images of handwritten digits.
handwriting-recognition k-means python scikit-learn
Last synced: 13 Apr 2026
https://github.com/sivatsk26/university-admit-eligibility-predictor
This project is created using Machine Learning and Regression methods- a statistical technique to predict the outcome of event which is to verify the users’ admission eligibility level, considering the universities they have chosen. This is achieved based on the algorithms implemented, when is user feed the application with the required information
html-css-javascript ibm-cloud ibm-watson linear-regression machine-learning matplotlib numpy pandas python python-flask random-forest scikit-learn
Last synced: 13 Apr 2026
https://github.com/rohitpawar001/bone_marrow_surival_prediction
Bone marrow transplants can be life-saving, but predicting patient survival is complex. In this project, I used machine learning to analyze key medical factors and improve survival predictions. I also implemented CI/CD pipelines, used MLflow for model tracking, and deployed the model on an AWS EC2 instance.
aws docker ec2-instance flask machine-learning mlflow python scikit-learn
Last synced: 08 Apr 2026
https://github.com/gokulgowthams/smart-premium
An Interactive Premium Amount Detection for user which accurately predicts the required premium amount for a default loan by using series of questions that satisfies the criteria in Streamlit Application
data-preprocessing feature-engineering git github mlflow model-deployment numpy pandas python scikit-learn streamlit xgboost
Last synced: 11 Apr 2026
https://github.com/samarthmule/chatbot
This project implements a generic chatbot using Natural Language Processing (NLP) and Machine Learning techniques. The chatbot is designed to classify user input into predefined intents and provide context-aware responses. The solution is scalable, interactive, and suitable for various domains.
chatbot internship machine-learning machine-learning-algorithms nlp nltk project-repository python python3 scikit-learn streamlit
Last synced: 13 Apr 2026
https://github.com/moustafamohamed01/moustafamohamed01
Software Engineering Student | Specializing in Machine Learning, Deep Learning, and Generative AI
artificial-intelligence artificial-neural-networks config convolutional-neural-networks data-analysis data-science data-visualization deep-learning keras machine-learning matplotlib natural-language-processing numpy pandas python pytorch recurrent-neural-networks scikit-learn seaborn tensorflow
Last synced: 08 Apr 2026
https://github.com/strcoder4007/machine-learning-deep-learning-practice
Implementation of Linear/Logistic Reg, K-NN, SVM, Clustering, K-Means, ConvNet, ResNet, MobileNet, RNN, LSTM etc. using Pandas, SciKitLearn, NumPy & TensorFlow 2
convolutional-neural-networks matplotlib scikit-learn tensorflow2
Last synced: 15 May 2026
https://github.com/imswappy/brain-tumor-detection
🧠 Deep learning project for brain tumor classification using MRI images. Built with transfer learning (VGG16 + fine-tuning), TensorFlow/Keras, and deployed via Streamlit. Dataset & model loaded dynamically from KaggleHub. Includes training notebook, evaluation, and interactive web app.
kagglehub keras numpy pandas scikit-learn streamlit tensorflow vgg16-model
Last synced: 13 Apr 2026
https://github.com/supriya811106/healthcare-recommedation-system
A Flask-based web app that predicts diseases based on symptoms and recommends specialized doctors. It uses machine learning for accurate health predictions and location-based doctor searches.
css flask-application healthcare-application html javascript machine-learning numpy pandas recommendation-system scikit-learn
Last synced: 04 Mar 2026
https://github.com/takkii/pylean
Data analysis ( 🐍 💎 📈 )
analayze matplotlib numpy pandas python scikit-learn
Last synced: 09 Sep 2025
https://github.com/aswinbarath/ml-classifier
A Machine Learning model which performs classification
classification image-classification jupyter-notebook machine-learning matplotlib-pyplot python3 scikit-learn
Last synced: 13 Apr 2026
https://github.com/edisedis777/pyspark-ml-features
A PySpark implementation of 6 lesser-known Scikit-Learn features optimized for Azure Databricks. This project translates powerful machine learning techniques from Scikit-Learn into PySpark's distributed computing framework.
azure databricks databricks-notebooks large-scale machine-learning pyspark python scikit-learn scikitlearn-machine-learning
Last synced: 13 Apr 2026
https://github.com/lordmitrii/win-prediction-django
A web application on Django framework. It predicts a winning team based on given sets of dota2 heroes.
django dota2 jupyter-notebook machine-learning python scikit-learn web
Last synced: 13 Apr 2026
https://github.com/abdullah321umar/internee.pk-dataanalytics_internship-assignment4
🌟 Fraud Detection in Application 🌟 Through Isolation Forest and K-Means Clustering, the project detects suspicious patterns like inconsistent income, duplicate entries, and unrealistic employment data. This end-to-end workflow transforms raw data into actionable fraud insights — enhancing trust and accuracy.
anomaly-detection csv-handling data-cleaning data-exporting data-import data-normalization exploratory-data-analysis export interpretation matplotlib model-evaluation pandas pca python reporting scaling scikit-learn seaborn
Last synced: 06 May 2026
https://github.com/0xsolanaceae/neurolog
ML-Powered Log Analysis
isolation-forest machine-learning scikit-learn
Last synced: 24 Apr 2026
https://github.com/finite-sample/stagecoachml
Build two-stage models when your features arrive in two batches at different times.
machine-learning scikit-learn two-stage-models
Last synced: 14 Jan 2026
https://github.com/adrien-1997/bike-forecast-paris-velib
Bike-sharing demand forecasting in Paris (Vélib’). A data science and machine learning project leveraging open urban mobility data to predict bike availability, analyze time series usage patterns, and provide interactive dashboards for visualization.
bike-sharing dashboard data-science duckdb forecasting machine-learning matplotlib open-data pandas paris predictive-modeling python scikit-learn streamlit transportation urban-mobility velib
Last synced: 11 Apr 2026
https://github.com/uhstray-io/pyrizon
Data Collection, Analysis, Mapping, Pipelining & Transformation, & API using Python
api data-engineering etl numpy pandas plotly python pytorch raw-data scikit-learn seaborne sql sqlite tensorflow
Last synced: 09 Apr 2026
https://github.com/varun-khorgade/churnshield-customer-retention-predictor
Built an ML-based classification model to predict customer churn. Applied data preprocessing, feature engineering, and ensemble algorithms to improve prediction accuracy and help businesses implement retention strategies.
classification-algorithm datapreprocessing f1-score feature-engineering hyperparameter-tuning logistic-regression matplotlib model-evaluation numpy pandas python ran roc-auc scikit-learn seaborn xgboost
Last synced: 07 May 2026
https://github.com/elifirinci/mushrooms-plants-classification
This project features AI models for identifying mushrooms and plants as poisonous or edible using image-based predictions. Both models are tested through an interactive Gradio interface, ensuring user-friendly and accurate identification for foragers and researchers.
classification cnn cnn-classification gradio image-classification machine-learning mushroom-classification plant-classification scikit-learn
Last synced: 17 May 2026
https://github.com/rakibhhridoy/easywaydiveinto-datascience
Data Science is not as easy as it seems at first. The most problem faced by new learner are lack of resource knowledge as well as confusion in using the various resources. I hope this repository will benefit confusion learner.
algorithms algorithms-implemented bayesian-statistics data-science deep-learning deep-neural-networks linear-algebra machine-learning matplotlib multivariate-calculus numpy optimization pandas python scikit-learn scipy seaborn statistics statsmodels tensorflow
Last synced: 06 Apr 2026
https://github.com/ayisha-mohammed/freecodecamp_data_analysis_projects
freeCodeCamp-Data_Analysis projects using python
matplotlib-pyplot numpy python scikit-learn seaborn-plots
Last synced: 07 May 2026
https://github.com/fohlen/stats-experiment
A tiny stats experiment with GENESIS data
matplotlib python3 scikit-learn
Last synced: 17 May 2026
https://github.com/somjit101/human-activity-recognition
This project is to build a model that predicts the human activities such as Walking, Walking Upstairs, Walking Downstairs, Sitting, Standing or Laying using readings from the sensors on a smartphone carried by the user.
decision-tree-classifier eda feature-engineering gradient-boosting-classifier grid-search human-activity-recognition keras logistic-regression lstm random-forest-classifier rbf-kernel scikit-learn seaborn-plots signal-processing support-vector-classifier support-vector-machine t-sne tensorflow uci-har-dataset uci-machine-learning
Last synced: 23 Feb 2026
https://github.com/udityamerit/curafind-powered-by-ai
CuraFind AI is a web-based application leveraging Natural Language Processing (NLP) to intelligently recommend medicines. Users can search using symptoms, medicine names, or free-text descriptions, and receive suggestions along with brand substitutes for drugs
ai machine-learning nlp numpy pandas scikit-learn
Last synced: 18 Sep 2025
https://github.com/evangks/hierarchical-clustering-mall-customers
A comprehensive machine learning project demonstrating hierarchical clustering for customer segmentation on the Mall Customers dataset. Includes EDA, preprocessing, multiple linkage/distance comparisons, and professional visualizations.
clustering data-science hierarchical-clustering jupyter-notebook machine-learning mall-customers portfolio-project python scikit-learn unsupervised-learning
Last synced: 07 Mar 2026
https://github.com/soroush-04/incrementalsvm-road-accident-prediction
Enhance SVM and incremental SVM machine learning models for road accident severity prediction
incremental-learning machine-learning python scikit-learn svm
Last synced: 09 Apr 2026
https://github.com/tasninanika/heart-disease-analysis
The Heart Disease Analysis project is a comprehensive machine learning study aimed at predicting the presence of heart disease using the Heart Disease UCI Dataset.
knn logistic-regression matplotlib numpy pandas python3 random-forest scikit-learn seaborn
Last synced: 09 Apr 2026
https://github.com/weisscharlesj/SciCompChem_Experimental
Scientific Computing for Chemists Jupyter Book
book chemistry jupyter jupyter-lab matplot nmrglue numpy pandas python scikit-image scikit-learn scipy seaborn textbook
Last synced: 07 Nov 2025
https://github.com/psavarmattas/stroke-prediction-ml-model
This is just a theoretical Machine Learning Model that will analyze the data and determine where the stroke can occur.
classification decision-tree-classifier knearest-neighbor-classifier logistic-regression machine-learning multi-layer-perceptron python random-forest scikit-learn sklearn stacking-classifier support-vector-classifier
Last synced: 09 May 2026
https://github.com/1ayanabil1/kaggle-competition
An overview of the competitions and links to their respective folders for detailed information and code
kaggle kaggle-competition kaggle-dataset kaggle-house-prices kaggle-solution kaggle-titanic machine-learning machine-learning-algorithms machinelearning python python-lambda python-script python3 pytorch scikit-learn tensorflow
Last synced: 10 Apr 2026
https://github.com/artikumari28/movie-recommender-system
This project is a content-based movie recommendation system, where movies are recommended based on their similarity in content. The system analyzes various features such as genres, cast, and descriptions to suggest similar movies.
google-colab machine-learning nltk numpy pandas pickle scikit-learn streamlit
Last synced: 06 Apr 2026
https://github.com/armanjscript/fusion-rag
A powerful web-based application designed to answer questions based on the content of uploaded PDF documents. This project leverages the **Fusion-in-Decoder (FiD)** approach for **Retrieval-Augmented Generation (RAG)**, combining semantic similarity, technical term relevance, and recency to deliver accurate and contextually relevant responses
chroma chromadb fusion-rag langchain langchain-ollama ollama pypdf qwen2-5 rag rag-chatbot scikit-learn streamlit tf-idf-score tf-idf-vectorizer vector-database
Last synced: 10 Apr 2026
https://github.com/andystmc/nextflownyc
Developed a machine learning model (Bidirectional LSTM) to forecast NYC traffic volumes using 10 years of automated traffic count data. Achieved strong predictive accuracy, demonstrating the power of deep learning for urban traffic analysis.
data-analysis data-cleaning data-science data-visualization exploratory-data-analysis feature-engineering hyperparameter-tuning jupyter-notebook lstm-neural-networks machine-learning numpy pandas predictive-modeling python3 scikit-learn tensorflow-keras traffic-flow-forecasting
Last synced: 07 Apr 2026
https://github.com/jai0212/cash-app-bias-busters
A platform developed with Cash App to help ML engineers detect and visualize biases in models using Fairlearn. Features include a collaborative and interactive dashboard (React, Chart.js), a Flask backend, and a secure MySQL database for data storage and analysis.
bias-detection chartjs fairlearn flask machine-learning mysql numpy pandas pytest python react scikit-learn scipy
Last synced: 16 Feb 2026
https://github.com/tasninanika/coded_data_prediction-knn
K-Nearest Neighbors (KNN) is a supervised machine learning algorithm
knn pandas python3 scikit-learn
Last synced: 07 Apr 2026
https://github.com/guoshijiang/scikit-learn
带你一起学习scikit-learn
nlp-machine-learning scikit-learn
Last synced: 14 Sep 2025
https://github.com/f-aguzzi/ChemFuseKit
Chemometrics library for data fusion, model training and prediction of data from multiple sensor sources.
chemometrics datafusion knn lda pca plsda scikit-learn svm
Last synced: 21 Sep 2025
https://github.com/evangks/k-means-clustering-synthetic-dataset
Customer Segmentation using K-Means Clustering: A complete machine learning workflow for segmenting customers based on synthetic demographic and spending data, with visualizations, evaluation metrics, and reproducible Jupyter notebook.
clustering customer-segmentation data-science jupyter-notebook k-means-clustering machine-learning portfolio-project python27 scikit-learn unsupervised-learning
Last synced: 10 Mar 2026
https://github.com/viveksapkal2793/advertisement-response-analysis
This project analyzes advertisement responses using a Django backend and a Vite+React frontend. It includes scripts to load, clean, and transform data, which are executed within Docker containers. Data is stored in a MongoDB database, and the project can be run with or without Docker by adjusting the MongoDB connection strings.
advertisement advertisement-analysis container-image containerization django docker machine-learning mongodb react scikit-learn vite
Last synced: 23 Sep 2025
https://github.com/catlikeflyer/rsp-recognition
A computer vision project to recognize thumbs up
machine-learning mediapipe-hands python scikit-learn
Last synced: 16 May 2026
https://github.com/rexsimiloluwah/fastapi-ml-apps
Machine learning apps built with FastAPI
docker fastapi machine-learning python scikit-learn tensorflow
Last synced: 05 Apr 2026
https://github.com/charmee123/krishak_vriddhi
https://krishak-vriddhi.onrender.com
bootstrap css flask html javascript machinelearning-python numpy pandas scikit-learn
Last synced: 06 Apr 2026
https://github.com/robson-python/customer-cancellation
Data science and analytics project to reduce customer cancellations.
data-analysis data-science data-visualization jupyter-notebook machine-learning matplotlib pandas python scikit-learn seaborn streamlit vscode
Last synced: 09 Apr 2026
https://github.com/tlapanco/knn-project
Projecto para la materia de Sistemas inteligentes haciendo uso de KNN oversampling.
jupyter-notebook knn pandas python scikit-learn smote
Last synced: 09 Apr 2026
https://github.com/harmonydata/harmony_original
The Harmony project
ai data-science data-visualization harmonisation harmonization machine-learning mentalhealth multilingual multilingual-nlp natural-language-processing natural-language-understanding naturallanguageprocessing nlp psychology python scikit-learn
Last synced: 13 Apr 2026
https://github.com/gokulgowthams/clickstream-customer-conversion
Analyzes clickstream data from an e-commerce platform to predict customer conversions, estimate potential revenue, and segment users for personalized marketing strategies. By leveraging machine learning techniques, the project enhances decision-making for businesses seeking to optimize user engagement and sales.
data-preprocessing feature-engineering machine-learning matplotlib model-deployment numpy pandas pipeline python scikit-learn seaborn streamlit-web-application tensorflow xgboost
Last synced: 07 Apr 2026
https://github.com/gokularaman-c/ev-charging-log-anomaly-detection
EV charging log anomaly detection using Isolation Forest, engineered telemetry features, and a CLI inference pipeline.
anomaly-detection ev-charging feature-engineering isolation-forest machine-learning mlops python scikit-learn time-series
Last synced: 23 May 2026
https://github.com/docsallover/spam-detection
Building a Spam Filter with Python: Using Machine Learning to Combat Spam
datascience flask jinja2 machine-learning numpy numpy-library pandas pandas-python python python3 scikit-learn
Last synced: 09 Apr 2026
https://github.com/vidyasagarmsc/pydataflownote
As I explore Python libraries....
jupyter-notebook numpy python pytorch scikit-learn scipy tensorflow
Last synced: 27 Sep 2025
https://github.com/jersongb22/computervision
Links to my repositories with a wide variety of Computer Vision models using CNNs, Transfer Learning, and Vision Transformer with TensorFlow, PyTorch, Hugging Face and Ultralytics.
cnn computer-vision convnextv2 efficientnetv2 hugging-face image-captioning image-classification image-segmentation lenet-5 object-detection opencv plotly python pytorch scikit-learn tensorflow ultralytics video-classification vision-transformer yolo11
Last synced: 12 Apr 2026
https://github.com/srgrace/data-cleaning-challenge
Data Cleaning Challenge
data-science jupyter-notebook numpy pandas python scikit-learn
Last synced: 09 Apr 2026
https://github.com/tynab/lottery
Xổ số kiến thiết
crawl da data-analyst deep-learning dl jupyter-notebook lottery matplotlib numpy pandas pip pip3 py python scikit-learn sklearn tynab xo-so xo-so-kien-thiet yan
Last synced: 03 Aug 2025
https://github.com/upul/chocolate-quality-analysis
This repository contains a Jupiter notebook which describes how to use basic machine learning tools such Scikit-Learning, Pandas, and Numpy for buiding models.
machine-learning numpy pandas predictive-analytics scikit-learn
Last synced: 04 May 2026
https://github.com/arunagirinathan-k/text-classifier-using-nlp_techniques
A Text Classification using NLP Techniques.
matplotlib nlp nltk numpy pandas scikit-learn seaborn spacy text-classification
Last synced: 09 Apr 2026
https://github.com/gperdrizet/ensembleset
Ensemble dataset generator for tabular data prediction and modeling projects.
classification ensemble feature-engineering machine-learning regression scikit-learn
Last synced: 07 Mar 2026
https://github.com/abhipatel35/automated-machine-learning-pipeline-for-iris-dataset-classification
Automated ML pipeline for Iris dataset classification using Decision Tree. Features PCA dimensionality reduction and standard scaling.
automated-machine-learning classsification data-preprocessing descision-tree dimentionality-reduction end-to-end-ml-workflows iris-dataset machine-learning-pipeline python random-forest scikit-learn
Last synced: 02 May 2026
https://github.com/shreeparab1890/movie-recommender-system
This notebook is trying to build a model which will recommend the movie based on given movie and genre. In this we use Popularity Based Recommendation, Content Based Recommendation and Collaborative Filtering based Recommendation.
bag-of-words cosine-similarity matplotlib numpy pandas python scikit-learn sklearn vectorization
Last synced: 09 Apr 2026
https://github.com/vimal0156/ruaroa-ai
🧙♂️ Zero-Code Machine Learning Wizard - Transform ideas into intelligent solutions without writing code. AI-powered ML pipeline automation with interactive web interface.
ai-agents ai-assistant artificial-intelligence automated-machine-learning code-generation data-analysis data-science deep-learning jupyter machine-learning machine-learning-pipeline neural-networks no-code openai python scikit-learn streamlit visualization
Last synced: 09 Apr 2026
https://github.com/mariamabidi/pinn-based-flow-prediction
This repository contains code and experiments for predicting 3D aerodynamic flow around car geometries using Physics-Informed Neural Networks (PINNs) and for analyzing flow features via autoencoder-based clustering.
computer-vision machine-learning neural-network numpy pytorch pyvista scikit-learn
Last synced: 05 Aug 2025
https://github.com/veb-101/machine-learning-practice
Contains code-works from the Hands on scikit-learn and tensorflow book
deep-learning keras machine-learning python3 scikit-learn tensorflow-gpu
Last synced: 19 Apr 2026
https://github.com/aymen016/film-recommendation-engine
A machine learning-powered movie recommender system designed to provide personalized recommendations based on user preferences and data analysis. This project includes a backend recommendation engine, a Streamlit-based interface, and a web-based frontend for an enhanced user experience.
flask numpy pandas pickle python scikit-learn streamlit
Last synced: 09 Apr 2026
https://github.com/vedanty3/heart-disease-prediction
This project aims to build a machine learning model using K-Nearest Neighbor, LogisticRegression, RandomForestClassifier to classify whether or not a person has heart disease based upon his medical attributes. (accuracy achieved : 88.52%)
confusion-matrix correlation-matrices jupyter-notebook knn-classification logistic-regression machine-learning matplotlib numpy pandas python random-forest randomforestclassifier roccurve scikit-learn sklearn zerotomastery
Last synced: 09 Apr 2026
https://github.com/malleswarigelli/real_estate_house_price_prediction
Build end-to-end ML Regression pipeline for predicting housing price, deploy Flask app to cloud platform:Heroku with Docker, CI/CD tool: GitHub Actions
ci-cd-pipeline docker heroku-deployment machine-learning mlops mongodb python scikit-learn
Last synced: 09 Apr 2026
https://github.com/ttsudipto/sdldpred
SDLDpred - Symptom-based Drugs of Lifestyle-related Diseases prediction
birch bisecting-kmeans clustering css drug-prediction drug-symptom-associations html js kmeans lifestyle-diseases machine-learning mean-shift php scikit-learn semantic-similarity symptoms web-application
Last synced: 09 Apr 2026
https://github.com/vidhi1290/text-classification-model-with-attention-mechanism-nlp
This Python project utilizes PyTorch to perform text classification with an attention mechanism. Pre-trained GloVe embeddings are processed for word representation, and a custom attention model is trained on consumer complaint data to categorize complaints into product categories.🎯
attention-mechanism deeplearning machine-learning nlp nltk numpy pandas python pytorch scikit-learn text-classification tqdm
Last synced: 06 Apr 2026
https://github.com/dustinmichels/bayesian-values-guesser
Uses some user input, data from the World Values Survey <www.worldvaluessurvey.org>, and Bayes Rule to guess a number of beliefs the user might have. STATUS: In progress.
bayes-rule bayesian-values-guesser naive-bayes-classifier pandas python scikit-learn values-survey
Last synced: 09 Apr 2026
https://github.com/renan-siqueira/face-recognition-research
This project pertains to video analysis, face recognition, and image clustering.
data-science deep-learning dlib docker docker-image face-recognition machine-learning neural-network neural-networks numpy opencv opencv-python pillow python scikit-learn scipy
Last synced: 13 Apr 2026
https://github.com/gaurav9364/credit-card-fraud-detection
Credit Card Fraud Detection using Machine Learning – A classification project that detects fraudulent credit card transactions using supervised learning, with data preprocessing, handling class imbalance, and model evaluation (ROC-AUC, Precision, Recall, F1-score).
googlecolab imbalanced-learn matplotlib numpy pandas python scikit-learn seaborn xgboost
Last synced: 08 Apr 2026
https://github.com/PFS-AI/PFS
The AI-powered desktop tool for finding, classifying, and understanding your files. Search by keyword, ask questions, and get insights from your scattered files instantly.
ai cross-platform data-science document-classification fastapi file-management file-organizer file-search huggingface-transformers knowledge-management langchain machine-learning productivity-tools rag scikit-learn search-engine semantic-search vector-search
Last synced: 30 Dec 2025
https://github.com/aaa1928/iris-ml-classifier
PyTorch model that classifies Iris species based on characteristics about the length and width of sepals and petals.
deep-learning iris-classification iris-dataset machine-learning neural-network numpy pandas python pytorch scikit-learn
Last synced: 05 Apr 2026
https://github.com/uhstray-io/pystockbot
Platform & exchange agnostic Stock, Crypto, and Asset automated Machine Learning & AI Trading Bot
automation docker machine-learning python scikit-learn statistical-analysis trading-algorithms
Last synced: 13 Aug 2025
https://github.com/yvesemmanuel/machine_learning
Implements data problems solved with machine learning algorithms.
data-science keras keras-tensorflow linear-algebra machine-learning neural-network python scikit-learn
Last synced: 09 Apr 2026
https://github.com/jedirhymetrix/twittersentiments
Twitter sentiment analyzer written in Python
machine-learning python scikit-learn tensorflow twitter-sentiment-analyzer
Last synced: 09 Apr 2026
https://github.com/rizz1406/spam-email-detector
Spam Email Classifier using Python and Streamlit A simple machine learning project that classifies emails as **spam** or **ham** using the **Naive Bayes algorithm** and **TF-IDF** for text feature extraction. The project includes a user-friendly web app built with Streamlit
nlp pandas pytho3 scikit-learn streamlit
Last synced: 09 Apr 2026
https://github.com/nekruzash/regression-correlation
This is from CS2023 - AI/DS/ML class, trained a model based on different categories of data and predicted using a linear regression for the best feature that has the greatest effect on the housing prices.
jupyter-notebook python scikit-learn
Last synced: 04 May 2026
https://github.com/mhmudfzli/exploring-mental-health-data
This project demonstrates a comprehensive approach to solving a regression problem using various machine learning models. The notebook includes: Data Preprocessing, Exploratory Data Analysis (EDA), Model Training, Hyperparameter Tuning, Model Evaluation, Feature Importance
catboost lightgbm matplotlib numpy pandas scikit-learn seaborn xgboost
Last synced: 09 Apr 2026