scikit-learn
scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.
- GitHub: https://github.com/topics/scikit-learn
- Wikipedia: https://en.wikipedia.org/wiki/Scikit-learn
- Repo: https://github.com/scikit-learn/scikit-learn
- Created by: David Cournapeau
- Released: January 05, 2010
- Related Topics: scikit, python,
- Aliases: sklearn,
- Last updated: 2026-06-25 00:23:58 UTC
- JSON Representation
https://github.com/Timelaglepomispunctatus405/quality-guard
Protect AI agents by blocking dangerous shell commands and checking tool output quality in real time with an OpenClaw plugin
ai-agents automation claude-ai claude-code cursor dead-code deptrac design-pattern entsoe event-driven evidence-guard gas-prices machine-learning price-prediction python scikit-learn skills tdd time-series-forecasting
Last synced: 16 May 2026
https://github.com/cs50victor/cspaint
handwritten-text recognition application
ai machine-learning python scikit-learn
Last synced: 09 May 2026
https://github.com/yasolg/ml-bootcamp
⚡ Master machine learning in 10 days with this interactive, open-source bootcamp. Learn Python basics to production ML at no cost.
chatgpt crewai data-science deep-learning jose-portilla langchain langgraph large-language-models llama matplotlib-pyplot mlops ollama python satellite-imagery scikit-learn tensorflow tensorflow2 udemy-course-project
Last synced: 15 Apr 2026
https://github.com/jeniljani-4444/end-to-end-car-price-prediction-model
Predict car prices effortlessly using this machine learning model. Built with Python and Scikit-learn it analyzes features like mileage age brand and more to estimate accurate prices. Perfect for buyers sellers and dealerships.
machine-learning matplotlib numpy pandas python scikit-learn seaborn streamlit
Last synced: 10 Apr 2026
https://github.com/prashantsaini1525/heart-disease-predicition
Project 1 : Heart Disease Prediction A machine learning project that uses logistic regression to predict the likelihood of heart disease based on clinical data. This repository includes data preprocessing, model training, evaluation, and an interactive prediction system.
cardiology classification data-science healthcare heart-disease logistic-regression machine-learning predictive-modeling python scikit-learn
Last synced: 04 Jul 2025
https://github.com/hayatiyrtgl/ml_divorce_classification-with-5-models
The code performs data preprocessing, machine learning model training, evaluation, and model saving for a binary classification problem on the divorce dataset.
anket binary-classification classification keras keras-classification-models keras-neural-networks keras-tensorflow machine-learning machine-learning-algorithms machinelearning-python python pythonmachinelearning scikit-learn scikitlearn-machine-learning
Last synced: 08 Apr 2026
https://github.com/yaswanth1702/chatbot
natural-language-processing scikit-learn tf-idf-vectorization
Last synced: 20 Jan 2026
https://github.com/audy21/data-exploratory-portfolio
An advanced visualizations from my recent practices.
matplotlib nltk pandas plotly scikit-learn seaborn tensorflow
Last synced: 27 Aug 2025
https://github.com/akimuddinshaikh/domain-application-of-predictive-analysis
Data-Driven House Price Prediction "Predicting house prices using Machine Learning techniques
feature-engineering pca python random-forest scikit-learn
Last synced: 09 May 2026
https://github.com/arrhythmia-detection/authorfeatureextracteddecisiontreeoptimizedesp32s3
Deploys an optimized Decision Tree for Arrhythmia classification using Chapman ECG dataset on ESP32-S3 dev kit
arrhythmia-classification decision-tree-classifier decision-trees eloquent esp32-arduino esp32-s3 scikit-learn
Last synced: 27 Aug 2025
https://github.com/rohitdusane/spam-classification-model
A Python-based machine learning model for spam detection leveraging TF-IDF vectorization and multiple classifiers, including Naive Bayes, Logistic Regression, and Random Forest. This project demonstrates preprocessing techniques, model training, and performance evaluation for classifying SMS messages as spam or ham.
data-science flask mlflow natural-language-processing scikit-learn spam-detection text-classification
Last synced: 19 Apr 2026
https://github.com/lakshitalearning/codsoft
Machine Learning Projects - CODSOFT Internship: This repository showcases my machine learning projects completed during my internship at Codsoft. It demonstrates my skills in developing innovative solutions using various ML techniques and tools.
churn-prediction codsoft codsoftinternship deep-learning handwritten-text-recognition internship-project keras machine-learning python rnn-tensorflow scikit-learn spam-detection
Last synced: 11 Feb 2026
https://github.com/davidyslu/PokemonRecognition
Recognize Pokemon's image using scikit-learn in Python
knn-model python scikit-learn svm-model
Last synced: 29 Aug 2025
https://github.com/murshidazher/recommendation-system
🎥 Building a recommendation system using python
python recommendation-engine scikit-learn suprise
Last synced: 08 May 2026
https://github.com/gana36/credit-card-fraud-detection
Production MLOps pipeline for fraud detection with automated testing, monitoring, and zero-downtime deployments
docker evidently fastapi fraud-detection grafana machine-learning mlflow mlops postgresql prometheus scikit-learn
Last synced: 10 Apr 2026
https://github.com/mohammedhaq/safestream
SafeStream is a machine learning project that utilizes machine learning to predict the potability of water. By analyzing various water quality parameters, SafeStream helps in determining whether a water source is safe for consumption. This project leverages Python, PyTorch, and scikit-learn.
logistic-regression machine-learning neural-network python pytorch scikit-learn
Last synced: 23 Jul 2025
https://github.com/jofaval/80-cereals
Data Analysis into almost 80 USA cereals user rating in 1993
cereals classification data-analysis data-science data-visualization google-colab kaggle linear-regression logistic-regression machine-learning matplotlib python regression scikit-learn seaborn
Last synced: 12 Apr 2026
https://github.com/oneapi-src/predictive-asset-health-analytics
AI Starter Kit for Predictive Asset Maintenance using Intel® optimized version of XGBoost
Last synced: 04 Apr 2025
https://github.com/mohammad95labbaf/churn-prediction
This project aims to predict customer churn using machine learning algorithms. The project includes data preprocessing, feature engineering, and model evaluation.
adaboost bagging churn churn-analysis churn-prediction decisiontree ensemble-learning knn randomforest scikit-learn sklearn svm voting
Last synced: 23 Jan 2026
https://github.com/csakig/bike-sharing-demand-analytics
Advanced analytics on Bike Sharing data using Random Forest, Gradient Boosting, and SARIMAX forecasting.
data-science machine-learning portfolio python scikit-learn time-series
Last synced: 10 Apr 2026
https://github.com/thiagohrcosta/machinelearning-temperature
A Small Machine Learning application leveraging Scikit-Learn and statistical learning to extract knowledge from data without explicit programming.
machine-learning numpy pandas python3 scikit-learn
Last synced: 08 Apr 2026
https://github.com/thatguychandan/adoptimization
This project implements an ad optimization system using a hybrid approach combining Thompson Sampling and Upper Confidence Bound (UCB) algorithms. The system learns to select the most effective ads based on user context and historical performance.
numpy pandas plotly python pytorch reinforcement-learning scikit-learn streamlit thompson-sampling upper-confidence-bound
Last synced: 10 Apr 2026
https://github.com/thammami01/simple-recruitment-ml
Simple recruitment app that allows job posting/application, and viewing regression/classification figures based on entries.
flask matplot-lib mongodb python scikit-learn
Last synced: 12 Apr 2026
https://github.com/das-amlan/movierecommendationsystem
End To End Machine Learning Implementation of a movie recommendation system
cosine-similarity machine-learning movie-recommendation nltk-python pandas pickle pyhton recommender-system scikit-learn streamlit vectorization
Last synced: 10 Apr 2026
https://github.com/itsmandrew/diabetes-cs178
Final project for CS178, predicting whether and when will patient with diabetes be readmitted in hospital after the treatment.
knn logistic-regression neural-network python scikit-learn
Last synced: 13 Apr 2026
https://github.com/solanovisitor/moodpredictor
An application that uses Machine Learning to predict one's risk of having mood disorders (currently in Portuguese)
pandas python scikit-learn streamlit xgboost
Last synced: 09 Apr 2026
https://github.com/sobhan-m/comp472-project1
A program that builds various machine learning models on a dataset composed of Reddit posts, their emotions, and their sentiments.
ai jupyter-notebook machine-learning python scikit-learn
Last synced: 14 Apr 2025
https://github.com/s0fft/learning-lab
Code Notes & Test-Learn // Micro Pet-Projects: Python / Asynchrony / FastAPI / Django-Tastypie / DRF / Parsing / Telegram-Bot / SQL / Docker / DS / ML / etc.
asynchrony data-science django-rest-framework docker fastapi jupyter-lab jupyter-notebook mashine-learning matplotlib notes numpy pandas parsing python3 scikit-learn seaborn sql sqlalchemy tastypie telegram-bot
Last synced: 10 Apr 2026
https://github.com/shreeparab1890/duplicate-question-predictor
The ipython notebook is working to build a model which will detect duplicate questions if two questions pair are given.
bag-of-words nlp nlp-machine-learning nltk numpy pandas python random-forest scikit-learn sklearn streamlit
Last synced: 10 Apr 2026
https://github.com/kostadinlambov/algorithmic-trading-bot
The project aims to evaluate the predictive performance of different machine learning (ML) algorithms for Bitcoin trading. The proposed trading strategy integrates key technical indicators, including the Relative Strength Index (RSI), Simple and Exponential Moving Averages, and the Moving Average Convergence Divergence (MACD).
lightgbm machine-learning matplotlib mlflow numpy optuna pandas pickle random-forest scikit-learn scipy seaborn statsmodels xgboost
Last synced: 05 Apr 2026
https://github.com/fatimaafzaal/car_price_prediction
Trains Random Forest and Gradient Boosting models to predict car prices based on user inputs for various car attributes, evaluating models and making predictions using the best-performing model.
car-price-prediction ensemble-learning gradient-boosting matplotlib numpy pandas random-forest regression regression-models scikit-learn seaborn
Last synced: 09 Apr 2026
https://github.com/ravi-aratchige/spacefarer
What type of astronaut would you be?
astronaut astronauts flask jupyter jupyter-notebook machine-learning numpy pandas python scikit-learn space tailwind tailwindcss
Last synced: 10 Apr 2026
https://github.com/yuweaec/wine_quality_prediction
The Wine Quality Prediction project aims to predict the quality of wine based on its chemical properties using machine learning algorithms.
flask jupyter-notebook machine-learning python scikit-learn
Last synced: 11 Apr 2025
https://github.com/sayamalt/airline-passenger-satisfaction-classification
Successfully developed a machine learning model to predict Airline Passenger Satisfaction by building an end-to-end MLOps pipeline. It integrates DVC for data versioning, a Dockerfile for containerization, and CI/CD using GitHub Actions for automated deployment.
azure-web-app-service ci-cd-pipeline classification docker-container dvc-pipeline experiment-tracking exploratory-data-analysis feature-engineering github-actions hyperparameter-tuning machine-learning mlflow mlflow-tracking mlops-workflow model-registry model-training-and-evaluation model-versioning optuna scikit-learn
Last synced: 16 Apr 2026
https://github.com/mabjq/fastapi_iris_ml_project
Simple FastAPI ML application project
fastapi jinja2-templates macine-learning scikit-learn
Last synced: 30 Mar 2025
https://github.com/keremm26/pca-analysis
Principal Component Analysis and Clustering on Survey Data using Python and scikit-learn
k-means-clustering machine-learning pca-analysis scikit-learn unsupervised-learning
Last synced: 04 May 2026
https://github.com/vickshan001/diabetes-prediction-using-machine-learning-ci512-project-
Machine learning project (2021) predicting diabetes using the Pima Indians dataset. Compared KNN, Decision Tree, MLP, and more for accuracy.
classification diabetes-prediction machine-learning mlp pima-indians-dataset python scikit-learn
Last synced: 10 May 2026
https://github.com/dd-se/ml-app
Predict unseen numbers with ML models trained on MNIST dataset.
opencv python scikit-learn streamlit
Last synced: 30 Mar 2025
https://github.com/akashprak/socialnetworkads
Predicting customer purchase behavior from the Social Network Ads dataset.
data-analysis machine-learning mlflow pandas python scikit-learn seaborn xgboost
Last synced: 30 Mar 2025
https://github.com/igormteixeira/tcc-deteccao-ataques-dos
Este repositório contém o código-fonte e o artigo científico do meu TCC sobre detecção de ataques DoS utilizando Machine Learning. O objetivo do trabalho é analisar e implementar um modelo de aprendizado de máquina treinado com uma base pública para identificar atividades maliciosas reais em sistemas computacionais.
cybersecurity dos-attack machine-learning python scikit-learn tcc
Last synced: 03 May 2026
https://github.com/lorenzorottigni/ml-iris-svm
Machine Learning python bootcamp: Support Vector Machines on iris flower dataset
ipynb machine-learning numpy pandas python scikit-learn seaborn support-vector-machines
Last synced: 10 Apr 2026
https://github.com/oneapi-src/customer-churn-prediction
AI Starter Kit for customer churn prediction using Intel® Extension for Scikit-learn*
Last synced: 04 Apr 2025
https://github.com/striderzz/ml-heart-disease-classification
Machine Learning - Heart Disease Classification Project using Sci-Kit Learn
classification-machine-learning machine-learning machine-learning-projects scikit-learn
Last synced: 16 May 2026
https://github.com/gfyoung/tree-decode
Package for removing the black-box around decision trees
blackbox decision-tree machine-learning python scikit-learn
Last synced: 20 Jan 2026
https://github.com/sanggusti/mentoring-skilvul-sic
A repository for teaching and mentoring as instructor of Skilvul Samsung Innovation Campus 2024
computer-vision flask machine-learning pymongo scikit-learn sql
Last synced: 20 Jan 2026
https://github.com/sarincr/training-on-artificial-intelligence
Entree Academy 10 Days free training on Artificial Intelligence. Course will be conducted in a Blended learning way with Daily one hour online training and 3 hour project based training
artificial-intelligence artificial-intelligence-algorithms data-analysis data-science data-visualization decision-trees deep-learning deeplearning logistic-regression machine-learning machine-learning-algorithms machinelearning num numpy pandas regression scikit-learn scipy sklearn
Last synced: 10 Apr 2026
https://github.com/adi3042/credit-card-fault-detection
🔍💳 Secure Your Finances! Detect anomalies and safeguard transactions with our Credit Card Fault Detection system. Dive into cutting-edge classification techniques to identify fraud and protect financial data. Your journey to secure payments starts here! 🚨🔒 FraudDetectionTech
classification credit-card css datetime fault-detection flask functools html ipykernel jupyternotebooks machine-learning numpy pandas python3 readme scikit-learn setuptools venv
Last synced: 03 Apr 2026
https://github.com/korpog/br_cancer
Binary classifier for Breast Cancer Wisconsin Data Set created with scikit-learn and xgboost.
classification data-science machine-learning pandas python scikit-learn xgboost
Last synced: 10 Apr 2026
https://github.com/pramodyasahan/car-safe-predictor
This repository contains a machine learning project that applies the K-Nearest Neighbors (KNN) classification algorithm to predict car safety ratings. The project uses a dataset of cars, with features such as buying price, maintenance cost, number of doors, persons, lug boot size, and safety.
classification k-nearest-neighbours machine-learning numpy pandas scikit-learn
Last synced: 10 Apr 2026
https://github.com/anav5704/honeywell-aog-zero
Data-driven, proactive maintenance sheduling system for APUs
docker fastapi nextjs postgresql scikit-learn
Last synced: 20 Jan 2026
https://github.com/lilivalgo/machine-learning-projects
This repository hosts the machine learning project developed during my learning journey. It showcases my progress and the skills acquired in the field of machine learning
lag-feature linear-regression ml-models scikit-learn scipy-stats seaborn-plots
Last synced: 28 Mar 2025
https://github.com/alyssonmach/machine-learning-com-python
Aplicações de Machine Learning usando a linguagem de programação Python.
ia keras-tensorflow machine-learning matplotlib numpy pandas programming python scikit-learn scipy
Last synced: 10 Apr 2026
https://github.com/devash2/ayur-scan
Indian Medicinal Leaf detection application using ML and DL
flask flutter google-firebase opencv python scikit-learn tensorflow
Last synced: 10 Apr 2026
https://github.com/lorenzorottigni/ml-ecommerce
Machine Learning python bootcamp: linear regression on ecommerce dataset
ipynb linear-regression machine-learning numpy pandas python scikit-learn seaborn
Last synced: 07 Apr 2026
https://github.com/lorenzorottigni/dl-tensorboard
Deep Learning python bootcamp: tensorboard with cancer dataset
deep-learning ipynb machine-learning python scikit-learn tensorboard tensorflow
Last synced: 05 May 2026
https://github.com/sahilk12nayak/hyperspectral-corn-don-prediction-project
This project contains a machine learning pipeline for predicting DON (vomitoxin) concentration in corn samples using hyperspectral imaging data.
matplotlib numpy pandas python scikit-learn seaborn tensorflow
Last synced: 10 Apr 2026
https://github.com/dipto9999/ml_introduction
An Introduction to Machine Learning, primarily using Python scikit-learn library.
data-science decision-trees jupyter-notebook k-means-clustering k-nearest-neighbors linear-regression logistic-regression machine-learning machine-learning-algorithms matplotlib numpy pandas principal-component-analysis python random-forest scikit-learn seaborn support-vector-machines
Last synced: 10 Apr 2026
https://github.com/1egoman/machine-deep-learning-notes
Notes from my machine deep learning exploration
deep-learning machine-learning neural-networks notes python scikit-learn tensorflow udacity
Last synced: 10 May 2026
https://github.com/arasoul/face-recognition-streamlit
🎯 Neural Face Recognition Matrix - Professional AI-powered biometric identification system with real-time face detection, recognition, and cyberpunk-styled interfaces. Features both web (Streamlit) and desktop (Tkinter) applications with comprehensive training pipeline, Docker deployment, and CI/CD automation.
ai bioinformatics computer-vision deep-learning face-recognition facenet gui machine-learning mtcnn neural-network open-source opencv pytorch real-time scikit-learn streamlit svm
Last synced: 02 Apr 2026
https://github.com/ksasi/customer_segments
machine-learning numpy pandas python scikit-learn
Last synced: 10 Apr 2026
https://github.com/adityakumarda/kmeans-web-analytics
Built with Python, Pandas, and Scikit-learn, this machine learning project uses K-Means to cluster website users by behavior. It reveals patterns in engagement and bounce, helping drive data-informed decisions.
cluster-analysis elbow-curves elbow-method elbow-plot jupyter-notebook kmeans-clustering machine-learning matplotlib numpy pandas python python3 relationship scikit-learn seaborn sklearn
Last synced: 10 Apr 2026
https://github.com/dineshdhamodharan24/amazon-reviews-sentiment-analysis
This is a sentiment analysis project that classifies Amazon product reviews as positive or negative using machine learning techniques.
matplotlib numpy pandas python scikit-learn
Last synced: 10 Apr 2026
https://github.com/alphacrypto246/grape-quality-prediction
The Grape Quality Prediction project uses machine learning to predict the quality of grapes based on chemical properties like acidity, sugar content, and alcohol levels. It applies regression models to forecast the quality score, helping in wine production and quality assessment.
machine-learning numpy pandas scikit-learn scikitlearn-machine-learning
Last synced: 19 Apr 2026
https://github.com/sunnyrao07/stroke-risk-prediction
Predicting stroke risk using machine learning models based on healthcare and demographic data.
data-cleaning data-visualization decision-trees feature-engineering label-encoding matplotlib model-evaluation numpy outlier-detection pandas python random-forest scikit-learn seaborn standard-scaler
Last synced: 10 Apr 2026
https://github.com/jelhamm/principle-component-analysis-data-mining
"This repository contains an implementation of the Principal Component Analysis (PCA) algorithm, which is one of the key techniques used for dimensionality reduction in data mining and machine learning."
data-mining data-science jupyter-notebook machine-learning machine-learning-algorithms pca principal-component-analysis python pytorch scikit-learn scipy-library tensorflow
Last synced: 10 Apr 2026
https://github.com/sisolieri/prova_ds_saloocupacio2024
Admission challenge to Hackató Saló Ocupació by Barcelona activa
arima barcelona catboost data-analysis data-visualizations forecasting machine-learning pandas public-funding python scikit-learn time-series xgboost
Last synced: 10 Apr 2026
https://github.com/broodhoney/heart-disease-prediction
This is a machine learning project which has a trained model that classifies whether a patient has a heart-disease or not.
kaggle-dataset matplotlib numpy pandas python scikit-learn scikitlearn-machine-learning uci
Last synced: 10 Apr 2026
https://github.com/thiagohrcosta/movieapp-ml
The Movie APP is a project created to apply some of the concepts learned throughout the post-graduation degree at XP Educação in Artificial Intelligence with an emphasis on Machine Learning. While this project is not integrated into the curriculum of the course, some of the concepts used were learned during the program.
docker flask-api machine-learning mysql-database postgresql python scikit-learn
Last synced: 10 Apr 2026
https://github.com/ameykasbe/credit-card-fraud-detection-on-imbalanced-dataset
Examined data preprocessing techniques and performance of six different predictive models in Python to credit card fraud detection problem on an imbalanced dataset. Algorithms implemented - Logistic Regression, K Nearest Neighbours, Support Vector Classification, Naïve Bayes Classifier, Decision Tree Classifier, and Random Forest Classifier.
classification machine-learning matplotlib numpy pandas python scikit-learn seaborn
Last synced: 10 Apr 2026
https://github.com/vijaykumarr1452/startup_success_predictor
This project demonstrates the use of Multiple Linear Regression to predict the profits of startups based on investment in R&D, Administration, and Marketing of dataset (50_Startups.csv)
machine-learning multi-linear-regression numpy pandas python regression rsquare-values scikit-learn
Last synced: 10 Apr 2026
https://github.com/jol79/python_exercises
Solving interesting python exercises on different topics
matplotlib-pyplot numpy pandas python3 pythonexercises scikit-learn seaborn
Last synced: 10 Apr 2026
https://github.com/snigdho8869/clustering-projects
Repository for various clustering projects including mall customer segmentation and more. Explore data analysis and clustering techniques
cluster-analysis clustering clustering-algorithm customer-segmentation deep-learning gaussian-mixture-models hierarchical-clustering hierarchical-models kmeans-algorithm kmeans-clustering machine-learning mall-customer-segmentation mall-customers numpy pandas scikit-learn segmentation spectral-clustering tensorflow
Last synced: 10 Apr 2026
https://github.com/gamowy/music-classification
Music genre classification using k nearest neighbors classifier based on gtzan dataset
machinelearning python scikit-learn university-assignment
Last synced: 10 Apr 2026
https://github.com/filsan-musa/project-iot_malware_identification
This repository contains the code and data for a project that detects malware from IoT devices using a publish-subscribe model with Confluent and Databricks. The project streams IoT device data to Kafka, analyzes it, and detects malware using machine learning models such as Random Forest and Gradient Boosted Trees.
apache-kafka classification confluent databricks machine-learning-algorithms scikit-learn sql
Last synced: 31 Aug 2025
https://github.com/crispengari/python-sklearn
💎 Introduction to machine learning with scikit-learn in python. A quick walk through the sklearn library for machine learning and understanding different machine learning algorithims.
ai artificial-intelligence classification clustering datascience jupyter-notebook machine-learning ml-python nlp python regression scikit-learn
Last synced: 13 May 2026
https://github.com/arssite/dirty-cleanflooringimageprocessingusingyolov5
Uses YOLOv5 to classify floor cleanliness into five categories based on visual cues. It includes an annotated dataset, trained model,& evaluation outputs. Code covers data preprocessing, training, & testing. A comparative analysis highlights YOLOv5's advantages over traditional methods, providing an efficient solution automated floor cleanliness.
deep-neural-networks github google-colab jupyter-notebook labelimg matplotlib-pyplot numpy-library opencv-python pandas-python pytorch scikit-learn tensorflow yolov5
Last synced: 10 Apr 2026
https://github.com/celineboutinon/faux-billets
OpenClassrooms Data Analyst 2022-2023 - Projet 10
machine-learning python scikit-learn statsmodels
Last synced: 16 May 2026
https://github.com/annasmustafadev/network-intrusion-detection-ml
Machine learning-based Intrusion Detection System (IDS) for classifying network traffic as normal or malicious using supervised learning techniques. Includes data preprocessing, feature selection, model training, and evaluation for improved cybersecurity intelligence.
anomaly-detection classification cyber-security data-science intrusion-detection machine-learning python scikit-learn supervised-learning
Last synced: 29 Apr 2026
https://github.com/chirindaopensource/measuring_economic_outlook_in_news
End-to-End Python implementation of Beck et al.'s (2025) economic sentiment analysis framework for constructing a high-frequency economic sentiment indicator using 1024-dimensional Jina embeddings and LLM-generated training data. Features L2-regularized classification and rigorous POOS econometric validation with DM-HAC tests for GDP forecasting.
claude-ai computational-economics econometrics financial-modeling jina-embeddings llm nlp privacy-preserving-ml python regularized-regression reproducible-research scikit-learn sentiment-analysis statsmodels synthetic-data tensorflow time-series-forecasting transformers weak-supervision
Last synced: 30 Apr 2026
https://github.com/anastasius21/fakenewsmodel
The repo contains the model for fake news detection and a streamlit app for its implementation.
fake-news-detection machine-learning nlp pandas python scikit-learn
Last synced: 05 May 2026
https://github.com/tomgorb/ds-utils
pre-processing of a DataFrame into a sparse matrix for model input
machine-learning preprocessing scikit-learn
Last synced: 16 May 2026
https://github.com/jay4codes/time-series-comparative-analysis
A comparative analysis of various time series models on JP Morgan's stock price data for stock price predictive analysis
scikit-learn tensorflow time-series-analysis
Last synced: 10 May 2026
https://github.com/miteshgupta07/zomato-restaurant-rating-predictor
A Zomato rating prediction app that uses machine learning to forecast restaurant ratings based on various factors, helping users make informed dining decisions.
flask machine-learning python scikit-learn
Last synced: 10 Apr 2026
https://github.com/sakshi2215/email-sms--spam_classifier
nltk-python scikit-learn streamlit-webapp
Last synced: 17 May 2026
https://github.com/gozsari/ml-oneday-course
This is a one-day machine learning introductory course for beginners
anomaly-detection classification clustering course dimensionality-reduction machine-learning machine-learning-algorithms ml-course ml-workflow regression scikit-learn supervised-learning unsupervised-learning
Last synced: 20 Jan 2026
https://github.com/umasivakumar14/real_estate_ml_model
Predicts the price of a home in Bengaluru, Karnataka based on location, urbanization, total square feet, bedrooms, bathrooms, and balconies.
aws flask gridsearchcv http-requests machine-learning machine-learning-algorithms nginx pandas python scikit-learn
Last synced: 02 Feb 2026
https://github.com/usmana5809/quran-recitation-audio-classification
Quran Recitation Audio Classification project aims to classify different recitations of the Quran using machine learning techniques. It involves preprocessing audio data, extracting features, training models, and evaluating their performance
audio-classification classification-model islamic-studies librosa machine-learning python quran scikit-learn
Last synced: 20 Mar 2025
https://github.com/tsungtsetu122/datamining-cifar10-classification
Data mining project on CIFAR-10 extracted features, applying preprocessing, classification models, and evaluation techniques to improve classification performance.
matplotlib numpy pandas python scikit-learn
Last synced: 10 Apr 2026
https://github.com/parthapray/nlp_pipeline_openai
This repo contains nlp pipeline and openai API integration
gradio matplotlib networkx nltk openai rake-nltk scikit-learn seaborn spacy textblob textstat wordcloud
Last synced: 10 Apr 2026
https://github.com/ranimeshehata/softmax-regression-on-mnist
A PyTorch-based project for classifying the MNIST dataset using Softmax Regression, including training, validation, results and visualization.
matplotlib mnist python3 pytorch scikit-learn softmax-regression torchvision
Last synced: 15 Apr 2026
https://github.com/j-sephb-lt-n/unsupervised-fraud-detection
Exploring anomaly detection using unsupervised methods in scikit-learn
anomaly anomaly-detection fraud-detection isolation-forest local-outlier-factor outlier outlier-det scikit-learn sklearn unsupervised
Last synced: 07 May 2026
https://github.com/anras5/songs-classifier
Classify songs' genres with Machine Learning
data-science docker machine-learning mlflow pandas python scikit-learn seaborn streamlit tpot
Last synced: 10 Apr 2026
https://github.com/prasadhiremath1/movie-recommender-system
Select a movie and 5 similar movies are recommended from the tmdb dataset
pandas python3 scikit-learn streamlit
Last synced: 11 Apr 2026
https://github.com/senaayy/adhd-network-efficiency
🧠 End-to-end fMRI analysis pipeline comparing ADHD brain topology vs. Healthy Controls using Graph Theory (Global Efficiency & Clustering). Built with Nilearn, NetworkX, and Docker for reproducible neuroscience.
adhd bioinformatics brain-networks computational-neuroscience data-science docker fmri graph-theory network-analysis networkx neuroscience nilearn python scikit-learn
Last synced: 17 Jun 2026
https://github.com/crispengari/ml-web-applications
✔ This repository contains a series of machine learning web applications, using python.
artificial-intelligence deeplearning flask javascript machinelearning nueral-networks python scikit-learn sentiment-analysis webapplication
Last synced: 11 Apr 2026
https://github.com/mnitin-reddy/collaborative-filtering-based-recommendation-system
This project is a Book Recommendation System that uses two main approaches: Popularity-Based and Collaborative Filtering. It recommends top books based on their rating frequency and average ratings, and also provides personalized book suggestions by analyzing user interactions.
collaborative-filtering numpy pandas popularity-based-recommendation python recommendation-system scikit-learn
Last synced: 11 Apr 2026