An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/akashshnkr/multi-disease-prediction

Developed and integrated three machine learning models for predicting diabetes, Parkinson's, and heart disease into a Streamlit-based web application. The interface allows users to input data and receive accurate health predictions, enhancing early detection and healthcare outcomes.

logistic-regression machine-learning-algorithms numpy pandas python scikit-learn streamlit-webapp svm

Last synced: 02 Jan 2026

https://github.com/kingabzpro/mlops-with-jenkins

From data ingestion to deploying the model using Jenkins.

classification fastapi jenkins mlops scikit-learn

Last synced: 13 Feb 2026

https://github.com/pockerman/tech3python

Collection of Python based algorithms on numerics, statistics, control etc

algorithms control estimation kalman-filter machine-learning numerical-methods particle-filter python3 scikit-learn statistics

Last synced: 18 May 2026

https://github.com/lucasfrag/dengue-prediction-knc

Projeto desenvolvido para realizar previsão de casos de dengue usando o algoritmo de classificação KNeighborsClassifier.

data-science knearest-neighbor-classifier machine-learning pandas python scikit-learn

Last synced: 11 Mar 2025

https://github.com/achuth-0908/hemoguard-anemia-predictor

A Web App incorporated with a Gradient Boosting Classifier Model, to predict Anemia with given data.

css flask html matplotlib numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/tasninanika/k-means-clustering

An interactive and insightful customer segmentation project using K-Means Clustering.

matplotlib numpy pandas plotly python3 scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/jswong65/machine_learning_nanodegree

Projects of Udacity Machine Learning nanodegree

machine-learning numpy pandas python scikit-learn scipy

Last synced: 09 Apr 2026

https://github.com/kaushiksk/ml-kaggle-titanic-data

Application of various Machine Learning Algorithms and Optimization techniques to make Prediction on the Titanic Dataset at Kaggle.com

kaggle machine-learning pandas scikit-learn

Last synced: 05 May 2026

https://github.com/suundumused/weather-forecast-ai-example

The project scope is a weather forecasting model based on behavioral analysis of the last 33 hours (hour-by-hour forecast) with Random Forest Classifier. The program automatically saves and loads the last trained model for prediction.

ai artificial-intelligence artificial-intelligence-algorithms artificial-intelligence-projects artificialintelligence scikit scikit-learn scikit-learn-python scikitlearn scikitlearn-machine-learning weather weather-conditions weather-forecast weather-information

Last synced: 20 May 2026

https://gitlab.com/hylkedonker/statkit

Statistics for sci-kit learn.

machine learning scikit-learn statistics

Last synced: 01 Nov 2025

https://github.com/ladityagogoi/shadowguard

The ShadowGuard Browser Extension is a powerful tool designed to enhance user experience by identifying and highlighting potential dark patterns on websites. Our extension employs a combination of machine learning algorithms and natural language processing (NLP) models to detect and classify various deceptive design practices

css flask html javascript joblib numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/soumyagautam/sign-sense

Deep Learning and Neural Network based Sign Sense or 'Sign Language' to Speech converter is an desktop app which can detect hand signs in a frame and can convert them to Speech, according to their respective meaning. Opposite to this, it can also recognise your voice and can convert it to sign language.

ai cv2 dataprocessing deep-learning keras machine-learning mediapipe moviepy-library neural-network openai-whisper scikit-learn tensorflow tkinter-python

Last synced: 10 Apr 2026

https://github.com/pacatro/lse2text

Deep learning program that translates Spanish Sign Language (LSE) to text in real time.

ai cnn computer-vision deep-learning lse matplotlib numpy pandas python pytorch pytorch-lightning scikit-learn torchmetrics translation

Last synced: 11 Apr 2026

https://github.com/mgobeaalcoba/linear_algebra_for_machine_learning

Explore fundamental linear algebra concepts essential for machine learning in this repository, with code examples and explanations. Get a solid foundation for ML!

machine-learning matplotlib numpy pandas python3 scikit-learn scipy seaborn

Last synced: 12 Apr 2026

https://github.com/camilajaviera91/prediction-of-housing-prices-using-linear-regression

This project provides tools to search for datasets on Kaggle, download and preprocess them, and perform predictions using a Linear Regression model. It includes interactive text-based user interfaces built with `curses`.

curses kaggle linear-regression matplotlib-pyplot mean-absolute-error mean-square-error numpy pandas pathlib python scikit-learn train-test-split

Last synced: 10 Apr 2026

https://github.com/zen204/airbnb-availability

A machine learning model that predicts Airbnb listing availability, utilizing feature engineering and supervised learning techniques to improve guest experience and optimize host management.

binary-classification data-analysis data-preprocessing data-visualization feature-engineering machine-learning matplotlib model-evaluation nlp pandas predictive-modeling python scikit-learn seaborn supervised-learning

Last synced: 21 Jan 2026

https://github.com/flysirin/adstextclassification

Classification of advertisements by topic

docker excel flask pandas python pytorch scikit-learn

Last synced: 02 Jan 2026

https://github.com/aryanpillai2007/credit-card-fraud-detection

The primary goal of this project is to develop a comprehensive fraud detection system that enhances the security and trustworthiness of financial transactions.

anomaly-detection classification credit-card-fraud data-preprocessing data-science data-visualization fraud-detection imbalanced-data logistic-regression machine-learning outlier-detection pca pca-analysis python roc-curve scikit-learn

Last synced: 18 May 2026

https://github.com/prarthana-singh/heart-attack-prediction-model

A Machine Learning model that predicts the risk of a heart attack based on health parameters like cholesterol levels, blood pressure, BMI, smoking habits, and age. Built using Classification models, Scikit-Learn, Pandas, and Python.

classification data-analysis data-science heart-attack-prediction logistic-regression machine-learning numpy pandas python scikit-learn

Last synced: 25 Jun 2025

https://github.com/sanggusti/mentoring-skilvul-sic

A repository for teaching and mentoring as instructor of Skilvul Samsung Innovation Campus 2024

computer-vision flask machine-learning pymongo scikit-learn sql

Last synced: 20 Jan 2026

https://github.com/sarincr/training-on-artificial-intelligence

Entree Academy 10 Days free training on Artificial Intelligence. Course will be conducted in a Blended learning way with Daily one hour online training and 3 hour project based training

artificial-intelligence artificial-intelligence-algorithms data-analysis data-science data-visualization decision-trees deep-learning deeplearning logistic-regression machine-learning machine-learning-algorithms machinelearning num numpy pandas regression scikit-learn scipy sklearn

Last synced: 10 Apr 2026

https://github.com/adi3042/credit-card-fault-detection

🔍💳 Secure Your Finances! Detect anomalies and safeguard transactions with our Credit Card Fault Detection system. Dive into cutting-edge classification techniques to identify fraud and protect financial data. Your journey to secure payments starts here! 🚨🔒 FraudDetectionTech

classification credit-card css datetime fault-detection flask functools html ipykernel jupyternotebooks machine-learning numpy pandas python3 readme scikit-learn setuptools venv

Last synced: 03 Apr 2026

https://github.com/korpog/br_cancer

Binary classifier for Breast Cancer Wisconsin Data Set created with scikit-learn and xgboost.

classification data-science machine-learning pandas python scikit-learn xgboost

Last synced: 10 Apr 2026

https://github.com/pramodyasahan/car-safe-predictor

This repository contains a machine learning project that applies the K-Nearest Neighbors (KNN) classification algorithm to predict car safety ratings. The project uses a dataset of cars, with features such as buying price, maintenance cost, number of doors, persons, lug boot size, and safety.

classification k-nearest-neighbours machine-learning numpy pandas scikit-learn

Last synced: 10 Apr 2026

https://github.com/suvanwita/safescope

Women safety pattern analyzer using public crime datasets, DBSCAN hotspot clustering, Isolation Forest anomaly detection, geospatial heatmaps, and explainable risk scoring to surface historical incident patterns and time-aware safety insights

civic-tech crime-analysis dbscan isolation-forest machine-learning plotly python risk-scoring scikit-learn streamlit women-safety

Last synced: 25 Jun 2026

https://github.com/anav5704/honeywell-aog-zero

Data-driven, proactive maintenance sheduling system for APUs

docker fastapi nextjs postgresql scikit-learn

Last synced: 20 Jan 2026

https://github.com/leabrodyheine/ml-kaggle-cirrhosis-data

This project showcases skills in machine learning, data preprocessing, and model evaluation using Python libraries such as scikit-learn, XGBoost, and Optuna. It involves implementing various machine learning models, handling imbalanced data, and employing imputation techniques to enhance model performance for predicting cirrhosis outcomes.

data-analysis data-pre imbalanced-data imputation machine-learning optuna pipeline scikit-learn xgboost

Last synced: 14 May 2026

https://github.com/alyssonmach/machine-learning-com-python

Aplicações de Machine Learning usando a linguagem de programação Python.

ia keras-tensorflow machine-learning matplotlib numpy pandas programming python scikit-learn scipy

Last synced: 10 Apr 2026

https://github.com/vipulbunny/house-price-prediction

House Price Prediction is a machine learning project that analyzes real estate data to predict house prices based on various features like location, size, and amenities. It involves data preprocessing, exploratory data analysis (EDA), feature engineering, and model training using regression algorithms to provide accurate price estimates. 🚀📊🏡

ai-in-real-estate data-science data-visualization eda feature-engineering house-price-prediction housing-market-analysis machine-learning predictive-modeling python real-estate-analytics regression-models scikit-learn

Last synced: 03 May 2026

https://github.com/devash2/ayur-scan

Indian Medicinal Leaf detection application using ML and DL

flask flutter google-firebase opencv python scikit-learn tensorflow

Last synced: 10 Apr 2026

https://github.com/lorenzorottigni/ml-ecommerce

Machine Learning python bootcamp: linear regression on ecommerce dataset

ipynb linear-regression machine-learning numpy pandas python scikit-learn seaborn

Last synced: 07 Apr 2026

https://github.com/lorenzorottigni/dl-tensorboard

Deep Learning python bootcamp: tensorboard with cancer dataset

deep-learning ipynb machine-learning python scikit-learn tensorboard tensorflow

Last synced: 05 May 2026

https://github.com/sahilk12nayak/hyperspectral-corn-don-prediction-project

This project contains a machine learning pipeline for predicting DON (vomitoxin) concentration in corn samples using hyperspectral imaging data.

matplotlib numpy pandas python scikit-learn seaborn tensorflow

Last synced: 10 Apr 2026

https://github.com/vedanty3/bulldozer-price-prediction

A machine learning project aiming to build a machine learning model which could predict the sales price of bulldozer.

andrew-ng-machine-learning ensemble-machine-learning gridsearchcv jupyter-notebook machine-learning matplotlib numpy pandas python randomforestregressor randomizedsearchcv scikit-learn ztm

Last synced: 05 Apr 2026

https://github.com/arasoul/face-recognition-streamlit

🎯 Neural Face Recognition Matrix - Professional AI-powered biometric identification system with real-time face detection, recognition, and cyberpunk-styled interfaces. Features both web (Streamlit) and desktop (Tkinter) applications with comprehensive training pipeline, Docker deployment, and CI/CD automation.

ai bioinformatics computer-vision deep-learning face-recognition facenet gui machine-learning mtcnn neural-network open-source opencv pytorch real-time scikit-learn streamlit svm

Last synced: 02 Apr 2026

https://github.com/adityakumarda/kmeans-web-analytics

Built with Python, Pandas, and Scikit-learn, this machine learning project uses K-Means to cluster website users by behavior. It reveals patterns in engagement and bounce, helping drive data-informed decisions.

cluster-analysis elbow-curves elbow-method elbow-plot jupyter-notebook kmeans-clustering machine-learning matplotlib numpy pandas python python3 relationship scikit-learn seaborn sklearn

Last synced: 10 Apr 2026

https://github.com/dineshdhamodharan24/amazon-reviews-sentiment-analysis

This is a sentiment analysis project that classifies Amazon product reviews as positive or negative using machine learning techniques.

matplotlib numpy pandas python scikit-learn

Last synced: 10 Apr 2026

https://github.com/jelhamm/principle-component-analysis-data-mining

"This repository contains an implementation of the Principal Component Analysis (PCA) algorithm, which is one of the key techniques used for dimensionality reduction in data mining and machine learning."

data-mining data-science jupyter-notebook machine-learning machine-learning-algorithms pca principal-component-analysis python pytorch scikit-learn scipy-library tensorflow

Last synced: 10 Apr 2026

https://github.com/broodhoney/heart-disease-prediction

This is a machine learning project which has a trained model that classifies whether a patient has a heart-disease or not.

kaggle-dataset matplotlib numpy pandas python scikit-learn scikitlearn-machine-learning uci

Last synced: 10 Apr 2026

https://github.com/thiagohrcosta/movieapp-ml

The Movie APP is a project created to apply some of the concepts learned throughout the post-graduation degree at XP Educação in Artificial Intelligence with an emphasis on Machine Learning. While this project is not integrated into the curriculum of the course, some of the concepts used were learned during the program.

docker flask-api machine-learning mysql-database postgresql python scikit-learn

Last synced: 10 Apr 2026

https://github.com/ameykasbe/credit-card-fraud-detection-on-imbalanced-dataset

Examined data preprocessing techniques and performance of six different predictive models in Python to credit card fraud detection problem on an imbalanced dataset. Algorithms implemented - Logistic Regression, K Nearest Neighbours, Support Vector Classification, Naïve Bayes Classifier, Decision Tree Classifier, and Random Forest Classifier.

classification machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/vijaykumarr1452/startup_success_predictor

This project demonstrates the use of Multiple Linear Regression to predict the profits of startups based on investment in R&D, Administration, and Marketing of dataset (50_Startups.csv)

machine-learning multi-linear-regression numpy pandas python regression rsquare-values scikit-learn

Last synced: 10 Apr 2026

https://github.com/bhazel/dockerfiles

Some Dockerfiles for working with specific technologies or learning resources.

docker dockerfile ocaml python rails ruby scikit-learn

Last synced: 10 Apr 2026

https://github.com/jol79/python_exercises

Solving interesting python exercises on different topics

matplotlib-pyplot numpy pandas python3 pythonexercises scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/gamowy/music-classification

Music genre classification using k nearest neighbors classifier based on gtzan dataset

machinelearning python scikit-learn university-assignment

Last synced: 10 Apr 2026

https://github.com/filsan-musa/project-iot_malware_identification

This repository contains the code and data for a project that detects malware from IoT devices using a publish-subscribe model with Confluent and Databricks. The project streams IoT device data to Kafka, analyzes it, and detects malware using machine learning models such as Random Forest and Gradient Boosted Trees.

apache-kafka classification confluent databricks machine-learning-algorithms scikit-learn sql

Last synced: 31 Aug 2025

https://github.com/marktheo/bike-sharing-demand

Jupyter Notebook - Predicting bike rental numbers based on climate and temporal data

decision-tree-classifier decision-tree-regression jupyter-notebook machine-learning scikit-learn

Last synced: 18 May 2026

https://github.com/ahmed-maher77/signlink___graduation-project

𝐀𝐈-𝐏𝐨𝐰𝐞𝐫𝐞𝐝 𝐒𝐢𝐠𝐧 𝐋𝐚𝐧𝐠𝐮𝐚𝐠𝐞 𝐓𝐫𝐚𝐧𝐬𝐥𝐚𝐭𝐨𝐫 | A web and mobile app that bridges communication gaps for the deaf and hard-of-hearing community by translating English and Arabic sign language into real-time text and speech, and converting spoken words into text during video calls.

csharp fastapi firebase-realtime-database flutter framer-motion javascript microsoft-dot-net-technologies numpy opencv python pytorch reactjs scikit-learn scss-framework sign-language-recognizer sign-language-translation sql-server tailwindcss webrtc websockets

Last synced: 07 Apr 2026

https://github.com/arssite/dirty-cleanflooringimageprocessingusingyolov5

Uses YOLOv5 to classify floor cleanliness into five categories based on visual cues. It includes an annotated dataset, trained model,& evaluation outputs. Code covers data preprocessing, training, & testing. A comparative analysis highlights YOLOv5's advantages over traditional methods, providing an efficient solution automated floor cleanliness.

deep-neural-networks github google-colab jupyter-notebook labelimg matplotlib-pyplot numpy-library opencv-python pandas-python pytorch scikit-learn tensorflow yolov5

Last synced: 10 Apr 2026

https://github.com/annasmustafadev/network-intrusion-detection-ml

Machine learning-based Intrusion Detection System (IDS) for classifying network traffic as normal or malicious using supervised learning techniques. Includes data preprocessing, feature selection, model training, and evaluation for improved cybersecurity intelligence.

anomaly-detection classification cyber-security data-science intrusion-detection machine-learning python scikit-learn supervised-learning

Last synced: 29 Apr 2026

https://github.com/chirindaopensource/measuring_economic_outlook_in_news

End-to-End Python implementation of Beck et al.'s (2025) economic sentiment analysis framework for constructing a high-frequency economic sentiment indicator using 1024-dimensional Jina embeddings and LLM-generated training data. Features L2-regularized classification and rigorous POOS econometric validation with DM-HAC tests for GDP forecasting.

claude-ai computational-economics econometrics financial-modeling jina-embeddings llm nlp privacy-preserving-ml python regularized-regression reproducible-research scikit-learn sentiment-analysis statsmodels synthetic-data tensorflow time-series-forecasting transformers weak-supervision

Last synced: 30 Apr 2026

https://github.com/anastasius21/fakenewsmodel

The repo contains the model for fake news detection and a streamlit app for its implementation.

fake-news-detection machine-learning nlp pandas python scikit-learn

Last synced: 05 May 2026

https://github.com/nk-works/creditflow-ai

CreditFlow AI predicts loan defaulters using Artificial Neural Networks (ANNs). This model uses historical loan data to predict the likelihood of default for new loan applications.

ai artificial-neural-networks deep-learning jupyter-notebook machine-learning matplotlib numpy pandas python scikit-learn seaborn tensorflow

Last synced: 24 Jun 2025

https://github.com/tomgorb/ds-utils

pre-processing of a DataFrame into a sparse matrix for model input

machine-learning preprocessing scikit-learn

Last synced: 16 May 2026

https://github.com/jay4codes/time-series-comparative-analysis

A comparative analysis of various time series models on JP Morgan's stock price data for stock price predictive analysis

scikit-learn tensorflow time-series-analysis

Last synced: 10 May 2026

https://github.com/miteshgupta07/zomato-restaurant-rating-predictor

A Zomato rating prediction app that uses machine learning to forecast restaurant ratings based on various factors, helping users make informed dining decisions.

flask machine-learning python scikit-learn

Last synced: 10 Apr 2026

https://github.com/ledsouza/nlp-article-classification

This project aims to develop a machine learning model capable of classifying news articles into different categories based on their titles. Two different word embedding models (CBOW and Skip-gram) are trained and used to vectorize the article titles. These vectorized representations are then used to train a Logistic Regression classifier.

gensim-word2vec natural-language-processing nlp nlp-machine-learning pandas python scikit-learn spacy spacy-nlp

Last synced: 11 Apr 2026

https://github.com/anuragkush2527/vibesync-3.0

Sentiment analysis in social media involves using natural language processing (NLP) and machine learning to analyze users' opinions, emotions, and attitudes expressed in posts, comments, and reviews. It helps in understanding public sentiment, monitoring trends, and making data-driven decisions.

expressjs fastapi mongodb nltk nodejs numpy pandas python reactjs scikit-learn sentiment-analysis tensorflow

Last synced: 16 Oct 2025

https://github.com/tsungtsetu122/datamining-cifar10-classification

Data mining project on CIFAR-10 extracted features, applying preprocessing, classification models, and evaluation techniques to improve classification performance.

matplotlib numpy pandas python scikit-learn

Last synced: 10 Apr 2026

https://github.com/parthapray/nlp_pipeline_openai

This repo contains nlp pipeline and openai API integration

gradio matplotlib networkx nltk openai rake-nltk scikit-learn seaborn spacy textblob textstat wordcloud

Last synced: 10 Apr 2026

https://github.com/hmasdev/ssbgm

Score Based Generative Model with scikit-learn

generative-model scikit-learn

Last synced: 17 May 2026

https://github.com/ankitjha2202/sentiment_analysis

A simple web application that performs sentiment analysis using logistic regression to predict whether a given text has a positive, negative or neutral sentiment.

classification logistic-regression nlp scikit-learn sentiment

Last synced: 28 Mar 2025

https://github.com/prasadhiremath1/movie-recommender-system

Select a movie and 5 similar movies are recommended from the tmdb dataset

pandas python3 scikit-learn streamlit

Last synced: 11 Apr 2026

https://github.com/zohaib-cheema/defacto

DeFacto is a machine learning-based tool that classifies fake news articles using a hybrid model built with Scikit-learn, TensorFlow, and Keras. The system analyzes social and political content to detect deception in news stories and social media posts, providing a reliable solution to address the growing issue of misinformation.

flask git keras numpy pandas r scikit-learn tensorflow

Last synced: 07 Apr 2026

https://github.com/crispengari/ml-web-applications

✔ This repository contains a series of machine learning web applications, using python.

artificial-intelligence deeplearning flask javascript machinelearning nueral-networks python scikit-learn sentiment-analysis webapplication

Last synced: 11 Apr 2026

https://github.com/mnitin-reddy/collaborative-filtering-based-recommendation-system

This project is a Book Recommendation System that uses two main approaches: Popularity-Based and Collaborative Filtering. It recommends top books based on their rating frequency and average ratings, and also provides personalized book suggestions by analyzing user interactions.

collaborative-filtering numpy pandas popularity-based-recommendation python recommendation-system scikit-learn

Last synced: 11 Apr 2026

https://github.com/akshaya13/recommendation-system

Content Based Recommendation system using tags!

nltk scikit-learn similarity-search tmdb-database

Last synced: 18 May 2026

https://github.com/therayyanshariff/cinereview

A Machine Learning web app for sentiment analysis, using a Scikit-learn NLP model with a custom-styled Streamlit UI.

machine-learning nlp python scikit-learn sentiment-analysis streamlit

Last synced: 04 May 2026

https://github.com/gamowy/urbansounds-classification

Classification of urban sounds using Tensorflow Keras

keras machine-learning python scikit-learn tensorflow

Last synced: 11 Apr 2026

https://github.com/eco786786/spotify-playlist-generator

This project uses machine learning to cluster songs by features like tempo, genre and mood with K-Means. It then creates personalised Spotify playlists based on these clusters, providing dynamic, genre specific track collections. Integrating the Spotify API, it enables users to explore new music within custom groupings.

flask matplotlib pandas python3 scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/archie-cm/churn-analysis-for-bank-customer

The objective from this project are to predict customer churn and provide recommendations to the business team

feature-engineering machine-learning python scikit-learn

Last synced: 11 Apr 2026

https://github.com/nicolas-giacomelli/modelo-regressao-logistica_dockerapi

Modelo de regressao logistica para classificar se uma fruta esta boa ou ruim baseado nas suas caracteristicas

docker fastapi optuna pandas plotly pydantic python3 scikit-learn seaborn uvicorn

Last synced: 11 Apr 2026

https://github.com/rosa-lpz/machine-learning-zoomcamp-2025

Machine Learning Zoomcamp 2025 from DataTalksClub. Based on repository: https://github.com/DataTalksClub/machine-learning-zoomcamp/tree/master

aws deep-learning docker flask kserve kubernetes machine-learning machine-learning-algorithms machine-learning-projects metrics-visualization neural-networks numpy pandas python scikit-learn tensorflow xgboost

Last synced: 06 Apr 2026

https://github.com/labex-labs/scikit-learn-for-beginners

This comprehensive course covers the fundamental concepts and practical techniques of Scikit-learn, the essential machine learning library in Python. Learn to build, train, and evaluate machine learning models using various algorithms and preprocessing techniques.

algorithms beginner-friendly classification clustering course data-science feature-engineering hands-on labex labs machine-learning model-evaluation preprocessing programming python python-programming regression scikit-learn supervised-learning unsupervised-learning

Last synced: 14 May 2026