An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/bala-1409/foreign-exchange-rate-time-series-data-science-project

This project uses time series analysis to forecast the exchange rate between the euro and the US dollar. It applies a variety of statistical techniques, such as ARIMA, to model the data and forecast the exchange rate.

data-analysis data-science data-visualization datapreprocessing eda exploratory-data-analysis forecasting machine-learning-algorithms model modelfitting predictive-modeling python3 scikit-learn statsmodels time-series time-series-analysis

Last synced: 07 May 2026

https://github.com/mrapp-ke/examplewisef1maximizer

A scikit-learn meta-estimator for multi-label classification that aims to maximize the example-wise F1 measure

machine-learning multilabel-classification scikit-learn

Last synced: 27 Apr 2026

https://github.com/mehuaniket/blog-classifier

Blog classifier using a scikit-learn random forest.

bag-of-words blog-classifier python scikit-learn

Last synced: 07 May 2026

https://github.com/otuemre/realtimenids

Real-time network intrusion detection system using Zeek flow logs and machine learning (IsolationForest). Detects threats with both signature-based and anomaly-based techniques trained on the CSE-CIC-IDS2018 dataset.

anomaly-detection cybersecurity flow-analysis isolation-forest machine-learning network-intrusion-detection nids scapy scikit-learn zeek

Last synced: 07 May 2026

https://github.com/antonio-f/find-duplicate-questions

Find duplicate questions on StackOverflow by their embeddings. From the Natural Language Processing course - Coursera's Advanced Machine Learning specialization.

cosine-similarity discounted-cumulative-gain embeddings gensim natural-language-processing nlp nltk scikit-learn starspace text-similarity word2vec

Last synced: 27 Apr 2026

https://github.com/tddschn/hack-ncsu-2024

ML and doc part of our Hack_NCState project, built in less than 1 day | Racial Bias in Criminal Justice Visualized: Code Black

bias machine-learning scikit-learn

Last synced: 08 May 2026

https://github.com/canayter/unsupervised-machine-learning

Utilizing Python and unsupervised learning to predict if cryptocurrencies are affected by 24-hour or 7-day price changes.

k-means-clustering python scikit-learn unsupervised-machine-learning

Last synced: 08 May 2026

https://github.com/cool-japan/sklears

A comprehensive machine learning library in Rust, inspired by scikit-learn's intuitive API and combining it with Rust's performance and safety guarantees.

ai artificial-intelligence machine-learning rust rust-lang scikit-learn scikitlearn-machine-learning

Last synced: 26 Apr 2026

https://github.com/ahmetcansolak/decision-tree-classifier-scikit-learn

A simple decision tree classifier example using scikit-learn

decision-tree-classifier python scikit-learn

Last synced: 28 Apr 2026

https://github.com/official-biswadeb941/clopimedi---your-healths-trusted-care

ClopiMedi is an AI-driven healthcare application that simplifies doctor appointment bookings, offering personalized recommendations based on medical conditions to enhance patient-provider connections.

adam ai flask flask-api flask-api-backend full-stack-web-development joblib machine-learning scikit-learn tensorflow

Last synced: 28 Apr 2026

https://github.com/nirmalyabag20/breast-cancer-prediction-using-machine-learning

This project leverages machine learning to classify breast cancer as malignant or benign based on tumor characteristics. By applying and evaluating multiple algorithms, the model achieves high accuracy, demonstrating the practical application of data-driven solutions in medical diagnostics.

logistic-regression matplotlib numpy pandas python scikit-learn seaborn

Last synced: 12 Feb 2026

https://github.com/kritimbist/365-days-of-github-challenge-ai-machine-learning

This repository is part of my 365 Days Challenge: AI × Machine learning, where I combine my passion for Machine Learning 🤖 to learn, build, and document projects every single day for one year.

data-science data-visualization deep-learning machine-learning matplotlib numpy python scikit-learn

Last synced: 28 Apr 2026

https://github.com/francescopaolol/logisticregression

Predicting survival on the Titanic to get familiar with ML basics.

jupyter-notebook kaggle logistic-regression machine-learning ml pandas scikit-learn

Last synced: 16 Apr 2026

https://github.com/aakanksha1406/fake-news-classifier

Identifies when an article might be fake news.

keras lstm lstm-neural-networks nltk python scikit-learn tensorflow

Last synced: 13 Feb 2026

https://github.com/adithaker/falafel

🤖 A from-scratch implementation of a small-scale federated learning application.

cli-app distributed-systems federated-learning logistic-regression python scikit-learn

Last synced: 28 Apr 2026

https://github.com/loong64/onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

ai-framework deep-learning hardware-acceleration loong64 loongarch64 machine-learning neural-networks onnx pytorch scikit-learn tensorflow

Last synced: 09 May 2026

https://github.com/lakshitalearning/churninsight

Customer Churn prediction means knowing which customers are likely to leave or unsubscribe from your service.

churn-prediction data-science flask google-colab machine-learning predictive-analytics python scikit-learn user-retention web-development

Last synced: 09 May 2026

https://github.com/davidcamilo0710/hate_speech_analysis

Hate speech detection using NLP for linguistic analysis and machine learning (XGBoost) for classification with Python and SpaCy.

hate-speech-detection linguistic-analysis nlp scikit-learn spacy xgboost

Last synced: 09 May 2026

https://github.com/bhuvaneshwarguttula/student-performance-indicator

To understand and predict how the student's performance (test scores) is affected by the other variables (Gender, Ethnicity, Parental level of education, Lunch, Test preparation course).

exploratory-data-analysis machine-learning pandas python scikit-learn student-performance-analysis

Last synced: 07 Mar 2026

https://github.com/vishal-038/attendance_by_face_recogination

This project is a face recognition-based attendance system that uses Python, OpenCV, Scikit-learn, Streamlit, and various other libraries like Pandas, Numpy, Datetime, and OS for different functionalities. It enables adding faces to the database, taking attendance based on face recognition, and showing live attendance through a web interface built

opencv python scikit-learn

Last synced: 14 Feb 2026

https://github.com/hq969/customer-churn-prediction-with-hyperparameter-optimization-and-model-deployment

A complete end-to-end machine learning project that predicts customer churn using the Telco dataset. It includes data preprocessing, exploratory data analysis (EDA), model training with Random Forest, hyperparameter tuning, evaluation, and deployment via a Flask API.

flask numpy pandas python scikit-learn xgboost

Last synced: 02 Apr 2026

https://github.com/rakibhhridoy/supportvectormachinein-medical

Support vector machines in medical disease detection. An SVM can fit both linear and non-linear data through its choice of kernel. In medical applications we focus on precision or recall rather than accuracy.

diabetes-prediction machine-learning medical precision-medicine recall-precision scikit-learn support-vector-machines svm

Last synced: 29 Apr 2026

https://github.com/akhil888binoy/intelligent-supplychain-management-system

Blockchain-powered supply chain management system with ML-driven sales prediction. Streamlines supplier-employee transactions and inventory management. Built with MERN stack, Solidity, and Flask.

blockchain decentralized-payments ethereum express flask foundry hackathon-project inventory-management machine-learning mern-stack mongodb nodejs python react sales-prediction scikit-learn smart-contracts solidity supply-chain-management wagmi

Last synced: 09 Oct 2025

https://github.com/RickContreras/StudentPerformancePredictionSaberPro

Classification model for predicting student performance on the Saber Pro tests in Colombia. Includes exploratory data analysis, preprocessing, and machine learning models.

classification colombia data-analysis data-science education educational-assessment exploratory-data-analysis jupyter-notebook machine-learning python saber-pro scikit-learn student-performance

Last synced: 24 Oct 2025

https://github.com/andresmg07/real-time-sign-language-translator

AI-driven real-time American Sign Language translator. Implemented leveraging Support Vector Machines (SVM), OpenCV library and MediaPipe hands module.

ai computer-vision machine-learning mediapipe opencv pattern-recognition scikit-learn support-vector-machines

Last synced: 16 Apr 2026

https://github.com/jasper-koops/easy-gscv

This library allows you to quickly train machine learning classifiers by automatically splitting the data set and using both grid search and cross validation in the training process.

classification machine-learning python3 scikit-learn

Last synced: 14 Feb 2026

https://github.com/siam29/ensemble-majority-voting-hard

In this project, we implemented an ensemble learning approach using majority voting (hard voting) with five machine learning classifiers: DT, RF, XGBC, ANN, and KNN. The ensemble model achieved an impressive accuracy score of 99.95% and an F1 score of 85.51%.

credit-card-fraud ensemble-learning machine-learning matplotlib pandas scikit-learn

Last synced: 09 May 2026

https://github.com/garcane/Income-Prediction-ML

This is a machine learning project aimed at predicting whether an individual's annual income exceeds $50,000 based on their demographic and personal information.

data data-science machine-learning ml numpy pandas python random-forest scikit-learn

Last synced: 24 Oct 2025

https://github.com/t-abishek/embedded-intent-classifier

A production-grade FastAPI application that uses sentence embeddings to classify user prompts into 4 categories. Built using Python, BGE SentenceTransformer, Scikit-learn, and FastAPI.

classifier embedded huggingface pandas scikit-learn transformer

Last synced: 10 May 2026

https://github.com/zachpinto/xc-rankings-predictions

Applied ML Project predicting cross-country team rankings based on individual-level performances

random-forest scikit-learn

Last synced: 29 Apr 2026

https://github.com/ayyucedemirbas/solar_power_elasticnet

ElasticNet Linear Regression on Solar Power Generation

elasticnet-regression scikit-learn skops tabular-regression

Last synced: 29 Apr 2026

https://github.com/aryansk/customer-segmentation-analysis

Advanced customer segmentation project using K-Means clustering to analyze customer behavior based on annual income, spending score, and age.

elbow-method exploratory-data-analysis machine-learning machine-learning-algorithms python scikit-learn sentiment-analysis sentiment-classification

Last synced: 29 Apr 2026

https://github.com/bestmahdi2/uni__dataminningstackoverflowproject

A university project for a data mining course, working with StackOverflow website data in Python.

cart csv data-mining logistic-regression matplotlib mlp naive-bayes nltk numpy pandas python scikit-learn scipy seaborn stackoverflow svc textblob tqdm xgboost

Last synced: 16 Feb 2026

https://github.com/njorogepaul-moghul/iris-flower-classification

This project predicts the species of an Iris flower (Setosa, Versicolor, Virginica) based on its sepal and petal measurements. We trained and evaluated multiple ML models, with Logistic Regression performing best at 93% accuracy. Finally, we deployed the app on Streamlit: https://irisflowerapp-ripwlmfmctrzqphjapj97t.streamlit.app/

iris-classification jupyter-notebook logistic-regression machine-learning python random-forest-classifier scikit-learn

Last synced: 29 Apr 2026

https://github.com/mijisu0103/data-driven-decision-making-risk-analysis

This repository contains my coursework project for ECS7005P - Risk and Decision-Making for Data Science and AI. It applies probabilistic models, Bayesian networks, and decision analysis using Python and PyAgrum to evaluate risk and optimise decision-making under uncertainty.

machine-learning pandas probability-and-statistics pyagrum python quantitative-decision-making risk-assessment scikit-learn

Last synced: 10 May 2026

https://github.com/kshula/cipatala-hospital-management-system

Cipatala Hospital management system powered by AI and machine learning, built with Django and Bootstrap

bootstrap django django-project html-css-javascript python scientific-computing scikit-learn tensorflow

Last synced: 01 Mar 2026

https://github.com/neelanjan-chakraborty/custoclarity

CUSTO CLARITY is a customer segmentation model built in Python. Using clustering on real retail datasets, it identifies 5 customer segments that unlocked strategic retail partnerships. Powered by scikit-learn, pandas, seaborn, and Matplotlib.

clustering-algorithm clustering-algorithms customer-analytics customer-segmentation data-visualization kmeans kmeans-clustering pandas python scikit-learn

Last synced: 11 May 2026

https://github.com/pngo1997/astrophysical-objects-classification

Project applies machine learning techniques to classify astrophysical objects using observational data from the Large Synoptic Survey Telescope (LSST).

adaptive-boosting-algorithm classification down-sampling gradient-boosting keras machine-learning neural-network python random-forest scikit-learn supervised-learning tensorflow time-series

Last synced: 10 May 2026

https://github.com/vaibhavs10/learn-ml

Modified notebooks (single) from kaggle.com/learn with added nuances

decision-trees machine-learning pandas random-forest scikit-learn

Last synced: 11 May 2026

https://github.com/hasanulmukit/spam-email-classifier

This is a Spam Email Classifier built using Python and Streamlit. It uses a pre-trained model to predict whether an email is Spam or Not Spam. The app also provides the probability scores for both categories, enhancing transparency and reliability of the prediction.

email-classifier machine-learning nlp python scikit-learn spam-detection streamlit text-classification

Last synced: 11 May 2026

https://github.com/francescopaolol/decisiontree

Classifying iris plants into three species using this classic dataset.

decision-tree-classifier jupyter-notebook kaggle machine-learning ml pandas scikit-learn

Last synced: 16 Apr 2026

https://github.com/aditya-ranjan1234/interactive-salary-prediction-with-machine-learning

A Streamlit web application for exploring the UCI Census Income dataset, training machine learning models, and predicting employee salaries.

data-science machine-learning prediction python scikit-learn streamlit xgboost

Last synced: 29 Apr 2026

https://github.com/aravindnathan02/credit-card-fraud-detection

This repository contains a Machine Learning project aimed at detecting fraudulent credit card transactions. The goal is to build a reliable and efficient model that minimizes false positives and false negatives, ensuring financial safety and improving fraud detection capabilities.

classification-model fraud-detection logistic-regression machine-learning python random-forest scikit-learn

Last synced: 11 May 2026

https://github.com/emmanuelezenwere/aind-aiprojects

Portfolio of AI projects developed during my Udacity AI Nanodegree, covering Planning AI, Constraint Satisfaction, Hidden Markov Models, and Search algorithms.

alpha-beta-pruning astar-algorithm bellman-equation breadth-first-search constraint-satisfaction-problem depth-first-search hidden-markov-model kalman-filter minmax-algorithm networkx nltk numpy pandas scikit-learn scipy sympy

Last synced: 29 Apr 2026

https://github.com/texnoforge/texnomagic

TexnoMagic library for digital Magic

gmm magic numpy python recognition scikit-learn scipy

Last synced: 03 Mar 2026

https://github.com/elifftosunn/bert-bank-model

It is a Turkish BERT-based model that will analyze people's bank complaints and classify them according to one of eight categories.

countvectorizer doc2vec f1-score huggingface huggingface-transformer huggingface-transformers nlp nltk python3 scikit-learn stopwords tagged tfidf-transformer train-test-split word-tokenizer wordnetlemmatizer

Last synced: 12 May 2026

https://github.com/gigdevelopment10/neuralfunk

A Machine learning resource library for funky ML-Learners

algorithm keras machine-learning optimization-algorithms py-torch python scikit-learn tensorflow

Last synced: 29 Apr 2026

https://github.com/thevarunsharma/extracting-dominant-colors

A web application that extracts the dominant colors from an image using K-means clustering.

flask-application k-means-clustering machine-learning python scikit-learn unsupervised-learning

Last synced: 12 May 2026

https://github.com/alessiochen/setiment-analysis-ai-project

Application of sentiment analysis for the Artificial Intelligence class at UNIFI

ai andrew dataset movie-reviews scikit-learn sentiment-analysis

Last synced: 12 May 2026

https://github.com/aliy98/navigation-sensor-data-classification

Classification of a Navigation Robot Sensor Dataset Using SVM, Random Forest and Neural Network

artificial-neural-networks keras multiclass-classification random-forest scikit-learn scitos-g5 support-vector-machines

Last synced: 13 May 2026

https://github.com/ultrasage-danz/scikit-learn-ml

Machine Learning with scikit-learn by Data School

ai data data-school machine-learning macos ml scikit-learn ultrasage-dan

Last synced: 13 May 2026

https://github.com/alam025/customer-churn-prediction

🎯 Predict customer churn with 96%+ accuracy using Random Forest ML. Beautiful visualizations, production-ready code, and real business impact. Save revenue before customers leave! 🚀

churn-prediction classification customer-analytics customer-churn customer-retention data-science machine-learning pandas predictive-analytics python random-forest scikit-learn

Last synced: 11 Jun 2026

https://github.com/dhavaltaunk08/gender-classification

I did this project during my internship at IIT Guwahati. It aimed to perform gender classification in video streaming.

deep-learning librosa opencv-python python scikit-learn

Last synced: 14 May 2026

https://github.com/antoniskl/amsterdam-metro-crowdedness-prediction

The aim of this full-stack project is to predict with RandomForest and visualize crowdedness for metro stations of Amsterdam by using external factors.

amsterdam covid-19 crowded-areas dash full-stack metro prediction-model python random-forest regression scikit-learn ticketmaster-api

Last synced: 14 May 2026

https://github.com/anishshinde01/machine-learning-exercises

Python implementations of machine learning, statistics, and mathematical foundations.

linear-algebra machine-learning machine-learning-algorithms matplotlib numerical-analysis numpy python scikit-learn scipy statistics

Last synced: 11 Jun 2026

https://github.com/the-developer-306/house-price-predictor

House Price Predictor: Harnessing machine learning algorithms to forecast housing prices in Boston, empowering buyers and sellers with accurate predictions based on key factors like location, crime rate, rooms, accessibility, and more.

csv ipynb-jupyter-notebook joblib matplotlib numpy pandas python scikit-learn

Last synced: 23 Feb 2026

https://github.com/pankajarm/tabular_ml_toolkit

A helper library to jumpstart your machine learning project based on tabular or structured data.

data-science feature-engineering hyperparameter-tuning machine-learning parallelism python scikit-learn structured-data tabular xgboost

Last synced: 19 Jan 2026

https://github.com/ricardouchub/colab-ml-pipeline-agent

A Colab agent that, given a CSV dataset, plans and executes an end-to-end machine learning pipeline: initial analysis, preprocessing, training with Scikit-Learn, and automatic reporting with evalcards.

agent ai deepseek evalcards langchain llm ml pipeline-agent scikit-learn

Last synced: 16 Apr 2026

https://github.com/nordszamora/sentiment

This ML project predicts whether a text's sentiment is positive, negative, or neutral. It uses a Reddit dataset from Kaggle to train the model and make predictions on new, unseen data.

django joblib kaggle naive-bayes-classifier pandas python redditdata scikit-learn

Last synced: 11 Apr 2026

https://github.com/tharindanimnajith/deep-learning-spam-detection

Deep Learning classifiers to detect spam SMS messages - LSTM Model, DenseNet CNN Models - NLP, Python, Jupyter Notebook, Tensorflow, Keras, Numpy, Pandas, Matplotlib, Scikit-Learn

deep-learning densenet keras lstm nlp python3 scikit-learn tensorflow

Last synced: 05 Apr 2026

https://github.com/ki3mono/naive_bayes_classifier

This project implements Naive Bayes Classifiers for two data types: Multinomial Naive Bayes Classifier and Gaussian Naive Bayes Classifier. Developed as part of the Probability Theory and Statistics course in the second year of Computer Science at AGH University of Krakow.

iris-dataset mushroom-dataset naive-bayes-classifier numpy python scikit-learn

Last synced: 06 May 2026

https://github.com/aryansingla45/cab-fare-prediction

This repository contains a Cab Fare Prediction Web App that uses machine learning to predict fares based on distance, time, and location, with Google Maps API integration for route and fare calculations. It features a user-friendly interface and secure authentication, powered by a Kaggle dataset.

flask-application google-maps-api html-css-javascript machine-learning machine-learning-algorithms pandas-numpy python scikit-learn weather-api

Last synced: 11 May 2026

https://github.com/tlapanco/knn-project

Project for the Intelligent Systems course, making use of KNN oversampling.

jupyter-notebook knn pandas python scikit-learn smote

Last synced: 09 Apr 2026

https://github.com/gokulgowthams/clickstream-customer-conversion

Analyzes clickstream data from an e-commerce platform to predict customer conversions, estimate potential revenue, and segment users for personalized marketing strategies. By leveraging machine learning techniques, the project enhances decision-making for businesses seeking to optimize user engagement and sales.

data-preprocessing feature-engineering machine-learning matplotlib model-deployment numpy pandas pipeline python scikit-learn seaborn streamlit-web-application tensorflow xgboost

Last synced: 07 Apr 2026

https://github.com/gokularaman-c/ev-charging-log-anomaly-detection

EV charging log anomaly detection using Isolation Forest, engineered telemetry features, and a CLI inference pipeline.

anomaly-detection ev-charging feature-engineering isolation-forest machine-learning mlops python scikit-learn time-series

Last synced: 23 May 2026

https://github.com/docsallover/spam-detection

Building a Spam Filter with Python: Using Machine Learning to Combat Spam

datascience flask jinja2 machine-learning numpy numpy-library pandas pandas-python python python3 scikit-learn

Last synced: 09 Apr 2026

https://github.com/rusiruchapana/blood-group-prediction

Creating a machine learning project to predict blood groups from fingerprint patterns

asp-net-web-api keras matplotlib numpy opencv-python pandas pillow scikit-learn tensorflow

Last synced: 11 Apr 2026

https://github.com/rs2416/Detecting_Social_Anxiety

This repository contains the full dataset and code needed to recreate the classification models and reproduce the results within this paper: https://formative.jmir.org/2021/10/e32656/

jupyter-notebook machine-learning python scikit-learn social-anxiety

Last synced: 07 May 2025

https://github.com/dinhanhx/determination

Scripts to set the global random state for some machine learning frameworks

determination deterministic keras pytorch randomness scikit-learn tensorflow2

Last synced: 12 Apr 2026

https://github.com/farrajota/kaggle_titanic

My solutions to the "Titanic: Machine Learning from Disaster" kaggle competition

docker docker-compose kaggle kaggle-competition kaggle-titanic notebook pyspark python scikit-learn

Last synced: 11 Apr 2026

https://github.com/haloapping/ml-workflow

Machine learning workflow template.

mahine-learning numpy pandas python3 scikit-learn

Last synced: 11 Apr 2026

https://github.com/jersongb22/computervision

Links to my repositories with a wide variety of Computer Vision models using CNNs, Transfer Learning, and Vision Transformer with TensorFlow, PyTorch, Hugging Face and Ultralytics.

cnn computer-vision convnextv2 efficientnetv2 hugging-face image-captioning image-classification image-segmentation lenet-5 object-detection opencv plotly python pytorch scikit-learn tensorflow ultralytics video-classification vision-transformer yolo11

Last synced: 12 Apr 2026

https://github.com/nordszamora/predictive_lung_cancer

This lung cancer predictive ML project is used to predict cancer at low cost, based on data about smoking intake and common symptoms.

bootstrap django django-rest-framework python reactjs rest-api scikit-learn vite

Last synced: 11 Apr 2026

https://github.com/upul/chocolate-quality-analysis

This repository contains a Jupyter notebook that describes how to use basic machine learning tools such as Scikit-Learn, Pandas, and NumPy for building models.

machine-learning numpy pandas predictive-analytics scikit-learn

Last synced: 04 May 2026