An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/jesly-joji/house-price-prediction

House Price Prediction using Linear Regression with Scikit-learn and Flask

flask regression scikit-learn

Last synced: 03 Jan 2026

https://github.com/priboy313/pandasflow

A set of custom python modules for friendly workflow on pandas

catboost data-analysis data-science pandas phik python scikit-learn shap

Last synced: 20 Jan 2026

https://github.com/its-maneeshk/fake-product-detection-system

The Fake Product Review Detection System is a machine learning-powered web application designed to analyze and detect fake reviews on eCommerce platforms. It helps users identify whether a product has genuine or manipulated reviews by leveraging Natural Language Processing (NLP) and supervised learning models.

api beautifulsoup4 fetch-api flask html-css-javascript joblib nlp-machine-learning numpy pandas python reactjs requests scikit-learn

Last synced: 05 Mar 2025

https://github.com/ksasi/boston_housing

Predicting Boston Housing Prices - Udacity

machine-learning numpy pandas python scikit-learn

Last synced: 08 Apr 2026

https://github.com/pkini2002/hpe_cty

Repository to maintain the learnings of the technologies used for the CTY'23 Project Work provided by HPE

computer-networks docker docker-container linux python scikit-learn swarm-learning ubuntu

Last synced: 07 Apr 2026

https://github.com/williyam-m/movie-recommendation-system

Developed a web app with a cosine similarity machine learning model for personalized recommendations based on user history, likes, bookmarks, and activity. Implemented user auth and CRUD operations for movies.

django machine-learning numpy pandas prediction-model python scikit-learn

Last synced: 10 Apr 2026

https://github.com/grachale/predict_titanik

Predicting the survival of Titanic passengers (binary classification) with usage of decision tree and KNN from scikit-learn.

classification decision-tree-classifier knn-classifier matplotlib pandas python scikit-learn titanic-survival-prediction

Last synced: 12 Apr 2026

https://github.com/siam29/exploring-explainable-ai-demystifying-dt-rf-knn-xgbc

Implemented XAI techniques to enhance transparency in fraud detection models. I employed techniques such as SHAP, LIME on DT, RF, XGBC, and KNN to offer lucid explanations for transactions that were flagged.

machine-learning matplotlib pandas scikit-learn xai

Last synced: 15 Apr 2026

https://github.com/alessandrosocc/machine-learning-project-2022

Final project for the Machine Learning course at the University of Cagliari in 2022. Analysis of a dataset, use of Machine Learning techniques with Oversampling and Undersampling techniques. Final report with the results obtained.

imblearn machine-learning matplotlib-pyplot oversampling pandas scikit-learn spambase-dataset undersampling

Last synced: 18 Jan 2026

https://github.com/viveksapkal2793/advertisement-response-analysis

This project analyzes advertisement responses using a Django backend and a Vite+React frontend. It includes scripts to load, clean, and transform data, which are executed within Docker containers. Data is stored in a MongoDB database, and the project can be run with or without Docker by adjusting the MongoDB connection strings.

advertisement advertisement-analysis container-image containerization django docker machine-learning mongodb react scikit-learn vite

Last synced: 23 Sep 2025

https://github.com/amirjahantab/iris_classification

This project analyzes the famous Iris dataset using various machine learning techniques. The goal is to classify the iris flowers into three species: Setosa, Versicolor, and Virginica based on the features provided in the dataset.

classification data-science machine-learning scikit-learn

Last synced: 16 May 2026

https://github.com/nikhiljsk/generic_regression_algo

A Python template to evaluate different regression models on a dataset. Includes metrics to cross-compare models on the data. Assumes the data to be numeric.

dataset generic machine-learning prediction python3 regression scikit-learn seaborn supervised-learning

Last synced: 01 May 2026

https://github.com/aarryasutar/logistic_regression_on_age_prediction

This code evaluates the performance of a logistic regression model on age prediction using various features to predict a binary target variable, calculating metrics to determine the performance. It evaluates the comparison, identifies favorable features, and visualizes the ROC-AUC curve to determine the best model performance.

accuracy-score confusion-matrix f1-score feature-selection logistic-regression model-training numpy pandas precision recall rmse roc-auc-curve scikit-learn visualization

Last synced: 20 Jan 2026

https://github.com/catlikeflyer/rsp-recognition

A computer vision project to recognize thumbs up

machine-learning mediapipe-hands python scikit-learn

Last synced: 16 May 2026

https://github.com/gmontamat/quora-question-pairs

Code for the Kaggle competition "Quora Question Pairs"

kaggle-competition quora-question-pairs scikit-learn spell-checker xgboost

Last synced: 02 Sep 2025

https://github.com/lasithaamarasinghe/stock-market-price-prediction

This ML model predicts the price of the S&P500 Stock Market Index using RandomForestClassifier

jupyter-notebook machine-learning pandas python random-forest-classifier scikit-learn sp500 stock-market-price-prediction yfinance

Last synced: 10 Apr 2026

https://github.com/mohit1106/fraud-detection-in-financial-transactions

an anomaly detection system on 284,807 transactions, achieving an AUC of ~0.972 with CNNs and Autoencoders.

autoencoders cnn-model isolation-forest keras python scikit-learn tensorflow

Last synced: 10 Apr 2026

https://github.com/aahnik/gdsc-ml-ds-bootcamp-2023

This repo contains files given by my seniors as well as assignments and final project done by me during the bootcamp.

data-science machine-learning ml numpy pandas python3 scikit-learn

Last synced: 28 Oct 2025

https://github.com/ishutak/disease_prediction

An AI-powered disease prediction system that uses machine learning to predict diseases based on symptoms. The system employs an ensemble of models including Random Forest and Neural Networks to provide accurate predictions with confidence levels.

css3 htlm5 javascript jquery numpy pandas pytorch scikit-learn select2

Last synced: 11 Apr 2026

https://github.com/lucasfrag/dengue-prediction-knc

Projeto desenvolvido para realizar previsão de casos de dengue usando o algoritmo de classificação KNeighborsClassifier.

data-science knearest-neighbor-classifier machine-learning pandas python scikit-learn

Last synced: 11 Mar 2025

https://github.com/elazzouzihassan/si-fraud-detection-prototype

Système de Détection des Fraudes avec Python (Prototype).

googlecolab matplotlib numpy pandas python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/rexsimiloluwah/fastapi-ml-apps

Machine learning apps built with FastAPI

docker fastapi machine-learning python scikit-learn tensorflow

Last synced: 05 Apr 2026

https://github.com/chitralputhran/tutorial-sklearn-columntransformer

ColumnTransformer was introduced in scikit-learn from version 0.20 onwards. The notebook file contains a quick and easy tutorial on ColumnTransformer to get you started.

scikit-learn

Last synced: 17 May 2026

https://github.com/lasithaamarasinghe/movie-recommender-system

This ML model recommends movies that may align with the user's preferences based on TF-IDF matrix.

jupyter-notebook machine-learning movie-recommendation movielens-dataset numpy pandas python regex scikit-learn tf-idf-vectorizer

Last synced: 12 Apr 2026

https://github.com/francescopaolol/titaniccompetition

It's my first kaggle competition about predict survival on the Titanic and get familiar with ML basics

jupyter-notebook kaggle-competition machine-learning ml pandas scikit-learn

Last synced: 17 Apr 2026

https://github.com/suundumused/weather-forecast-ai-example

The project scope is a weather forecasting model based on behavioral analysis of the last 33 hours (hour-by-hour forecast) with Random Forest Classifier. The program automatically saves and loads the last trained model for prediction.

ai artificial-intelligence artificial-intelligence-algorithms artificial-intelligence-projects artificialintelligence scikit scikit-learn scikit-learn-python scikitlearn scikitlearn-machine-learning weather weather-conditions weather-forecast weather-information

Last synced: 20 May 2026

https://github.com/samarpan-rai/serveitlearn

It creates an extremely thin layer around FastAPI library which allows you to create an end point super fast.

fastapi inference ml pypi scikit-learn

Last synced: 12 Apr 2026

https://github.com/chaitanya1436/student_performance_analysis

A project focused on analyzing college student performance using data on department, assessment scores, and performance labels. Implemented in Google Colab, the analysis includes data preprocessing, feature scaling, and exploratory data analysis to uncover insights and prepare the data for further analysis or modeling.

ata-preprocessing data-preparation exploratory-data-analysis feature-scaling google-colab numpy pandas scikit-learn

Last synced: 07 Feb 2026

https://github.com/gokulgowthams/clickstream-customer-conversion

Analyzes clickstream data from an e-commerce platform to predict customer conversions, estimate potential revenue, and segment users for personalized marketing strategies. By leveraging machine learning techniques, the project enhances decision-making for businesses seeking to optimize user engagement and sales.

data-preprocessing feature-engineering machine-learning matplotlib model-deployment numpy pandas pipeline python scikit-learn seaborn streamlit-web-application tensorflow xgboost

Last synced: 07 Apr 2026

https://github.com/kingabzpro/mlops-with-jenkins

From data ingestion to deploying the model using Jenkins.

classification fastapi jenkins mlops scikit-learn

Last synced: 13 Feb 2026

https://gitlab.com/hylkedonker/statkit

Statistics for sci-kit learn.

machine learning scikit-learn statistics

Last synced: 01 Nov 2025

https://github.com/tlapanco/knn-project

Projecto para la materia de Sistemas inteligentes haciendo uso de KNN oversampling.

jupyter-notebook knn pandas python scikit-learn smote

Last synced: 09 Apr 2026

https://github.com/omar-karimov/ai4renewals-gendes

This project adopts a modular Python architecture within an MLOps framework to enhance subscription renewal predictions, utilizing FastAPI and MongoDB with AWS integration (S3, ECR, EC2). Docker ensures seamless deployment, and GitHub Actions automate the CI/CD workflows. Evidently AI monitors drift to guarantee predictive accuracy and reliability.

api aws deployment docker drift-detection ec2 ecr evidently fastapi github-actions mangodb mlops modularization pipeline s3 scikit-learn workflow yaml-configuration

Last synced: 15 Apr 2026

https://github.com/gokularaman-c/ev-charging-log-anomaly-detection

EV charging log anomaly detection using Isolation Forest, engineered telemetry features, and a CLI inference pipeline.

anomaly-detection ev-charging feature-engineering isolation-forest machine-learning mlops python scikit-learn time-series

Last synced: 23 May 2026

https://github.com/docsallover/spam-detection

Building a Spam Filter with Python: Using Machine Learning to Combat Spam

datascience flask jinja2 machine-learning numpy numpy-library pandas pandas-python python python3 scikit-learn

Last synced: 09 Apr 2026

https://github.com/sanjeetbth7/krishi-nexus

Krishi Nexus revolutionizes agriculture by delivering data-driven crop recommendations via advanced machine learning, maximizing yields and ensuring sustainable practices. This platform empowers farmers with actionable insights, optimizing investments and promoting informed decision-making for a prosperous and eco-conscious future.

api classification expressjs reactjs scikit-learn supervised-learning tail

Last synced: 18 Feb 2026

https://github.com/gaurangdave/house_price_predictions

Machine Learning Application to predict House Prices

hands-on learning-by-doing machine-learning numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/satvikpraveen/fashionmnist-analysis

A comprehensive analysis of the Fashion MNIST dataset using PyTorch. Covers data preparation, EDA, baseline modeling, and fine-tuning CNNs like ResNet. Includes modular folders for data, notebooks, and results. Features CSV exports, visualizations, metrics comparison, and a requirements.txt for easy setup. Ideal for ML workflow exploration.

computer-vision confusion-matrix convolutional-neural-networks deep-learning-algorithms exploratory-data-analysis fashion-mnist-dataset fine-tuning hyperparameter-tuning image-classification jupyter-notebook machine-learning-algorithms matplotlib-pyplot model-evaluation numpy pandas pytorch resnet-18 scikit-learn seaborn vgg

Last synced: 22 Apr 2025

https://github.com/jersongb22/computervision

Links to my repositories with a wide variety of Computer Vision models using CNNs, Transfer Learning, and Vision Transformer with TensorFlow, PyTorch, Hugging Face and Ultralytics.

cnn computer-vision convnextv2 efficientnetv2 hugging-face image-captioning image-classification image-segmentation lenet-5 object-detection opencv plotly python pytorch scikit-learn tensorflow ultralytics video-classification vision-transformer yolo11

Last synced: 12 Apr 2026

https://github.com/f-aguzzi/chemfusekit

Chemometrics library for data fusion, model training and prediction of data from multiple sensor sources.

chemometrics datafusion knn lda pca plsda scikit-learn svm

Last synced: 20 Jan 2026

https://github.com/ishanoshada/matplot3dex

A Matplotlib 3D Extension package for enhanced data visualization

data data-science matplotlib python-packages scikit-learn

Last synced: 05 Jan 2026

https://github.com/victorkiosh/fake-news-detection

Detecting fake news using NLP and machine learning (Logistic Regression, Random Forest, XGBoost)

data-science fake-news-detection machine-learning nlp scikit-learn xgboost

Last synced: 18 May 2026

https://github.com/upul/chocolate-quality-analysis

This repository contains a Jupiter notebook which describes how to use basic machine learning tools such Scikit-Learning, Pandas, and Numpy for buiding models.

machine-learning numpy pandas predictive-analytics scikit-learn

Last synced: 04 May 2026

https://github.com/gperdrizet/ensembleset

Ensemble dataset generator for tabular data prediction and modeling projects.

classification ensemble feature-engineering machine-learning regression scikit-learn

Last synced: 07 Mar 2026

https://github.com/arnabsaha7/piezoelectric-roads-implementation

This repository implements a piezoelectric road system in Python, leveraging Pandas, NumPy, scikit-learn, Matplotlib, and Seaborn. The requirements.txt file ensures version consistency for reproducibility.

pandas-python piezoelectric roads scikit-learn

Last synced: 06 Jan 2026

https://github.com/shreeparab1890/movie-recommender-system

This notebook is trying to build a model which will recommend the movie based on given movie and genre. In this we use Popularity Based Recommendation, Content Based Recommendation and Collaborative Filtering based Recommendation.

bag-of-words cosine-similarity matplotlib numpy pandas python scikit-learn sklearn vectorization

Last synced: 09 Apr 2026

https://github.com/vimal0156/ruaroa-ai

🧙‍♂️ Zero-Code Machine Learning Wizard - Transform ideas into intelligent solutions without writing code. AI-powered ML pipeline automation with interactive web interface.

ai-agents ai-assistant artificial-intelligence automated-machine-learning code-generation data-analysis data-science deep-learning jupyter machine-learning machine-learning-pipeline neural-networks no-code openai python scikit-learn streamlit visualization

Last synced: 09 Apr 2026

https://github.com/saro0307/pre-doctor-ai-model

Pre-Doctor is an AI-driven health advisor using sci-kit-learn, offering quick medical advice based on user-input symptoms, making healthcare accessible and user-friendly. Utilizing Flask and pyttsx3, it seamlessly integrates machine learning for informed well-being.

artificial-intelligence css flask generative-ai generative-model html machine-learning python reinforcement-learning scikit-learn

Last synced: 07 Apr 2026

https://github.com/aryank1511/wattwise

WattWise is an innovative energy-saving app that uses an Arduino-powered device to monitor and predict household electricity usage and bills in real-time.

arduino docker flask machine-learning mqtt nextjs scikit-learn

Last synced: 04 Feb 2026

https://github.com/daniel1kp/diamond-price-predictor

💎Diamond Price Predictor is an user-friendly web application that simplifies the analysis and prediction of diamond prices.

altair pandas python scikit-learn streamlit

Last synced: 13 Apr 2026

https://github.com/metriccoders/metriccoders_datasets

This is the Metric Coders repository containing all the datasets for machine learning.

data datasets machine-learning natural-language-processing scikit-learn

Last synced: 08 Apr 2025

https://github.com/mariamabidi/pinn-based-flow-prediction

This repository contains code and experiments for predicting 3D aerodynamic flow around car geometries using Physics-Informed Neural Networks (PINNs) and for analyzing flow features via autoencoder-based clustering.

computer-vision machine-learning neural-network numpy pytorch pyvista scikit-learn

Last synced: 05 Aug 2025

https://github.com/achuth-0908/hemoguard-anemia-predictor

A Web App incorporated with a Gradient Boosting Classifier Model, to predict Anemia with given data.

css flask html matplotlib numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/andrewquijano/operating_systems_ii

Creating an Intrusion Detection System

ids kdd99 nsl-kdd-dataset scikit-learn

Last synced: 17 Jan 2026

https://github.com/bacross/datamunger

python package for handling nan's and outliers

data data-frame datamunger knn nan outliers python scikit-learn

Last synced: 17 May 2026

https://github.com/mehmoodulhaq570/machine-learning-models

A repository consisting of machine learning models for predicting the future instance. More specifically this repository is a Machine Learning course for those who are interested in learning the basics of machine learning algorithms.

decision-trees gradient-descent gradient-descent-algorithm knn-algorithm linear-regression linear-regression-models logistic-regression-algorithm machine-learning-algorithms machine-learning-models ml naive-bayes-algorithm one-hot-encoding pca python random-forest-classifier scikit-learn svm-model

Last synced: 08 Apr 2025

https://github.com/bestmahdi2/uni__decisiontreesinpythonpredictingdiabetes

A university project designed in Python decision tree for diabetes diagnosis

decision-tree diabetes-prediction graphviz pandas prediction pydotplus python scikit-learn

Last synced: 11 Apr 2026

https://github.com/Tritonix711/FractureAI

This tool helps people upload X-rays to find broken bones. It uses a machine to mark where the breaks are and gives users marked pictures to download. A smart computer also helps people understand their broken bones and gives them advice.

css cv2 flask gorq html javascript matplotlib npm numpy pandas pydantic python react scikit-learn torch torchvision ultralytics

Last synced: 19 Aug 2025

https://github.com/nordszamora/predictive_lung_cancer

The lung cancer predictive ML project is use to predict a cancer based on the data of smoking intake and common symptoms with low cost.

bootstrap django django-rest-framework python reactjs rest-api scikit-learn vite

Last synced: 11 Apr 2026

https://github.com/serhatderya/house-prices---advanced-regression-techniques

This machine learning model was developed for "House Prices - Advanced Regression Techniques" competition in Kaggle by using several machine learning models such as Random Forest, XGBoost and LightGBM.

ai artificial-intelligence data-science ju jupyter-notebook lightgbm lightgbm-regressor machine-learning machinelearning prediction python random-forest random-forest-regression regression scikit-learn xgboost xgboost-regression

Last synced: 28 Apr 2026

https://github.com/farrajota/kaggle_titanic

My solutions to the "Titanic: Machine Learning from Disaster" kaggle competition

docker docker-compose kaggle kaggle-competition kaggle-titanic notebook pyspark python scikit-learn

Last synced: 11 Apr 2026

https://github.com/alisonmitchell/boston-housing

Investigation of the Boston housing dataset to evaluate, train and test a regression model to predict house prices.

data-science machine-learning matplotlib numpy pandas python scikit-learn scipy seaborn

Last synced: 10 Apr 2026

https://github.com/harshita2234/breast-cancer-prediction

A machine learning project to predict breast cancer using logistic regression. This project includes data preprocessing, feature scaling, model training, and evaluation, based on a guided project from Coursera.

breast-cancer-prediction jupyter-notebook logistic-regression machine-learning matplotlib numpy pandas scikit-learn

Last synced: 12 Apr 2026

https://github.com/dinhanhx/determination

Scripts to set global random for some machine learning framework

determination deterministic keras pytorch randomness scikit-learn tensorflow2

Last synced: 12 Apr 2026

https://github.com/yuvraj0412s/proactive-fraud-detection-using-machine-learning

An end-to-end machine learning project for detecting financial fraud using LightGBM, featuring in-depth EDA, advanced feature engineering, and a focus on actionable business insights.

class-imbalance classification-model data-analysis data-science data-visualization exploratory-data-analysis feature-engineering fintech fraud-detection jupyter-notebook lightgbm machine-learning pandas python scikit-learn smote

Last synced: 02 May 2026

https://github.com/hasnainyaqub/6_Months_of_Data_Science_and_AI

A portfolio of my data science projects, including exploratory data analysis (EDA), machine learning, deep learning, and data visualization using Python, Pandas, NumPy, Plotly, Seaborn, and more.

ai data-science deep-learning eda fastapi flask machine-learning pandas plotly powerbi python pytorch scikit-learn scipy sql tableau tensorflow

Last synced: 24 Nov 2025

https://github.com/hokagem/damagedlogginganalyzer

A project about an analyzation of a statistic of damaged logging (wood) in Germany using Python.

analysis csv csv-parser k-fold-cross-validation numpy pandas pandas-dataframe pandas-python polynomial-regression scikit-learn statistics wood

Last synced: 03 May 2026

https://github.com/webcog-pk/recommandation-engine-in-drf-sk-learn

Full Stack Movie Recommendation System Project made in Django REST Framework and React JS

api django django-rest-framework movies reactjs recommender-system scikit-learn

Last synced: 22 Mar 2025

https://github.com/veb-101/machine-learning-practice

Contains code-works from the Hands on scikit-learn and tensorflow book

deep-learning keras machine-learning python3 scikit-learn tensorflow-gpu

Last synced: 19 Apr 2026

https://github.com/aryansingla45/cab-fare-prediction

This repository contains a Cab Fare Prediction Web App that uses machine learning to predict fares based on distance, time, and location, with Google Maps API integration for route and fare calculations. It features a user-friendly interface and secure authentication, powered by a Kaggle dataset.

flask-application google-maps-api html-css-javascript machine-learning machine-learning-algorithms pandas-numpy python scikit-learn weather-api

Last synced: 11 May 2026

https://github.com/rs2416/Detecting_Social_Anxiety

This repository contains the full dataset and code needed to recreate the classification models and reproduce the results within this paper: https://formative.jmir.org/2021/10/e32656/

jupyter-notebook machine-learning python scikit-learn social-anxiety

Last synced: 07 May 2025