An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/oriolventur/assignment-2-model-creation

Assignment 2 from Artificial Intelligence 1 course: Model creation using synthetic data and scikit-learn.

jupyter-notebook model-creation python scikit-learn

Last synced: 08 May 2026

https://github.com/sundarmd/breast-cancer-detection

Breast-Cancer-Detection is a machine learning project that utilizes logistic regression to predict whether a tumor is benign or malignant based on the Breast Cancer Wisconsin (Diagnostic) dataset. The project demonstrates data preprocessing, model training, and evaluation using the `scikit-learn` library.

logistic-regression machine-learning python scikit-learn

Last synced: 09 May 2026

https://github.com/aasjunior/mlapp-api

This API provides endpoints for applying machine learning algorithms such as K-Nearest Neighbors (KNN), Decision Tree, and Genetic Algorithm. Built as an assignment for the Mobile Laboratory/Natural Computing course in the 5th semester of Multiplatform Software Development.

fastapi machine-learning python scikit-learn

Last synced: 09 May 2026

https://github.com/vijaykumarr1452/customer-churn-prediction

Analysis of a telecom company's data, with insights gained to reduce customer churn.

anaconda jupyter-notebook machine-learning pandas prediction scikit-learn

Last synced: 09 May 2026

https://github.com/radoslawregula/iris-classification

Jupyter notebook implementing an efficient machine learning method to classify flowers from the Iris data set.

classification iris-dataset jupyter-notebook machine-learning python scikit-learn softmax-classifier

Last synced: 09 May 2026

https://github.com/l1ght14/customer-churn-prediction

Predict customer churn using machine learning models like Logistic Regression and Random Forest. Includes data preprocessing, model evaluation, feature importance, and insights to drive retention strategies.

churn-prediction classification customer-churn customer-churn-prediction data-analysis logistic-regression machine-learning python random-forest scikit-learn telecom

Last synced: 09 May 2026

https://github.com/mayankanand007/yfraud

Credit card fraud detection platform using scikit-learn and xgboost 💳

knearest-neighbor-algorithm linear-regression machine-learning predictive-analytics python3 scikit-learn svm xgboost

Last synced: 09 May 2026

https://github.com/roggersanguzu/tomato-disease-detector

This project uses transfer learning with MobileNetV2 to accurately classify tomato leaf diseases including Mosaic Virus, Septoria Leaf Spot, Blight, and Healthy leaves.

deep-learning python scikit-learn transfer-learning

Last synced: 09 May 2026

https://github.com/callmerajesh/ames-housing-price-prediction

Predicting house prices using Decision Tree Regressor on the Ames dataset

ames-housing data-science decision-tree machine-learning python regression scikit-learn

Last synced: 09 May 2026

https://github.com/saahilanande/naivebayes

Implementing a Naive Bayes classifier from scratch for sentiment analysis of the IMDB dataset

machine-learning naive-bayes-classifier python-3 scikit-learn

Last synced: 09 May 2026

https://github.com/malisha4065/flightdelaypredictiongroup99

This project focuses on predicting flight delays in the United States domestic air traffic system using machine learning techniques on more than 500,000 records. Leveraging a dataset from the Bureau of Transportation Statistics for the year 2020, we aim to develop a predictive model that can anticipate flight delays with 93.1% accuracy.

k-nearest-neighbors machine-learning python scikit-learn support-vector-machine

Last synced: 09 May 2026

https://github.com/rajan-bhateja/aqi-predictor

Different models trained on data from Indian cities to predict AQI

machine-learning-algorithms model-comparison neural-networks scikit-learn tensorflow

Last synced: 09 May 2026

https://github.com/jaswanthv99/basic_ml-model_understanding

This project explains basic ML models (KNN, Naive Bayes, Logistic Regression, SVM, and a neural network)

matplotlib-python pandas-python scikit-learn tensorflow

Last synced: 09 May 2026

https://github.com/piras-s/braincancerclassifier

Classifying brain tumors using Gaussian Naive Bayes with MRI-derived features. Includes feature selection, model evaluation, prediction uncertainty, and probability calibration.

baysian-inference calibrated-classification classification data-visualization feature-selection machine-learning medical-imaging naive-bayes-classifier python scikit-learn uncertainty-estimation

Last synced: 09 May 2026

https://github.com/suvasish114/house-price-estimation

A machine learning model that estimates housing prices in California using the California census data

jupyter-notebook machine-learning python scikit-learn

Last synced: 09 May 2026

https://github.com/bhoomikaniranjan/pulmotrainer

A Deep Learning-based Lung Cancer Detection application using a 3D CNN model with TensorFlow and OpenCV, featuring an interactive Tkinter GUI for easy data processing and training.

matplotlib numpy-pandas opencv python scikit-learn seaborn tensorflow-keras

Last synced: 09 May 2026

https://github.com/mpolinowski/fisher-discriminant-analysis

LDA is a widely used dimensionality reduction technique built on Fisher’s linear discriminant.

linear-discriminant-analysis matplotlib-pyplot python scikit-learn

Last synced: 10 May 2026

https://github.com/laavanjan/real_estate_price_prediction

This project predicts the house price per unit area based on various real estate features using a Linear Regression model. The application is built with Dash, a Python framework for building interactive web apps.

dash linear-regression pandas scikit-learn

Last synced: 10 May 2026

https://github.com/amirdora/python_ml_supervisedlearning_example

Building Classification Models with scikit-learn

machine-learning python3 scikit-learn

Last synced: 10 May 2026

https://github.com/macdon112/credit-card-fraud-detection

Comparing ML models (Random Forest, KNN, Decision Tree) for credit card fraud detection using SMOTE and stratified cross-validation.

classification data-analysis fraud-detection imbalanced-data machine-learning python scikit-learn

Last synced: 10 May 2026

https://github.com/chengetanaim/sentimentanalysisforfinancialnews

This is a Django application for predicting whether the sentiment of a financial news headline is positive, negative or neutral (from an investor's point of view)

beautifulsoup4 chartjs django html-css-javascript logistic-regression machine-learning natural-language-processing scikit-learn tfidf-vectorizer webscraping

Last synced: 10 May 2026

https://github.com/afonsojramos/feup-iart

Projects developed for Artificial Intelligence class.

feup feup-iart iart neural-network python scikit-learn tensorflow

Last synced: 10 May 2026

https://github.com/i30101/mathworks2024

Coding tools for 2024 MathWorks Math Modeling Challenge

machine-learning mathematical-modelling python scikit-learn

Last synced: 10 Jun 2026

https://github.com/alphacrypto246/student-learning-style-prediction

An interactive web application built with Streamlit that predicts a student's preferred learning style (visual, auditory, or kinesthetic) using machine learning, aiding educators in personalizing teaching strategies.

machine-learning scikit-learn scikitlearn-machine-learning streamlit

Last synced: 11 May 2026

https://github.com/vijaykumarr1452/ipl-first-innings-score-prediction-deployment

Deployment of the IPL Score Prediction Analyser model (https://github.com/vijaykumarr1452/IPL-First-Innings-Score-Prediction)

css deployment gunicorn html machine-learning ml predictive-analytics python scikit-learn

Last synced: 11 May 2026

https://github.com/monarch1108/customerinsights-kmeans

Understanding customers using KMeans and RFM (recency, frequency, and monetary) analysis

data-analysis data-visualization kmeans-clustering machine-learning matplotlib numpy pandas scikit-learn

Last synced: 11 May 2026

https://github.com/bheemisme/brain-tumor-classification

Brain tumor classification using machine learning

deep-learning machine-learning pytorch scikit-learn xgboost

Last synced: 11 May 2026

https://github.com/theladev/machine-learning

This repository focuses on showing my personal projects and interests in Machine Learning and Data Science. Hope you enjoy it.

data-science machine-learning machine-learning-models pandas python scikit-learn

Last synced: 11 May 2026

https://github.com/anras5/criteo-search-data

EDA and statistical tests on CriteoSearchData dataset

data-science pandas scikit-learn statistics

Last synced: 11 May 2026

https://github.com/deaneeth/churn-prediction-model-training

Step-by-step guide to building machine learning models for customer churn prediction, continuing from the data preprocessing phase. The repo covers training, evaluation, and saving of models, with weekly updates.

churn-prediction data-science-projects jupyter-notebook machine-learning model-evaluation model-training model-training-and-evaluation python scikit-learn

Last synced: 11 May 2026

https://github.com/ananyagubba/bike-sharing-demand-prediction

Using machine learning techniques, the model learns from features such as weather conditions, time of day, season, and holiday information to forecast hourly or daily demand.

machine-learning python scikit-learn seaborn

Last synced: 11 May 2026

https://github.com/sharvesh1401/inverse-design-patch-antenna

A machine learning approach to the inverse design of microstrip patch antennas by predicting optimal physical dimensions from desired performance metrics.

antenna-design deep-learning engineering-project gradio jupyter-notebook machine-learning patch-antenna python regression-model scikit-learn

Last synced: 11 May 2026

https://github.com/cptanalatriste/copycat-detector

A Naive-Bayes classifier for detecting plagiarism.

amazon-sagemaker naive-bayes-classifier scikit-learn

Last synced: 12 May 2026

https://github.com/g-eoj/kaggle-rotten-tomatoes

Movie review sentiment analysis with the Stanford parsed Rotten Tomatoes dataset.

cross-validation nlp nltk rotten-tomatoes scikit-learn

Last synced: 12 May 2026

https://github.com/mateusoliveira30/house-prices

This project was developed for the Kaggle competition "House Prices - Advanced Regression Techniques." The goal is to predict house sale prices using advanced regression techniques, including feature engineering, Random Forests, and Gradient Boosting.

kaggle-competition machine-learning scikit-learn

Last synced: 13 May 2026

https://github.com/johanneswiesner/skplot

A python package for extracting, plotting and reporting information from one or multiple sklearn classification & prediction pipelines.

plotting python scikit-learn sklearn visualization

Last synced: 14 May 2026

https://github.com/breezy-codes/machine-learning-for-spam-sms

Real-time SMS spam detection using ML models in simulated cellular networks. Compares 4 algorithms with comprehensive performance analysis.

logistic-regression machine-learning naive-bayes network-simulation random-forest research scikit-learn spam-sms spam-sms-detection svm telecommunication

Last synced: 14 May 2026

https://github.com/sedefkjamili/dengai-ml-prediction

Machine learning project for predicting dengue fever outbreaks using climate and environmental data.

data-science dengue gradient-boosting healthcare machine-learning python scikit-learn time-series

Last synced: 12 Jun 2026

https://github.com/muditnautiyal-21/mudra-ml

Glass-box autonomous data science in Python. Profiles data, builds leakage-safe pipelines, recommends and tunes models, and logs every decision behind the result.

automl classification clustering data-science explainable machine-learning pipeline python regression scikit-learn

Last synced: 12 Jun 2026

https://github.com/nayutalienx/osu-skill-predictor

ML-powered osu! pass probability & accuracy predictor with real-time overlay. Standalone Windows bundle available.

fastapi machine-learning osu overlay predictor scikit-learn

Last synced: 14 Jun 2026

https://github.com/rb-thompson/machine-learning-basics

Implement a machine learning pipeline on the Iris flower dataset.

data-preprocessing data-visualization model-training python scikit-learn

Last synced: 18 Jun 2026

https://github.com/jayemscript/lab-to-code

A complete Python learning roadmap for scientists and researchers — covering data science, biology, chemistry, physics, and mathematics with curated libraries, tools, and resources.

bioinformatics chemistry data-science jupyter-notebook machine-learning mathematics numpy pandas physics python research roadmap scientific-computing scikit-learn

Last synced: 19 Jun 2026

https://github.com/royxlead/production-drift-detection

Production ML monitoring library - KL, PSI, MMD, and ADWIN drift detectors with empirical benchmarks, confidence tracking, and a 6-page FastAPI dashboard.

data-drift drift-detection fastapi kl-divergence mlops mmd model-monitoring production-ml psi pytorch scikit-learn uncertainty-quantification

Last synced: 23 Jun 2026

https://github.com/gamowy/music-classification

Music genre classification using a k-nearest neighbors classifier on the GTZAN dataset

machinelearning python scikit-learn university-assignment

Last synced: 10 Apr 2026

https://github.com/imosudi/unsupervised-ml-kmeans-analysis

K-Means clustering analysis using synthetic datasets generated with scikit-learn, including meshgrid visualisation, silhouette score evaluation, and investigation of cluster count and random seed effects.

clustering data-analysis jupyter-notebook kmeans kmeans-clustering machine-learning matplotlib python3 scikit-learn silhouette-score unsupervised-learning

Last synced: 25 Jun 2026

https://github.com/annasmustafadev/network-intrusion-detection-ml

Machine learning-based Intrusion Detection System (IDS) for classifying network traffic as normal or malicious using supervised learning techniques. Includes data preprocessing, feature selection, model training, and evaluation for improved cybersecurity intelligence.

anomaly-detection classification cyber-security data-science intrusion-detection machine-learning python scikit-learn supervised-learning

Last synced: 29 Apr 2026

https://github.com/tomgorb/ds-utils

Pre-processing of a DataFrame into a sparse matrix for model input

machine-learning preprocessing scikit-learn

Last synced: 16 May 2026

https://github.com/shreeparab1890/handwritten-digit-recognition

In this IPython Notebook, we use the MNIST dataset to implement a handwritten digit recognition app using LogisticRegression and SGDClassifier, and compare the accuracy and other metrics.

handwritten-digit-recognition image-classification matplotlib mnist-dataset python scikit-learn sklearn

Last synced: 11 Apr 2026

https://github.com/das-amlan/delay-prediction-in-urban-mobility-networks

Predicting delays in urban mobility networks using different ML algorithms.

delay-prediction gradient-boosting machine-learning python r scikit-learn

Last synced: 05 Apr 2026

https://github.com/abdiasarsene/routerwise-api-predictive-analytics-for-shipments

🧭 RouterWise optimizes artwork logistics through an automated, predictive, and monitored MLOps pipeline integrated into the PrecisioArt backend.

bentoml docker fastapi jenkins mlflow prometheus scikit-learn

Last synced: 11 Apr 2026

https://github.com/genaray/ml.shopanalytics

A minimalist Python & cloud ML project that trains on Amazon sales & review data to recommend optimal prices/discounts to boost ratings/sales and surface actionable visual insights. Powered end-to-end by AWS CloudFront, S3, ALB & Fargate and Svelte.

ai aws aws-alb aws-cloudfront aws-ecs aws-fargate aws-s3 cicd devops machine-learning python scikit-learn terraform

Last synced: 11 Apr 2026

https://github.com/audy21/datacamp

Learning portfolio documenting my progress while taking Data Analyst & Data Science certifications from DataCamp.

data-analysis data-science machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/dastogirrudro/machine-learning-and-deep-learning

This is my thesis project, completed at university. It uses machine learning and deep learning, with an LSTM as the deep learning model, to identify aggressive spam messages. It is written in Python and uses pandas, scikit-learn, and other frameworks. Several algorithms were tried to increase the project's accuracy.

deep-learning lstm machine-learning numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/trimoyee-g/phishing-site-predictor

A phishing site prediction model using scikit-learn's Random Forest Classifier, achieving high accuracy and gaining insights into website characteristics.

data-visualization machine-learning python random-forest-classifier scikit-learn

Last synced: 11 Apr 2026

https://github.com/alsult/wine_classification

This is a wine classification project based on 13 numerical features of wines grown in the same region in Italy but derived from three different cultivars.

logistic-regression machine-learning matplotlib multiclass-classification pandas python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/akhileshthite/india-population

ML (simple linear regression) model for predicting India's population.

machine-learning numpy pandas python scikit-learn

Last synced: 09 Apr 2026

https://github.com/pramodyasahan/model-selection

This repository explores and compares different regression models for predicting continuous outcomes. This repository includes implementations and evaluations of five key regression models. The primary goal is to demonstrate how each model works, evaluate their performance using R-squared values, and guide users in selecting the best model.

machine-learning modelselection numpy pandas python regression scikit-learn

Last synced: 08 Mar 2025

https://github.com/jo-minseok/global-warming-100year

🌡️ ML prediction of global temperature, sea level, Arctic ice, and carbon through 2100 [Completed]

arima-model global-warming machine-learning matplotlib numpy pandas scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/anibalalpizar/python-machine-learning-example

This code reads and preprocesses a dataset for classification using pandas, numpy, matplotlib and scikit-learn. The dataset is split into three parts for training, validation and testing. The data is then scaled and optionally oversampled for balanced classes.

machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/nfordumass/hot-seat

Machine Learning Dashboard and Engine for Predicting NFL Coach Firings

astro machine-learning react scikit-learn supabase typescript

Last synced: 09 Mar 2025

https://github.com/nikhilchaudhary1/commodity-price-prediction

A Python application for predicting commodity prices (e.g., Pulses, Bread) based on state, city, year, and month using a Linear Regression model. Trained on over 1 million government dataset entries, featuring efficient data processing and prediction capabilities.

commodity-price-prediction data-processing linear-regression machine-learning pandas python scikit-learn

Last synced: 20 Apr 2026

https://github.com/arnoldchrisoduor1/machines

Testing the limits of machines

pytorch scikit-learn tensorflow

Last synced: 11 Apr 2026

https://github.com/ayushtiwari134/machine_learning_models

A repo where I upload all the models I train during my journey of learning Machine Learning from scratch

linear-regression logistic-regression machinelearning matplotlib numpy pandas python random-forest scikit-learn

Last synced: 11 Apr 2026

https://github.com/ahmedshahriar/restaurant-menu-pricing

Predict menu prices from 5M+ UberEats menus with an end-to-end MLOps pipeline: crawl → DWH → curate → train → deploy on Azure ML (MLflow) via APIM & CLIs.

azure azureml bert-embeddings docker fastapi github-actions huggingface machine-learning mlflow mlops optuna python restaurant-menu scikit-learn scrapy tensorflow transformers uber-eats web-crawler

Last synced: 03 Feb 2026

https://github.com/rakibhhridoy/differentprojects

Some of the learning projects I practiced on to get started in data science. Not all of them, but a few that were stored in my local repository. They may be useful for beginner data science enthusiasts. Explore and learn!

data-science deep-learning machine-learning mathematics matplotlib numpy pandas python scikit-learn seaborn statistics

Last synced: 11 Apr 2026

https://github.com/dllllb/ds-pipeline

Data Science model pipeline based on SciKit-Learn Estimator API

data-science machine-learning python scikit-learn

Last synced: 16 Apr 2026

https://github.com/divs-spec/skysync

SkySyncSwarm is a unified drone swarm simulation and control platform that merges the best of UAV simulators, swarm coordination libraries, deep learning models, and autonomous mission planning systems into one cohesive project.

ai-agents flask matlab python3 rrt scikit-learn scipy tcp

Last synced: 11 Apr 2026

https://github.com/agnivchtj/us-census-classifier

Find the optimal classification algorithm that can predict salaries above $50k, based on US Census data.

census-data decision-tree-classifier jupyter-notebooks knn-classifier logistic-regression naive-bayes-classifier python scikit-learn svm-classifier

Last synced: 07 May 2026

https://github.com/richardbmk/datascience_machinelearning

Projects related to data science and machine learning.

data-science machine-learning matplotlib numpy pandas scikit-learn scipy seaborn

Last synced: 11 Apr 2026

https://github.com/lorenzorottigni/dl-lending-club

Deep Learning python bootcamp: deep learning on Lending Club dataset

deep-learning ipynb keras machine-learning numpy pandas python scikit-learn seaborn tensorflow

Last synced: 11 Apr 2026

https://github.com/pb319/california_house-price-prediction

This is my first end-to-end ML project implementation, covering all required stages and taking guidance from the book "Hands On Machine Learning".

evaluation-metrics hyperparameter-tuning jupyter-notebook kfold-cross-validation machine-learning matplotlib numpy pandas python scikit-learn seaborn train-test-split

Last synced: 11 Apr 2026

https://github.com/nicolas-giacomelli/modelo-previsao-colesterol-com-gradio

Cholesterol prediction model based on information. The model includes pipelines for data processing. Made available via Gradio.

gradio machine-learning matplotlib pandas pingouin python3 saude scikit-learn scipy seaborn

Last synced: 11 Apr 2026

https://github.com/devinw03/movie-genre-nlp

🎬 Classify movie genres from plot summaries using various models, including Transformers, with clear EDA and MLflow tracking for reproducible results.

cosine-similarity countvectorizer datascience distilroberta gru huggingface imdb machine-learning multi-label-classification nlp python pytorch recommendation-engine scikit-learn slack tabulate text-classification word2vec

Last synced: 11 Apr 2026

https://github.com/stella4444/linear-regression

learning about linear regression (currently a work in progress) ~ working with data

linear-regression machine-learning numpy scikit-learn

Last synced: 20 Jan 2026

https://github.com/cmdecker95/learning-ml

My first foray into machine learning, comprised of my early undergrad work (circa 2021-2022) 🤓

jupyter-notebook keras machine-learning scikit-learn tensorflow2

Last synced: 08 Apr 2025

https://github.com/saurabhjalendra/hrf-retinal-edge-retrieval

🔬 Content-Based Image Retrieval system for retinal fundus images using HRF dataset. Implements handcrafted feature engineering (HOG, LBP, Edge Detection, Gabor, GLCM) with ML models for medical image classification and analysis.

computer-vision content-based-image-retrieval edge-detection feature-engineering gabor-filters hog-features jupyter-notebook machine-learning medical-imaging opencv retinal-analysis scikit-image scikit-learn

Last synced: 12 May 2026

https://github.com/nickklos10/league-of-legends-match-predictor

Predicting League of Legends Match Outcomes Using Logistic Regression and PyTorch.

jupyter-notebook matplotlib numpy pandas pytorch scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/ojhaharsh/rockvsmine-prediction

Streamlit App for Predicting Rock vs. Mine using SONAR Data. The app allows users to input SONAR readings and returns predictions based on trained data.

machine-learning python scikit-learn

Last synced: 03 Jan 2026

https://github.com/tszon/data-science-projects

Included are all the noteworthy Data Science projects from my learning journey with DataCamp.

data-analysis data-science exploratory-data-analysis feature-engineering machine-learning modelling preprocessing-data scikit-learn supervised-learning

Last synced: 15 Mar 2025

https://github.com/pranav-tank/heart-disease-prediction-model

I created this project as my Python term assignment. In it, I trained an ML model to predict heart disease using the scikit-learn library in Python.

google-colaboratory jupyter-notebook machine-learning prediction-model python scikit-learn

Last synced: 20 Jan 2026

https://github.com/pakagronglb/istanbul-restaurant-tips-prediction

This project focuses on predicting restaurant tips in Istanbul using machine learning techniques, specifically Linear Regression. The analysis includes data preprocessing, exploratory data analysis (EDA), model training, and evaluation.

kaggle matplotlib numpy pandas prediction python scikit-learn seaborn

Last synced: 08 Apr 2026

https://github.com/ralolooafanxyaiml/neural-evolution-engine

A modular AI evolution simulation engine built with Python and TensorFlow. Uses Deep Learning to predict organism adaptations against environmental threats.

artificial-intelligence deep-learning evolution-simulation keras machine-learning neural-networks numpy oop pandas python scikit-learn simulation tensorflow

Last synced: 07 Jan 2026

https://github.com/itssahilwhat/AI-Fundamentals

A curated collection of fundamental AI concepts, algorithms, and code implementations — including Machine Learning, Deep Learning, and Computer Vision — built from scratch and with practical examples.

computer-vision deep-learning machine-learning numpy pandas python pytorch scikit-learn

Last synced: 02 Jul 2025