An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/gitstar-oc/machine-learning

This are the Machine Learning notes by leading AI website named Deeplearning.AI. This notes will help you to be a machine learner from beginner to advanced level. Welcome Everyone!!

deep-learning deep-neural-networks jupyter-notebook machine-learning machine-learning-algorithms matplotlib numpy pandas python scikit-learn supervised-learning tensorflow unsupervised-learning

Last synced: 09 Feb 2026

https://github.com/tathithienthanh/datamining-banking-dataset

Implement some learned data mining techniques and predict if the client will subscribe to a term deposit

apriori association-rules classification clustering data-analysis data-mining data-processing google-colab ipynb kmeans naive-bayes py python scikit-learn svm visualization

Last synced: 20 Apr 2026

https://github.com/ewertondrigues02/previsao-de-vendas

Previsão de vendas de uma empresa fictícia onde foi feita análise com ferramentas como Jupyter Notebook, Google Colab, Python e bibliotecas de Machine Learn como: regressão linear, arvore de decisão, scikit-learn

analise-de-dados analise-exploratoria arvore-de-decisao ciencia-de-dados colab excel google-colab jupyter jupyter-notebook machine-learning previsao previsao-de-vendas python3 regressao-linear scikit-learn

Last synced: 10 Feb 2026

https://github.com/idaraabasiudoh/vehicle-co2emission_model

Predicts CO2 emissions from vehicle fuel consumption using a multiple linear regression model trained on sklearn, based on a dataset of engine sizes and corresponding CO2 emissions in Canada.

data-analysis jupyter-notebook machine-learning python3 scikit-learn

Last synced: 06 May 2026

https://github.com/raythurman2386/gis-playground

GIS Playground is a comprehensive web-based GIS application that combines multiple data sources and provides advanced spatial data visualization and analysis capabilities. The application features real-time wildfire data integration, intelligent spatial data processing, and interactive mapping functionality.

flask gdal geopandas leaflet nltk scikit-learn

Last synced: 11 Feb 2026

https://github.com/lemma-osu/sklearn-raster

Fast, parallel raster prediction with scikit-learn estimators

dask raster scikit-learn xarray

Last synced: 20 Apr 2026

https://github.com/tritonix711/fractureai

This tool helps people upload X-rays to find broken bones. It uses a machine to mark where the breaks are and gives users marked pictures to download. A smart computer also helps people understand their broken bones and gives them advice.

css cv2 flask gorq html javascript matplotlib npm numpy pandas pydantic python react scikit-learn torch torchvision ultralytics

Last synced: 27 Feb 2026

https://github.com/nazchanel/fake-news-detection-algorithm

A fake news detection algorithm. This repository contains the various variations of my original project. WIP.

dataset deep-learning fake-news-detection machine-learning-algorithms natural-language-processing scikit-learn work-in-progress

Last synced: 21 Apr 2026

https://github.com/shridhar1504/boston-house-price-prediction-datascience-project

The Boston House Price Prediction project utilizes data science methodologies and machine learning algorithms to provide accurate predictions for housing prices in the Boston area.

boston data-science house-price-prediction machine-learning regression-algorithms regression-models scikit-learn supervised-learning

Last synced: 24 Apr 2026

https://github.com/vishrut-b/ml-project-with-pytorch-breast-cancer-classification

An exploration of machine learning techniques applied to classify breast cancer as malignant or benign.

breast-cancer-classification machine-learning python pytorch scikit-learn

Last synced: 11 Feb 2026

https://github.com/slfagrouche/real-estate-market-analysis

Analysis of 2.2 million Realtor.com listings using Python and machine learning to uncover U.S. real estate market patterns. The project identifies market segments, predicts property prices, and reveals regional trends, providing data-driven insights for real estate professionals and investors.

data-science exploratory-data-analysis linear-regression machine-learning scikit-learn statistical-testing

Last synced: 24 Apr 2026

https://github.com/divyanshugit/kaggle-titanic-machine-learning-from-disaster

A machine learning model that predicts which passengers survived the Titanic shipwreck.

data-science machine-learning machine-learning-algorithms random-forest scikit-learn svm

Last synced: 26 Apr 2026

https://github.com/engageintellect/bitcoin-price-predictor

This Python project predicts whether the price of Bitcoin will increase or decrease on the next day, using historical price data and machine learning. Additionally, the project visualizes Bitcoin's price movements using candlestick charts along with moving averages for different timeframes.

bitcoin machine-learning matplotlib mplfinance numpy pandas python scikit-learn visualization yfinance

Last synced: 23 Oct 2025

https://github.com/camille-maslin/securecard-ai

🛡️ SecureCard-AI: A high-performance credit card fraud detection system implemented in a Jupyter Notebook, achieving 99.97% accuracy.

classification credit-card-fraud-detection data-analysis data-science fraud-detection jupyter-notebook machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 12 Feb 2026

https://github.com/marella/evaluate

A tool to evaluate the performance of various machine learning algorithms and preprocessing steps to find a good baseline for a given task.

lightgbm machine-learning python scikit-learn xgboost

Last synced: 27 Apr 2026

https://github.com/yagna123k/fresh-farm-ai

Fresh Farm AI - AI-Powered Crop Quality Control System

ai deep-learning machine-learning nextjs python scikit-learn tensorflow

Last synced: 12 Feb 2026

https://github.com/christopherkindl/start-hack-2021

Predictive models for parking space occupation using historical parking occupancy and ticket sales data as well as weather and public holiday data.

python scikit-learn xgboost

Last synced: 08 May 2026

https://github.com/rickiepark/sklearn-tutorial

사이킷런 정주행 튜토리얼

machine-learning python scikit-learn

Last synced: 08 May 2026

https://github.com/spockoo/pylegend

A fusion between Python and legend, a name that suggests that the code is both modern and mythical. 3 months of work, with tons of errors to establish the calculations necessary for the superposition, I want to publish my work and improve it and share it under Apache 2.0 License. Designed to work with NBminer!

crypto crypto-tools how-to-farm-crypto kerastuner matplotlib matrix-multiplication mining nbminer numpy performance-optimization pickle project quantum quantumcircuits quantumcomputing qubits scikit-learn tensorflow

Last synced: 27 Jan 2026

https://github.com/ksalama/gcp-ml-serving

Examples of how to serve ML models on GCP

app-engine dataflow kubernetes machine-learning scikit-learn tensorflow

Last synced: 12 Oct 2025

https://github.com/shreyansh055/dynamic_pricing_strategy_055

Dynamic Pricing Strategy Project: This project utilizes machine learning algorithms in Python to optimize ride-sharing prices through real-time demand and supply analysis. By leveraging historical Uber data, it dynamically adjusts prices to maximize revenue and improve customer satisfaction.

machine-learning numpy pandas python scikit-learn

Last synced: 13 Feb 2026

https://github.com/benzerinsio/datascience

📊 Data Science & Análise de Dados | Projetos de estudo em Exploração de Dados (EDA), Machine Learning e Deep Learning para prática e demonstração de técnicas analíticas.

analise-de-dados analise-exploratoria analise-exploratoria-de-dados aprendizado-de-maquina aprendizado-profundo data-science data-visualization eda exploratory-analysis exploratory-data-analysis machine-learning numpy pandas python scikit-learn seaborn supervised-learning unsupervised-learning

Last synced: 14 Feb 2026

https://github.com/lordhacker756/estate-ai

Estate AI is a machine learning application that predicts the approximate rent a user would need to pay for their requirement across major metro cities of India. It is built using NextJS 13, TailwindCSS, and TypeScript for the frontend, Scikit Learn for Model Training and and Flask for the backend.

fastapi flask machine-learning nextjs13 scikit-learn

Last synced: 09 May 2026

https://github.com/j-i-l/tfb-prediction

Transcription factor binding prediction

bioinformatics machine-learning pandas python scikit-learn

Last synced: 09 May 2026

https://github.com/markdouthwaite/lingo

A package for quickly deploying Scikit-Learn Linear Models in Go.

golang linear-models machine-learning scikit-learn

Last synced: 15 Feb 2026

https://github.com/andreped/nlp-mtl

Training neural networks to solve multiple tasks simultaneously from free text through multi-task learning

bert-embeddings keras multi-task-learning natural-language-processing neural-networks nlp scikit-learn

Last synced: 09 May 2026

https://github.com/wwtg99/predict_height

Predict height by gender and genotypes using machine learning.

genotype height machine-learning scikit-learn

Last synced: 29 Apr 2026

https://github.com/mrapp-ke/mlrl-boomer

A scikit-learn implementation of BOOMER - An Algorithm for Learning Gradient Boosted Multi-Output Rules

gradient-boosting machine-learning multi-target-regression multilabel-classification multioutput-regressor rule-learning scikit-learn

Last synced: 10 May 2026

https://github.com/ajitashwath/nn-visualization

A web application for visualizing various aspects of neural networks.

matplotlib-pyplot python3 scikit-learn streamlit tensorflow

Last synced: 03 May 2026

https://github.com/yaqoah/used-cars-ai

🚗 predicts used car prices using a full ML pipeline

beautifulsoup eda machine-learning pandas regression scikit-learn selenium xgboost

Last synced: 19 Apr 2026

https://github.com/pr38/dask_backward_feature_selection

Backward step-wise feature selection using Dask, scikit-learn compatible

dask feature-selection machine-learning python scikit-learn

Last synced: 16 Apr 2026

https://github.com/jlgarridol/tfg-smartbeds

MINERÍA DE DATOS APLICADA A LA DETECCIÓN DE CRISIS EPILÉPTICAS - GII18.13

bed datamining ensemble epileptic-seizures manifold medical-informatics oneclasssvm pca rotation-forest scikit-learn weka

Last synced: 30 Apr 2026

https://github.com/tschechlovdev/ml2dac

Implementation of "ML2DAC: Meta-Learning to Democratize AutoML for Clustering Analyses", published at SIGMOD 2023. The paper has awarded the "reproducibility" badge by SIGMOD's reproducibility reviewers.

automl clustering meta-learning paper python reproducible-research scikit-learn

Last synced: 13 Oct 2025

https://github.com/27ahmad/medicine-recommendation-system

This project aims to create a medicine recommendation system based on symptoms provided by the user. The system is built using machine learning models trained on a dataset of symptoms and their corresponding diagnoses. The frontend is designed using Bootstrap for an intuitive user interface.

bootstrap machine-learning medicine-applications pandas recommendation-system scikit-learn

Last synced: 25 Oct 2025

https://github.com/drreetusharma/molecular_innovations-for-kpgt-knowledge-guided-pre-training-of-graph-transformer-

Knowledge-guided-Pre-training-of-Graph-Transformer: The primary aim of this project is to leverage knowledge-guided pre-training techniques for enhancing the performance of graph transformers in molecular property prediction and drug discovery.

machine machine-learning neural-network pytorch rdkit scikit-learn

Last synced: 04 Mar 2026

https://github.com/smmariquit/pjdsc-economic-impact

BARLO: Bayani Alert and Response for Local Operations — predicts a storm's economic impact from typhoon forecast data using a PyTorch + scikit-learn model, deployed on Streamlit.

disaster-risk logistics machine-learning philippines pjdsc python pytorch scikit-learn streamlit typhoon

Last synced: 14 Jun 2026

https://github.com/francescopaolol/decisiontree

About classify iris plants into three species in this classic dataset

decision-tree-classifier jupyter-notebook kaggle machine-learning ml pandas scikit-learn

Last synced: 16 Apr 2026

https://github.com/aditya-ranjan1234/interactive-salary-prediction-with-machine-learning

A Streamlit web application for exploring the UCI Census Income dataset, training machine learning models, and predicting employee salaries.

data-science machine-learning prediction python scikit-learn streamlit xgboost

Last synced: 29 Apr 2026

https://github.com/magnuss0/movie-rec-system

The project extracts movie data using TheMovieDB API, processes it using TF-IDF and cosine similarity for generating recommendations, and stores the data in a DuckDB database. The system is encapsulated within a FastAPI web application and can be deployed using Docker. It provides movie recommendations in JSON format.

cosine-similarity docker duckdb movies-recommendation moviesdb-api ploomber poetry-python scikit-learn streamlit tf-idf

Last synced: 14 Apr 2026

https://github.com/aravindnathan02/credit-card-fraud-detection

This repository contains a Machine Learning project aimed at detecting fraudulent credit card transactions. The goal is to build a reliable and efficient model that minimizes false positives and false negatives, ensuring financial safety and improving fraud detection capabilities.

classification-model fraud-detection logistic-regression machine-learning python random-forest scikit-learn

Last synced: 11 May 2026

https://github.com/khaymanii/big_mart_prediction_model

This model was built using Python and Logistic Regression Algorithm

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 01 May 2026

https://github.com/rixiiz/using-knn-to-predict-the-obp-of-mlb-players

Using KNN to predict the On Base Percentage (OBP) of Major League Baseball (MLB) players at the end of the season

artificial-intelligence dataset f1-score jupyter-notebook knn-regression machine-learning matplotlib mse numpy pandas python scikit-learn supervised-learning

Last synced: 05 Apr 2026

https://github.com/snehilsanyal/ee524

Course webpage for IIT Guwahati EE524 Machine Learning Lab (Jul-Nov 2020) Session

course-webpage machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 01 May 2026

https://github.com/kento75/keiba_machine_learning

scikit-learnを用いた競馬予測用スクリプト

machine-learning matplotlib pandas postgresql psycopg2 python3 scikit-learn

Last synced: 18 Apr 2026

https://github.com/omanshu209/ml-basics-2022

Machine Learnings(AI) models developed using the scikit-learn library in Python.

jupyter-notebook machine-learning python python3 scikit-learn

Last synced: 06 May 2026

https://github.com/nemeslaszlo/heart-disease

Heart disease classification project with different models (LogisticRegression, KNeighboursClassifier, RandomForestClassifier) and detailed reports.

classification knearest-neighbor-classifier logistic-regression mathplotlib numpy pandas randomforest-classification scikit-learn seaborn

Last synced: 15 Apr 2026

https://github.com/glencrawford/matchmaker

A k-nearest neighbors machine learning project to perform similarity matching using a dataset of OkCupid dating profiles.

django machine-learning python scikit-learn scipy

Last synced: 06 May 2026

https://github.com/sorna-fast/breast-cancer-diagnosis-neural-network

ANN-based breast cancer classifier using the Wisconsin Diagnostic Dataset. Implements advanced feature engineering and achieves 98.25% test accuracy. Includes comprehensive EDA, model training, and clinical impact analysis

keras-classification-models keras-neural-networks keras-tensorflow matplotlib-pyplot pandas-dataframe scikit-learn seaborn-plots sklearn-library tensorflow

Last synced: 20 Apr 2026

https://github.com/franpog859/titanic-competition

❄️🚢 Machine Learning project workflow reference. Model predicts if given people survive the Titanic disaster basing on among others their age, sex and names

classification data-science kaggle machine-learning scikit-learn titanic workflow

Last synced: 05 May 2026

https://github.com/sralter/happy_customers

Predicting whether a customer is happy based on the results from a survey.

eda ensemble-classifier hyperopt lazypredict ml scikit-learn

Last synced: 21 Apr 2026

https://github.com/mpolinowski/isometric-mapping

Non-linear dimensionality reduction through Isometric Mapping

isomap matplotlib-pyplot python scikit-learn

Last synced: 06 May 2026

https://github.com/kartikdixit2468/advanced-jarvis-ai-using-python

An A.I voice assistant in python using simple machine learning algorithms and BardAPI.

bard bardapi jarvis machine-learning python scikit-learn voice-assistant voice-recognition

Last synced: 16 Apr 2026

https://github.com/dipa09/riot_imgclf

Multi-class image classifier for RIOT-OS

arduino-mega-2560 emlearn esp32-cam m2cgen micromlgen riot-os scikit-learn tinyml

Last synced: 30 Apr 2026

https://github.com/deliprofesor/ridge-regression-for-sales-prediction-model-evaluation-and-hyperparameter-tuning

This project builds and optimizes a model on a dataset using Ridge regression and polynomial features. Model accuracy is enhanced through regularization and polynomial transformations. Grid search and cross-validation are used to find the best parameters, and the model's performance is evaluated.

cross-validation data-science data-visualization grid-search machine-learning model-optimization mse overfitting-prevention polynomial-regression python r2-score regression-analysis regularization ridge-regression rmse scikit-learn

Last synced: 03 May 2026

https://github.com/ccharlesss/financeml

machine learning web application using Python's FastAPI and scikit-learn to predict S&P 500 stock price trends and cluster stocks based on average annual returns and volatility. Utilised the MVC design pattern to structure the application effectively. Implemented a decision tree classifier with 84% accuracy.

cicd docker fastapi finance javascript jenkins machine-learning restful-api scikit-learn webapplication

Last synced: 15 Apr 2026

https://github.com/flexycode/ccmaclrl

🤖 This repository is intended for our Machine Learning CCMACLRL COM231ML by Professor Elizer Ponio Jr

artificial-intelligence linnear-regression machine-learning machine-learning-algorithms python random-forest scikit-learn supervised-learning tensorflow

Last synced: 07 May 2026

https://github.com/marksikaundi/handson-machinelearning

Complete Collection about Machine Learning

matplotlib pandas-python scikit-learn tensorflow

Last synced: 07 May 2026

https://github.com/cbjuan/paper-ijimai-ml-employability

Jupyter notebook developed to support the research presented in the paper "Proposing a machine learning approach to analyze and predict employment and its factors"

jupyter-notebook python research scikit-learn

Last synced: 07 May 2026

https://github.com/kohlerhector/trex-tree-reward-exploration

Using Tree estimators of the MDP models to then count leaves grouping similar transitions and do count-based exploration.

decision-trees drl exploration rl scikit-learn stable-baselines3

Last synced: 04 May 2026

https://github.com/kashifmoin1410/computer-vision-traditional-vs.-deep-learning-approaches

This project compares traditional Bag-of-Words with SVM and a custom ResNet-style CNN for image classification on the CIFAR-10 dataset. It covers the full workflow: feature extraction, model building, training, evaluation, and visualization. Results demonstrate the superior accuracy and robustness of deep learning models over classic ML pipelines.

bag-of-words cifar10 cnn comparative-analysis computer-vision deep-learning feature-extraction image-classification keras knn-classification machine-learning model-evaluation neural-network python3 resnet scikit-learn sift-algorithm svm-classifier

Last synced: 06 May 2026

https://github.com/nirmalyabag20/loan-status-prediction-using-machine-learning

This project focuses on predicting the loan status (approved or not approved) based on various applicant details. The goal is to develop a machine learning model that accurately classifies whether a loan should be approved, helping financial institutions make informed lending decisions.

matplotlib numpy pandas python scikit-learn seaborn support-vector-machine

Last synced: 19 Jan 2026

https://github.com/asut00/machine-learning-program_42ai

Comprehensive Machine Learning path by 42AI: hands-on modules on regression, gradient descent, and real-world ML applications.

linear-regression machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 07 May 2026

https://github.com/aymanmansur/insider-threat-detection-using-cert-dataset-logon-

Detecting anomalies in user logon behavior using the CERT Insider Threat Detection Dataset. This project extracts key features like session duration and logon frequency during non-working hours and applies Isolation Forest to identify suspicious activity.

matplotlib pandas python scikit-learn

Last synced: 07 May 2026

https://github.com/soumya6tiwari/customer-segmentation-using-rfm-analysis

This project focuses on customer segmentation using RFM (Recency, Frequency, Monetary) analysis and K-Means clustering. It enables businesses to identify high-value customers, optimize marketing strategies, and improve customer retention through data-driven insights.

backend clustering flask frontend kmeans-clustering matplotlib numpy pandas python rfm-analysis scikit-learn unsupervised-learning

Last synced: 16 Feb 2026

https://github.com/rajikaimal/emma

:santa: Intelligent mention bot for GitHub organizations

bot emma machine-learning python scikit-learn

Last synced: 24 Apr 2026

https://github.com/noahtigner/discoverdaily

A Spotify Recommender System. Trains a Classifier on your musical tastes and recommends songs daily. Uses the Spotify API and scikit-learn for machine learning.

machine-learning recommender-system scikit-learn spotify spotify-api

Last synced: 24 Apr 2026

https://github.com/emmanuelezenwere/aind-aiprojects

Portfolio of AI projects developed during my Udacity AI Nanodegree, covering Planning AI, Constraint Satisfaction, Hidden Markov Models, and Search algorithms.

alpha-beta-pruning astar-algorithm bellman-equation breadth-first-search constraint-satisfaction-problem depth-first-search hidden-markov-model kalman-filter minmax-algorithm networkx nltk numpy pandas scikit-learn scipy sympy

Last synced: 29 Apr 2026

https://github.com/haloapping/ml-with-me

Kalo dengar istilah ML, biasanya rada ambigu. Soalnya punya beberapa kepanjangan, seperti Mobile Legend, Makan Lontong, dan lain-lain. Tapi pada repo ini membahas Machine Learning :)

ml pusing python3 scikit-learn stress tau-ah-gelap

Last synced: 14 Apr 2026

https://github.com/msikorski93/alzheimer-s-disease-classification

A multi classification using scikit-learn and TensorFlow models on MRI scans of patient's brains.

alzheimers-disease classification efficientnetb0 inceptionv3 knn-classifier mri-brain random-forest scikit-learn svc tensorflow

Last synced: 01 May 2026

https://github.com/analitico-771/creditworthiness_classification_model

This is an Application that trains a model using supervised learning and imbalanced-learn library in order to classify and identify the creditworthiness of borrowers

artificial-intelligence credit-risk fintech imbalanced-learning machine-learning python quantitative-finance scikit-learn supervised-machine-learning

Last synced: 04 May 2026

https://github.com/antim21/spamsense-ai

Classifying emails into Spam or Not Spam categories using Machine Learning techniques

machine-learning nlp python scikit-learn

Last synced: 04 May 2026

https://github.com/adzialocha/notebook

Jupyter notebooks for random experiments with audio processing, data analysis and machine learning

jupyter-notebook keras learning librosa music21 scikit-learn

Last synced: 15 Apr 2026

https://github.com/piyush1927/flightforecast

ML model to predict flight prices based on various features like departure time, arrival time, duration, airline, source, destination, and number of stops.

machine-learning mathplotlib numpy pandas scikit-learn seaborn

Last synced: 16 Apr 2026

https://github.com/vectominist/mednlp

Mandarin Medical Dialogue Analysis with Pytorch.

dialog huggingface mandarin medical pytorch scikit-learn transformers

Last synced: 04 May 2026

https://github.com/shimazadeh/total-perspective-vortex

This subject aims to create a brain computer interface based on electroencephalographic data (EEG data) with the help of machine learning algorithms. Using a subject’s EEG reading, you’ll have to infer what he or she is thinking about or doing - (motion) A or B in a t0 to tn timeframe.

ai algorithm classification datascience dimensionality-reduction eeg scikit-learn

Last synced: 25 Apr 2026

https://github.com/artemxdata/car-price-prediction

Car Price Prediction – Machine learning project for estimating car prices based on technical specifications and market data. The goal is to achieve an RMSE below 2500 by comparing multiple models (Linear Regression, Random Forest, LightGBM) and analyzing training vs. prediction time.

car-price-prediction data-science lightgbm machine-learning notebook python regression rmse scikit-learn supervised-learning used-cars vehicle-pricing

Last synced: 01 May 2026