An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/wwtg99/predict_height

Predict height by gender and genotypes using machine learning.

genotype height machine-learning scikit-learn

Last synced: 29 Apr 2026

https://github.com/kingyiusuen/travelers-insurance-fraud

Detect fraudulent insurance claims using machine learning

data-science end-to-end-machine-learning flask machine-learning scikit-learn

Last synced: 18 Apr 2026

https://github.com/eugeniolr/evolml

Compilation of Machine Learning models and tools that use metaheuristic optimization. Every model is implemented as a scikit-learn style model and metaheuristics are implemented in the metaheuristic-designer framework.

clustering feature-selection genetic-algorithm hyperparameter-optimization hyperparameter-tuning machine-learning machine-learning-algorithms metaheuristics scikit-learn sklearn

Last synced: 17 Mar 2026

https://github.com/shreyansh055/time-series-forecasting_055

The Time Series Forecasting Project predicts future trends using historical data with Python, Pandas, and models like ARIMA, LSTM, and Prophet, focusing on scalable, accurate forecasting for business and finance.

lstm matplotlib numpy pandas python scikit-learn seaborn

Last synced: 27 Jan 2026

https://github.com/mrapp-ke/mlrl-boomer

A scikit-learn implementation of BOOMER - An Algorithm for Learning Gradient Boosted Multi-Output Rules

gradient-boosting machine-learning multi-target-regression multilabel-classification multioutput-regressor rule-learning scikit-learn

Last synced: 10 May 2026

https://github.com/anaconda/intel-green-ai

Code and Experimental Package attached to the article "Greener Machine Learning Computing with Intel AI Acceleration"

anaconda anaconda-environment green-ai machine-learning machine-learning-benchmarks scikit-learn scikit-learn-benchmarks

Last synced: 05 Feb 2026

https://github.com/harshd23/attendance_system_using_face_recognition

The purpose of this Attendance System Using Face System is to record the presence or attendance of employee through a browser by recognizing the faces captured through a webcam. For this record-keeping, a database was built to store the in-time and out-time of the employee.

attendance-management-system face-recognition histogram-of-oriented-gradients machine-learning opencv scikit-learn sqlite-database support-vector-machine

Last synced: 18 May 2026

https://github.com/camille-maslin/securecard-ai

๐Ÿ›ก๏ธ SecureCard-AI: A high-performance credit card fraud detection system implemented in a Jupyter Notebook, achieving 99.97% accuracy.

classification credit-card-fraud-detection data-analysis data-science fraud-detection jupyter-notebook machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 12 Feb 2026

https://github.com/analitico-771/vc_funding_predictor

This is a machine learning model that predicts whether applicants will be successful if funded by a Venture Capital Firm

machine-learning predictive-modeling python scikit-learn tensorflow venture-capital

Last synced: 15 Apr 2026

https://github.com/engageintellect/bitcoin-price-predictor

This Python project predicts whether the price of Bitcoin will increase or decrease on the next day, using historical price data and machine learning. Additionally, the project visualizes Bitcoin's price movements using candlestick charts along with moving averages for different timeframes.

bitcoin machine-learning matplotlib mplfinance numpy pandas python scikit-learn visualization yfinance

Last synced: 23 Oct 2025

https://github.com/lemma-osu/sklearn-raster

Fast, parallel raster prediction with scikit-learn estimators

dask raster scikit-learn xarray

Last synced: 20 Apr 2026

https://github.com/ajitashwath/nn-visualization

A web application for visualizing various aspects of neural networks.

matplotlib-pyplot python3 scikit-learn streamlit tensorflow

Last synced: 03 May 2026

https://github.com/divyanshugit/kaggle-titanic-machine-learning-from-disaster

A machine learning model that predicts which passengers survived the Titanic shipwreck.

data-science machine-learning machine-learning-algorithms random-forest scikit-learn svm

Last synced: 26 Apr 2026

https://github.com/prashver/house-prices-prediction

Utilizing the House Prices Dataset , this project predicts home prices through a Jupyter notebook-based data science pipeline. It includes exploratory data analysis, cleaning, feature engineering, and modeling. The project explores diverse aspects of residential homes to understand price influences beyond traditional factors.

machine-learning matplotlib numpy pandas regression-models scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/superbderrick/creditratingprediction

A simple data prediction system that evaluates creditRating with a little bit data

scikit-learn tensorflow

Last synced: 05 May 2026

https://github.com/yaqoah/used-cars-ai

๐Ÿš— predicts used car prices using a full ML pipeline

beautifulsoup eda machine-learning pandas regression scikit-learn selenium xgboost

Last synced: 19 Apr 2026

https://github.com/ffstghc/caco2ml

Main code chunks used for models in the publication "Exploring the Potential of Adaptive, Local Machine Learning (ML) in Comparison ton the Prediction Performance of Global Models: A Case Study from Bayer's Caco-2 Permeability Database"

caco-2 local-models machine-learning pharmacokinetics scikit-learn

Last synced: 18 Oct 2025

https://github.com/sanggusti/genre_classification

Reproducible Project Music Genre Classification

hydra mlflow numpy pandas pytest random-forest scikit-learn wandb

Last synced: 01 May 2026

https://github.com/slfagrouche/real-estate-market-analysis

Analysis of 2.2 million Realtor.com listings using Python and machine learning to uncover U.S. real estate market patterns. The project identifies market segments, predicts property prices, and reveals regional trends, providing data-driven insights for real estate professionals and investors.

data-science exploratory-data-analysis linear-regression machine-learning scikit-learn statistical-testing

Last synced: 24 Apr 2026

https://github.com/williamjsdavis/predictsdereversal

Attempting to predict the time until a reversal geomagnetic reversals using machine learning techniques.

geomagnetism geophysics machine-learning random-forest scikit-learn

Last synced: 01 May 2026

https://github.com/kevinliao159/klearn

Statistical & ML Tool Kits for Noisy Data Classification Problems

algorithms data-mining data-science kaggle machine-learning scikit-learn

Last synced: 05 May 2026

https://github.com/vishrut-b/ml-project-with-pytorch-breast-cancer-classification

An exploration of machine learning techniques applied to classify breast cancer as malignant or benign.

breast-cancer-classification machine-learning python pytorch scikit-learn

Last synced: 11 Feb 2026

https://github.com/shridhar1504/boston-house-price-prediction-datascience-project

The Boston House Price Prediction project utilizes data science methodologies and machine learning algorithms to provide accurate predictions for housing prices in the Boston area.

boston data-science house-price-prediction machine-learning regression-algorithms regression-models scikit-learn supervised-learning

Last synced: 24 Apr 2026

https://github.com/alextanhongpin/spam-api

Microservices for spam filtering system

python scikit-learn

Last synced: 21 Apr 2026

https://github.com/glencrawford/australia_rain_tomorrow_binary_classification_prediction

Binary classification model to predict whether or not it will rain tomorrow with a Tensorflow/Keras and scikit-learn neural network.

binary-classification classification keras machine-learning neural-network python scikit-learn tensorflow

Last synced: 01 May 2026

https://github.com/wwunlp/sner

๐’ˆฌ Sumerian Named Entity Recognition

machine-learning ner nlp python scikit-learn

Last synced: 07 Feb 2026

https://github.com/pr38/dask_backward_feature_selection

Backward step-wise feature selection using Dask, scikit-learn compatible

dask feature-selection machine-learning python scikit-learn

Last synced: 16 Apr 2026

https://github.com/jlgarridol/tfg-smartbeds

MINERรA DE DATOS APLICADA A LA DETECCIร“N DE CRISIS EPILร‰PTICAS - GII18.13

bed datamining ensemble epileptic-seizures manifold medical-informatics oneclasssvm pca rotation-forest scikit-learn weka

Last synced: 30 Apr 2026

https://github.com/tschechlovdev/ml2dac

Implementation of "ML2DAC: Meta-Learning to Democratize AutoML for Clustering Analyses", published at SIGMOD 2023. The paper has awarded the "reproducibility" badge by SIGMOD's reproducibility reviewers.

automl clustering meta-learning paper python reproducible-research scikit-learn

Last synced: 13 Oct 2025

https://github.com/27ahmad/medicine-recommendation-system

This project aims to create a medicine recommendation system based on symptoms provided by the user. The system is built using machine learning models trained on a dataset of symptoms and their corresponding diagnoses. The frontend is designed using Bootstrap for an intuitive user interface.

bootstrap machine-learning medicine-applications pandas recommendation-system scikit-learn

Last synced: 25 Oct 2025

https://github.com/shawnzhang31/ml-handson

Machine Learning Giggle using Scikit-Learn, Tensorflow, Keras, Pytorch

pytorch scikit-learn tensorflow

Last synced: 18 Apr 2026

https://github.com/nazchanel/fake-news-detection-algorithm

A fake news detection algorithm. This repository contains the various variations of my original project. WIP.

dataset deep-learning fake-news-detection machine-learning-algorithms natural-language-processing scikit-learn work-in-progress

Last synced: 21 Apr 2026

https://github.com/drreetusharma/molecular_innovations-for-kpgt-knowledge-guided-pre-training-of-graph-transformer-

Knowledge-guided-Pre-training-of-Graph-Transformer: The primary aim of this project is to leverage knowledge-guided pre-training techniques for enhancing the performance of graph transformers in molecular property prediction and drug discovery.

machine machine-learning neural-network pytorch rdkit scikit-learn

Last synced: 04 Mar 2026

https://github.com/smmariquit/pjdsc-economic-impact

BARLO: Bayani Alert and Response for Local Operations โ€” predicts a storm's economic impact from typhoon forecast data using a PyTorch + scikit-learn model, deployed on Streamlit.

disaster-risk logistics machine-learning philippines pjdsc python pytorch scikit-learn streamlit typhoon

Last synced: 14 Jun 2026

https://github.com/rvats20/income-classification-using-ml

Model Training, Implementing various machine learning algorithms such as Logistic Regression, Decision Trees, Random Forests, and Gradient Boosting. Model Evaluation: Assessing model performance using metrics like accuracy, precision, recall, and F1-score. Hyperparameter Tuning

classification machine-learning machine-learning-algorithms ml pandas-dataframe python scikit-learn

Last synced: 11 May 2026

https://github.com/dipa09/riot_imgclf

Multi-class image classifier for RIOT-OS

arduino-mega-2560 emlearn esp32-cam m2cgen micromlgen riot-os scikit-learn tinyml

Last synced: 30 Apr 2026

https://github.com/petrosdemetrakopoulos/flight-passengers-prediction

A supervised learning problem given as a project in the "Data Mining in Databases and World Wide Web" course in Computer Science Department of AUEB in Winter semester of 2019.

classification classifier data-science machine-learning python scikit-learn sklearn university-project

Last synced: 30 Apr 2026

https://github.com/gauravsingh9356/machine_learning

All my practical learning work involved in MACHINE LEARNING (Data Processing to Deep Learning)

deep-learning jupyter-notebook machine-learning machine-learning-algorithms nlp-machine-learning python scikit-learn

Last synced: 30 Apr 2026

https://github.com/mg380/ibm-applied-data-science-capstone

This Capstone is the 10th (final) course in IBM Data Science Professional Certificate specialization, and it actually summarises in the form of project all materials that have been learned during this specialization

capstone data data-analysis data-science datascience ibm machine-learning plotly python scikit-learn sql

Last synced: 05 Mar 2026

https://github.com/nmsby/pca-machine-learning-lab

Principal Component Analysis (PCA) implementation and analysis lab for Machine Learning. Features manual PCA implementation, scikit-learn applications, data compression, and feature extraction with detailed visualizations.

data-analysis dimensionality-reduction jupyter-notebook machine-learning numpy pca python scikit-learn visualization

Last synced: 01 May 2026

https://github.com/charmee123/krishakvriddhi-final

I have also deployed this site on replit you can also check from that. https://replit.com/@charmee123/KrishakVriddhi?v=1

bootstrap css flask html javascript machine-learning python replit scikit-learn weather-api

Last synced: 14 Apr 2026

https://github.com/anarya22/heart-disease-classification

Predicting heart disease using machine learning. This notebook looks into various python base ML and DS libraries in an attempt to build a machine learning model capable of predicting whether or not someone has heart disease based on their medical attributes.

data-cleaning data-visualization machine-learning matplotlib numpy pandas scikit-learn

Last synced: 01 May 2026

https://github.com/rickiepark/ml-ko

๋จธ์‹ ๋Ÿฌ๋‹, ๋”ฅ๋Ÿฌ๋‹ ํ•œ๊ธ€ ๋ฒˆ์—ญ ์ €์žฅ์†Œ

deep-learning keras machine-learning python scikit-learn tensorflow

Last synced: 17 Apr 2026

https://github.com/artemxdata/car-price-prediction

Car Price Prediction โ€“ Machine learning project for estimating car prices based on technical specifications and market data. The goal is to achieve an RMSE below 2500 by comparing multiple models (Linear Regression, Random Forest, LightGBM) and analyzing training vs. prediction time.

car-price-prediction data-science lightgbm machine-learning notebook python regression rmse scikit-learn supervised-learning used-cars vehicle-pricing

Last synced: 01 May 2026

https://github.com/msikorski93/alzheimer-s-disease-classification

A multi classification using scikit-learn and TensorFlow models on MRI scans of patient's brains.

alzheimers-disease classification efficientnetb0 inceptionv3 knn-classifier mri-brain random-forest scikit-learn svc tensorflow

Last synced: 01 May 2026

https://github.com/snehilsanyal/ee524

Course webpage for IIT Guwahati EE524 Machine Learning Lab (Jul-Nov 2020) Session

course-webpage machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 01 May 2026

https://github.com/khaymanii/big_mart_prediction_model

This model was built using Python and Logistic Regression Algorithm

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 01 May 2026

https://github.com/m-rishab/credbet

A loan prediction web app which tells You that you are eligible for loan or not!

decision-tree-classifier matplotlib numpy pandas python scikit-learn

Last synced: 02 Apr 2026

https://github.com/joshi-jyoti/heart-disease-prediction

This repository contains a Python-based project for predicting the likelihood of heart disease using a Logistic Regression machine learning model. It leverages a dataset of patient medical information to train and evaluate the model, providing insights into potential diagnoses.๐Ÿฉบ

heart-disease-prediction heart-disease-predictor kaggle-dataset machine-learning numpy pandas python scikit-learn

Last synced: 01 May 2026

https://github.com/tr-3n/smartsearch-ai

SmartSearchAI is a live semantic search engine powered by Streamlit for the UI, SerpAPI for real-time web search, and SentenceTransformers with FAISS for fast semantic similarity matching. It allows users to ask natural language queries and get intelligent, web-sourced answers without relying on a static dataset.

artificial-intelligence data-science deployment faiss machine-learning nlp pandas scikit-learn sentence-transformers streamlit

Last synced: 01 May 2026

https://github.com/aarryasutar/prodigy_ds_internship

These projects as a part of my Data Science internship involve data visualisation, analysis, & prediction using various datasets and machine learning techniques. They utilize libraries like pandas, matplotlib, seaborn, scikit-learn, and NLTK for tasks ranging from gender and age visualisation to sentiment analysis and decision tree classification.

bank-marketing-analysis barchart data-science eda exploratory-data-analysis heatmap histogram internships matplotlib pandas prodigy-infotech pyplot python scikit-learn seaborn sentiment-analysis

Last synced: 01 May 2026

https://github.com/george-gca/ai_papers_search_tool

Automatic paper clustering and search tool by fastext from Facebook Research

fasttext fasttext-embeddings fasttext-python nlp python scikit-learn

Last synced: 02 May 2026

https://github.com/wesslen/dsba6211-summer2024

DSBA6211 Adv Business Analytics Lab Notebooks

scikit-learn teaching

Last synced: 17 Apr 2026

https://github.com/sralter/classifire

Wildfire Prediction Model: Samuel Alter's BrainStation 2023 Data Science Capstone Project

qgis scikit-learn tensorflow

Last synced: 02 May 2026

https://github.com/ghufranbarcha/codsoft-machine-learning-internship

This repository contain all Machine Learning & NLP task during my internship at Codsoft.

jupyter-notebook machinelearning nlp nltk python scikit-learn

Last synced: 17 Apr 2026

https://github.com/sapsan14/water-quality-ee

Estonian water quality ML โ€” binary classification of Terviseamet open data, Jupyter + scikit-learn.

classification estonia jupyter ml open-data scikit-learn

Last synced: 02 May 2026

https://github.com/bistcuite/plainml

Painless Machine Learning Library for python based on scikit-learn

machine-learning ml plainml python scikit-learn

Last synced: 02 May 2026

https://github.com/umar-saadat/car-price-prediction-ml

๐Ÿš— A Machine Learning project that predicts the price of used cars using Linear Regression. Built with Python, Scikit-learn, and Streamlit, this app takes inputs like car brand, year, mileage, engine size, and more to estimate the selling price in real-time

ai-project car-price-prediction data-science linear-regression machine-learning ml-project python scikit-learn streamlit

Last synced: 02 May 2026

https://github.com/khaymanii/titanic_survival_prediction_-model

This Model was built using Python and Logistic Regression algorithm

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 02 May 2026

https://github.com/rakibhhridoy/machinelearning-featureselection

Before training a model or feed a model, first priority is on data,not in model. The more data is preprocessed and engineered the more model will learn. Feature selectio one of the methods processing data before feeding the model. Various feature selection techniques is shown here.

extratreesclassifier feature-selection gridsearchcv lasso-regression logistic-regression machine-learning numpy pandas pca rfe rfecv scikit-learn selectkbest

Last synced: 02 May 2026

https://github.com/prashver/titanic-survival-prediction

This project tackles the Titanic challenge on Kaggle, predicting passenger survival based on variables like age, sex, and passenger class. The Jupyter notebook covers essential steps of a data science pipeline, including exploratory data analysis, data cleaning, feature engineering, and modeling. The dataset used is the Titanic dataset.

classification-algorithm machine-learning-algorithms matplotlib numpy pandas scikit-learn seaborn

Last synced: 02 May 2026

https://github.com/zazi2002/machine-learning-project

Introduction to Machine Learning project with the goal of improving the classification performance on a dataset by optimizing the number of features and weak learners.

dimentionality-reduction ensemble-learning numpy pca random-forest scikit-learn

Last synced: 02 May 2026

https://github.com/chitralputhran/drive-curve-machine-learning-app

:blue_car: Drive Curve is a web application made with the help of Flask, a microframework for Python based on Werkzeug, Jinja 2, and good intentions. On the backend, a Machine Learning model is used for predicting the price of the car. The machine learning model was trained on the Automobile Dataset from the UCI Machine Learning Repository.

flask machine-learning python scikit-learn webapp

Last synced: 03 May 2026

https://github.com/ivanyu/kaggle-digit-recognizer

Kaggle's "Digit Recognizer" competition

kaggle keras machine-learning scikit-learn

Last synced: 17 Apr 2026

https://github.com/harshitwaldia/stock-price-prediction

An AI-driven stock market analysis dashboard that predicts next-day stock prices using a deep learning LSTM model. The project features: ๐Ÿ”ฎ AI Predictions for stock movements ๐ŸŒ Global market support (US, India, China, Japan, UK) ๐Ÿ“Š Interactive React dashboard with charts & recent searches โšก Flask backend powered by Tensor/Keras & Yahoo Finance

dashboard flask flask-cors keras-tensorflow lstm-neural-networks machine-learning numpy react-typescript scikit-learn stock-price-prediction

Last synced: 03 May 2026

https://github.com/carmoreno/analisisaccidentalidadbogota

Data Analysis about traffic accidents at Bogotรก, Colombia.

data-analysis data-science jupyer-notebook matplotlib numpy pandas scikit-learn

Last synced: 17 Apr 2026

https://github.com/siam29/credit-card-fraud-detection-in-real-time

This project delivers a fast and efficient fraud detection methodology, providing predictions in under a second, emphasizing the importance of both high performance and quick response times.

ensemble-machine-learning feature-selection genetic-algorithm machine-learning matplotlib pandas pca scikit-learn

Last synced: 03 May 2026

https://github.com/baggiponte/ta-statistics-for-big-data-2022

๐ŸŽ“ Introduction to Python and Machine Learning [UniMi โ€ข AY 2021/2022]

clustering data-science data-visualization machine-learning python scikit-learn

Last synced: 03 May 2026

https://github.com/h-fuzzy-logic/python-finding-nsf-award-themes

Using NLP to find themes and concepts in NSF Awards

nltk pandas python scikit-learn

Last synced: 03 May 2026

https://github.com/md-emon-hasan/ai-from-university

๐ŸŽ“ Collection of academic resources, projects, and exercises related to artificial intelligence concepts learned in university coursework.

ai artificial-intelligence linear-regression logestic-regression mahcine-learning ml scikit-learn

Last synced: 17 Apr 2026

https://github.com/byigitt/smartmove

fake data generation and analysis for ankara metro station

ankara cv2 metro numpy pandas scikit-learn

Last synced: 03 May 2026