An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/tathithienthanh/datamining-banking-dataset

Implements several data mining techniques to predict whether a client will subscribe to a term deposit

apriori association-rules classification clustering data-analysis data-mining data-processing google-colab ipynb kmeans naive-bayes py python scikit-learn svm visualization

Last synced: 20 Apr 2026

https://github.com/ewertondrigues02/previsao-de-vendas

Sales forecasting for a fictitious company, with analysis done using tools such as Jupyter Notebook, Google Colab, Python, and machine learning libraries and techniques such as linear regression, decision trees, and scikit-learn

analise-de-dados analise-exploratoria arvore-de-decisao ciencia-de-dados colab excel google-colab jupyter jupyter-notebook machine-learning previsao previsao-de-vendas python3 regressao-linear scikit-learn

Last synced: 10 Feb 2026

https://github.com/idaraabasiudoh/vehicle-co2emission_model

Predicts CO2 emissions from vehicle fuel consumption using a multiple linear regression model trained with sklearn, based on a dataset of engine sizes and corresponding CO2 emissions in Canada.

data-analysis jupyter-notebook machine-learning python3 scikit-learn

Last synced: 06 May 2026

https://github.com/raythurman2386/gis-playground

GIS Playground is a comprehensive web-based GIS application that combines multiple data sources and provides advanced spatial data visualization and analysis capabilities. The application features real-time wildfire data integration, intelligent spatial data processing, and interactive mapping functionality.

flask gdal geopandas leaflet nltk scikit-learn

Last synced: 11 Feb 2026

https://github.com/tritonix711/fractureai

This tool helps people upload X-rays to find broken bones. It uses a machine to mark where the breaks are and gives users marked pictures to download. A smart computer also helps people understand their broken bones and gives them advice.

css cv2 flask gorq html javascript matplotlib npm numpy pandas pydantic python react scikit-learn torch torchvision ultralytics

Last synced: 27 Feb 2026

https://github.com/lemma-osu/sklearn-raster

Fast, parallel raster prediction with scikit-learn estimators

dask raster scikit-learn xarray

Last synced: 20 Apr 2026

https://github.com/nazchanel/fake-news-detection-algorithm

A fake news detection algorithm. This repository contains several variations of my original project. WIP.

dataset deep-learning fake-news-detection machine-learning-algorithms natural-language-processing scikit-learn work-in-progress

Last synced: 21 Apr 2026

https://github.com/vishrut-b/ml-project-with-pytorch-breast-cancer-classification

An exploration of machine learning techniques applied to classify breast cancer as malignant or benign.

breast-cancer-classification machine-learning python pytorch scikit-learn

Last synced: 11 Feb 2026

https://github.com/shridhar1504/boston-house-price-prediction-datascience-project

The Boston House Price Prediction project utilizes data science methodologies and machine learning algorithms to provide accurate predictions for housing prices in the Boston area.

boston data-science house-price-prediction machine-learning regression-algorithms regression-models scikit-learn supervised-learning

Last synced: 24 Apr 2026

https://github.com/slfagrouche/real-estate-market-analysis

Analysis of 2.2 million Realtor.com listings using Python and machine learning to uncover U.S. real estate market patterns. The project identifies market segments, predicts property prices, and reveals regional trends, providing data-driven insights for real estate professionals and investors.

data-science exploratory-data-analysis linear-regression machine-learning scikit-learn statistical-testing

Last synced: 24 Apr 2026

https://github.com/engageintellect/bitcoin-price-predictor

This Python project predicts whether the price of Bitcoin will increase or decrease on the next day, using historical price data and machine learning. Additionally, the project visualizes Bitcoin's price movements using candlestick charts along with moving averages for different timeframes.

bitcoin machine-learning matplotlib mplfinance numpy pandas python scikit-learn visualization yfinance

Last synced: 23 Oct 2025

https://github.com/divyanshugit/kaggle-titanic-machine-learning-from-disaster

A machine learning model that predicts which passengers survived the Titanic shipwreck.

data-science machine-learning machine-learning-algorithms random-forest scikit-learn svm

Last synced: 26 Apr 2026

https://github.com/camille-maslin/securecard-ai

🛡️ SecureCard-AI: A high-performance credit card fraud detection system implemented in a Jupyter Notebook, achieving 99.97% accuracy.

classification credit-card-fraud-detection data-analysis data-science fraud-detection jupyter-notebook machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 12 Feb 2026

https://github.com/marella/evaluate

A tool to evaluate the performance of various machine learning algorithms and preprocessing steps to find a good baseline for a given task.

lightgbm machine-learning python scikit-learn xgboost

Last synced: 27 Apr 2026

https://github.com/yagna123k/fresh-farm-ai

Fresh Farm AI - AI-Powered Crop Quality Control System

ai deep-learning machine-learning nextjs python scikit-learn tensorflow

Last synced: 12 Feb 2026

https://github.com/christopherkindl/start-hack-2021

Predictive models for parking space occupation using historical parking occupancy and ticket sales data as well as weather and public holiday data.

python scikit-learn xgboost

Last synced: 08 May 2026

https://github.com/spockoo/pylegend

A fusion between Python and legend, a name that suggests that the code is both modern and mythical. Three months of work, with many errors along the way, went into establishing the calculations necessary for the superposition. I want to publish my work, improve it, and share it under the Apache 2.0 License. Designed to work with NBminer!

crypto crypto-tools how-to-farm-crypto kerastuner matplotlib matrix-multiplication mining nbminer numpy performance-optimization pickle project quantum quantumcircuits quantumcomputing qubits scikit-learn tensorflow

Last synced: 27 Jan 2026

https://github.com/rickiepark/sklearn-tutorial

A start-to-finish scikit-learn tutorial

machine-learning python scikit-learn

Last synced: 08 May 2026

https://github.com/shreyansh055/dynamic_pricing_strategy_055

Dynamic Pricing Strategy Project: This project utilizes machine learning algorithms in Python to optimize ride-sharing prices through real-time demand and supply analysis. By leveraging historical Uber data, it dynamically adjusts prices to maximize revenue and improve customer satisfaction.

machine-learning numpy pandas python scikit-learn

Last synced: 13 Feb 2026

https://github.com/meyiapir/nlu-api

Web API for accessing NLU models. With tools for training models.

api chat fastapi learning machine-learning nlu python ru russian-language scikit-learn

Last synced: 15 Feb 2026

https://github.com/benzerinsio/datascience

📊 Data Science & Data Analysis | Study projects in Data Exploration (EDA), Machine Learning, and Deep Learning for practicing and demonstrating analytical techniques.

analise-de-dados analise-exploratoria analise-exploratoria-de-dados aprendizado-de-maquina aprendizado-profundo data-science data-visualization eda exploratory-analysis exploratory-data-analysis machine-learning numpy pandas python scikit-learn seaborn supervised-learning unsupervised-learning

Last synced: 14 Feb 2026

https://github.com/lordhacker756/estate-ai

Estate AI is a machine learning application that predicts the approximate rent a user would need to pay for their requirements across major metro cities of India. It is built using NextJS 13, TailwindCSS, and TypeScript for the frontend, Scikit Learn for model training, and Flask for the backend.

fastapi flask machine-learning nextjs13 scikit-learn

Last synced: 09 May 2026

https://github.com/j-i-l/tfb-prediction

Transcription factor binding prediction

bioinformatics machine-learning pandas python scikit-learn

Last synced: 09 May 2026

https://github.com/markdouthwaite/lingo

A package for quickly deploying Scikit-Learn Linear Models in Go.

golang linear-models machine-learning scikit-learn

Last synced: 15 Feb 2026

https://github.com/andreped/nlp-mtl

Training neural networks to solve multiple tasks simultaneously from free text through multi-task learning

bert-embeddings keras multi-task-learning natural-language-processing neural-networks nlp scikit-learn

Last synced: 09 May 2026

https://github.com/wwtg99/predict_height

Predict height by gender and genotypes using machine learning.

genotype height machine-learning scikit-learn

Last synced: 29 Apr 2026

https://github.com/mrapp-ke/mlrl-boomer

A scikit-learn implementation of BOOMER - An Algorithm for Learning Gradient Boosted Multi-Output Rules

gradient-boosting machine-learning multi-target-regression multilabel-classification multioutput-regressor rule-learning scikit-learn

Last synced: 10 May 2026

https://github.com/ajitashwath/nn-visualization

A web application for visualizing various aspects of neural networks.

matplotlib-pyplot python3 scikit-learn streamlit tensorflow

Last synced: 03 May 2026

https://github.com/yaqoah/used-cars-ai

🚗 predicts used car prices using a full ML pipeline

beautifulsoup eda machine-learning pandas regression scikit-learn selenium xgboost

Last synced: 19 Apr 2026

https://github.com/pr38/dask_backward_feature_selection

Backward step-wise feature selection using Dask, scikit-learn compatible

dask feature-selection machine-learning python scikit-learn

Last synced: 16 Apr 2026

https://github.com/tschechlovdev/ml2dac

Implementation of "ML2DAC: Meta-Learning to Democratize AutoML for Clustering Analyses", published at SIGMOD 2023. The paper was awarded the "reproducibility" badge by SIGMOD's reproducibility reviewers.

automl clustering meta-learning paper python reproducible-research scikit-learn

Last synced: 13 Oct 2025

https://github.com/jlgarridol/tfg-smartbeds

DATA MINING APPLIED TO THE DETECTION OF EPILEPTIC SEIZURES - GII18.13

bed datamining ensemble epileptic-seizures manifold medical-informatics oneclasssvm pca rotation-forest scikit-learn weka

Last synced: 30 Apr 2026

https://github.com/27ahmad/medicine-recommendation-system

This project aims to create a medicine recommendation system based on symptoms provided by the user. The system is built using machine learning models trained on a dataset of symptoms and their corresponding diagnoses. The frontend is designed using Bootstrap for an intuitive user interface.

bootstrap machine-learning medicine-applications pandas recommendation-system scikit-learn

Last synced: 25 Oct 2025

https://github.com/drreetusharma/molecular_innovations-for-kpgt-knowledge-guided-pre-training-of-graph-transformer-

Knowledge-guided-Pre-training-of-Graph-Transformer: The primary aim of this project is to leverage knowledge-guided pre-training techniques for enhancing the performance of graph transformers in molecular property prediction and drug discovery.

machine machine-learning neural-network pytorch rdkit scikit-learn

Last synced: 04 Mar 2026

https://github.com/smmariquit/pjdsc-economic-impact

BARLO: Bayani Alert and Response for Local Operations — predicts a storm's economic impact from typhoon forecast data using a PyTorch + scikit-learn model, deployed on Streamlit.

disaster-risk logistics machine-learning philippines pjdsc python pytorch scikit-learn streamlit typhoon

Last synced: 14 Jun 2026

https://github.com/sigilbyte/choquet-classifier

Implementation of the Choquet classifier using the scikit-learn API design.

machine-learning regression regression-models scikit-learn scikitlearn-machine-learning

Last synced: 05 May 2026

https://github.com/nirmalyabag20/loan-status-prediction-using-machine-learning

This project focuses on predicting the loan status (approved or not approved) based on various applicant details. The goal is to develop a machine learning model that accurately classifies whether a loan should be approved, helping financial institutions make informed lending decisions.

matplotlib numpy pandas python scikit-learn seaborn support-vector-machine

Last synced: 19 Jan 2026

https://github.com/mpolinowski/isometric-mapping

Non-linear dimensionality reduction through Isometric Mapping

isomap matplotlib-pyplot python scikit-learn

Last synced: 06 May 2026

https://github.com/thevarunsharma/extracting-dominant-colors

A web application that extracts the dominant colors from an image using K-means clustering.

flask-application k-means-clustering machine-learning python scikit-learn unsupervised-learning

Last synced: 12 May 2026

https://github.com/kashifmoin1410/computer-vision-traditional-vs.-deep-learning-approaches

This project compares traditional Bag-of-Words with SVM and a custom ResNet-style CNN for image classification on the CIFAR-10 dataset. It covers the full workflow: feature extraction, model building, training, evaluation, and visualization. Results demonstrate the superior accuracy and robustness of deep learning models over classic ML pipelines.

bag-of-words cifar10 cnn comparative-analysis computer-vision deep-learning feature-extraction image-classification keras knn-classification machine-learning model-evaluation neural-network python3 resnet scikit-learn sift-algorithm svm-classifier

Last synced: 06 May 2026

https://github.com/haloapping/ml-with-me

The term ML is usually a bit ambiguous, because it can stand for several things, such as Mobile Legend, Makan Lontong (eating lontong), and so on. This repo, however, covers Machine Learning :)

ml pusing python3 scikit-learn stress tau-ah-gelap

Last synced: 14 Apr 2026

https://github.com/flexycode/ccmaclrl

🤖 This repository is intended for our Machine Learning course CCMACLRL COM231ML, taught by Professor Elizer Ponio Jr

artificial-intelligence linnear-regression machine-learning machine-learning-algorithms python random-forest scikit-learn supervised-learning tensorflow

Last synced: 07 May 2026

https://github.com/marksikaundi/handson-machinelearning

Complete Collection about Machine Learning

matplotlib pandas-python scikit-learn tensorflow

Last synced: 07 May 2026

https://github.com/cbjuan/paper-ijimai-ml-employability

Jupyter notebook developed to support the research presented in the paper "Proposing a machine learning approach to analyze and predict employment and its factors"

jupyter-notebook python research scikit-learn

Last synced: 07 May 2026

https://github.com/aarryasutar/prodigy_ds_internship

These projects, completed as part of my Data Science internship, involve data visualisation, analysis, and prediction using various datasets and machine learning techniques. They use libraries such as pandas, matplotlib, seaborn, scikit-learn, and NLTK for tasks ranging from gender and age visualisation to sentiment analysis and decision tree classification.

bank-marketing-analysis barchart data-science eda exploratory-data-analysis heatmap histogram internships matplotlib pandas prodigy-infotech pyplot python scikit-learn seaborn sentiment-analysis

Last synced: 01 May 2026

https://github.com/tr-3n/smartsearch-ai

SmartSearchAI is a live semantic search engine powered by Streamlit for the UI, SerpAPI for real-time web search, and SentenceTransformers with FAISS for fast semantic similarity matching. It allows users to ask natural language queries and get intelligent, web-sourced answers without relying on a static dataset.

artificial-intelligence data-science deployment faiss machine-learning nlp pandas scikit-learn sentence-transformers streamlit

Last synced: 01 May 2026

https://github.com/asut00/machine-learning-program_42ai

Comprehensive Machine Learning path by 42AI: hands-on modules on regression, gradient descent, and real-world ML applications.

linear-regression machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 07 May 2026

https://github.com/aymanmansur/insider-threat-detection-using-cert-dataset-logon-

Detecting anomalies in user logon behavior using the CERT Insider Threat Detection Dataset. This project extracts key features like session duration and logon frequency during non-working hours and applies Isolation Forest to identify suspicious activity.

matplotlib pandas python scikit-learn

Last synced: 07 May 2026

https://github.com/mgobeaalcoba/data_champions_meli

Algorithms and work carried out within the framework of data champions by Mercado Libre

algorithms canvas classification clustering data-science machine-learning python3 scikit-learn

Last synced: 18 Apr 2026

https://github.com/rajikaimal/emma

🎅 Intelligent mention bot for GitHub organizations

bot emma machine-learning python scikit-learn

Last synced: 24 Apr 2026

https://github.com/noahtigner/discoverdaily

A Spotify Recommender System. Trains a Classifier on your musical tastes and recommends songs daily. Uses the Spotify API and scikit-learn for machine learning.

machine-learning recommender-system scikit-learn spotify spotify-api

Last synced: 24 Apr 2026

https://github.com/joshi-jyoti/heart-disease-prediction

This repository contains a Python-based project for predicting the likelihood of heart disease using a Logistic Regression machine learning model. It leverages a dataset of patient medical information to train and evaluate the model, providing insights into potential diagnoses.🩺

heart-disease-prediction heart-disease-predictor kaggle-dataset machine-learning numpy pandas python scikit-learn

Last synced: 01 May 2026

https://github.com/alessiochen/setiment-analysis-ai-project

Application of Sentiment Analysis for the Artificial Intelligence class at UNIFI

ai andrew dataset movie-reviews scikit-learn sentiment-analysis

Last synced: 12 May 2026

https://github.com/shuddha2021/stellar-candidate-selector

A sophisticated candidate selection algorithm leveraging multi-criteria analysis and machine learning to identify top software engineering candidates. This tool features flexible filtering, score adjustment, and detailed visualizations to streamline the recruitment process.

candidate-selection data-analysis data-visualization machine-learning pandas plotting-in-python python python-data-analysis recruitment scikit-learn

Last synced: 05 May 2026

https://github.com/anupam0202/contextual-rag-chatbot

Contextual RAG Chatbot that processes PDF documents using the Google Gemini API

google-generativeai numpy pypdf2 scikit-learn streamlit

Last synced: 05 May 2026

https://github.com/piyush1927/flightforecast

ML model to predict flight prices based on various features like departure time, arrival time, duration, airline, source, destination, and number of stops.

machine-learning mathplotlib numpy pandas scikit-learn seaborn

Last synced: 16 Apr 2026

https://github.com/petrosdemetrakopoulos/flight-passengers-prediction

A supervised learning problem given as a project in the "Data Mining in Databases and World Wide Web" course in the Computer Science Department of AUEB in the Winter semester of 2019.

classification classifier data-science machine-learning python scikit-learn sklearn university-project

Last synced: 30 Apr 2026

https://github.com/markoshb/my-data-science-learning-projects

Short but illustrative notebooks to showcase data-analysis in Python

data-science matplotlib-pyplot pandas python pythorch scikit-learn tensorflow

Last synced: 05 Apr 2026

https://github.com/solanovisitor/keratoconusdetector

A repository to train and evaluate CNN-LSTM models aiming to detect Keratoconus on Galilei G6 optical biometer data.

cnn deep-learning keras lstm machine-learning pandas python scikit-learn tensorflow

Last synced: 05 Apr 2026

https://github.com/shimazadeh/total-perspective-vortex

This subject aims to create a brain computer interface based on electroencephalographic data (EEG data) with the help of machine learning algorithms. Using a subject’s EEG reading, you’ll have to infer what he or she is thinking about or doing - (motion) A or B in a t0 to tn timeframe.

ai algorithm classification datascience dimensionality-reduction eeg scikit-learn

Last synced: 25 Apr 2026

https://github.com/aliy98/navigation-sensor-data-classification

Classification of a Navigation Robot Sensor Dataset Using SVM, Random Forest and Neural Network

artificial-neural-networks keras multiclass-classification random-forest scikit-learn scitos-g5 support-vector-machines

Last synced: 13 May 2026

https://github.com/cool-japan/sklears

A comprehensive machine learning library in Rust, inspired by scikit-learn's intuitive API and combining it with Rust's performance and safety guarantees.

ai artificial-intelligence machine-learning rust rust-lang scikit-learn scikitlearn-machine-learning

Last synced: 26 Apr 2026

https://github.com/idaraabasiudoh/knn-customer-classification

Assigns telecommunications customers to groups to determine the service type each customer requires.

data-analysis jupyter-notebook machine-learning pyhton3 scikit-learn

Last synced: 07 May 2026

https://github.com/joseprsm/nectarine

🍑 Neural Enhanced Collaborative Tool for Automated Recommendation and INtelligent Exploration

argo-workflows recommender-systems scikit-learn tensorflow tensorflow-recommenders

Last synced: 07 May 2026

https://github.com/md-emon-hasan/6-classification-iris-ml-apps

An ML project on the classification of the Iris dataset, demonstrating data preprocessing, model training, and evaluation using Python and scikit-learn.

classification data-science iris-classification iris-dataset iris-flower-classification predictive-modeling scikit-learn

Last synced: 26 Apr 2026

https://github.com/ultrasage-danz/scikit-learn-ml

Machine Learning with scikit-learn by Data School

ai data data-school machine-learning macos ml scikit-learn ultrasage-dan

Last synced: 13 May 2026

https://github.com/singhrahuldps/myscikitlearn

My implementation of some Machine Learning Algorithms from scratch.

classifier-model decision-trees machine-learning scikit-learn

Last synced: 27 Apr 2026

https://github.com/chirindaopensource/measuring_corruption_from_text_data

End-to-End Python implementation of Muço’s (2025) corruption measurement framework. Combines NLP pipeline (regex extraction, Porter stemming, TF-IDF), PCA-based dimensionality reduction, and fixed-effects OLS to quantify institutional quality from Brazilian audit reports. Includes supervised learning robustness checks and LOO sensitivity analysis.

audit-analysis brazilian-data corruption-measurement dictionary-based-classification dimensionality-reduction econometrics fixed-effects government-transparency institutional-quality natural-language-processing nltk political-economy portuguese-nlp principal-component-analysis research-replication scikit-learn supervised-learning text-as-data text-classification text-mining

Last synced: 27 Apr 2026

https://github.com/mrapp-ke/examplewisef1maximizer

A scikit-learn meta-estimator for multi-label classification that aims to maximize the example-wise F1 measure

machine-learning multilabel-classification scikit-learn

Last synced: 27 Apr 2026

https://github.com/bala-1409/foreign-exchange-rate-time-series-data-science-project

This project will use time series analysis to forecast the exchange rate between the euro and the US dollar. The project will use a variety of statistical techniques, such as ARIMA to model the data and forecast the exchange rate.

data-analysis data-science data-visualization datapreprocessing eda exploratory-data-analysis forecasting machine-learning-algorithms model modelfitting predictive-modeling python3 scikit-learn statsmodels time-series time-series-analysis

Last synced: 07 May 2026

https://github.com/grampers-dev/co2oracle

The CO2 Oracle project uses machine learning and AI to analyze and predict CO2 emissions for environmental management. Using a Kaggle dataset, it demonstrates predictive analytics to understand and forecast emissions. Written in Python, it employs libraries like Pandas, NumPy, and Scikit-Learn.

artificial-intelligence machine-learning numpy pandas python scikit-learn

Last synced: 07 Feb 2026

https://github.com/mehuaniket/blog-classifier

Blog classifier using a scikit-learn random forest.

bag-of-words blog-classifier python scikit-learn

Last synced: 07 May 2026

https://github.com/otuemre/realtimenids

Real-time network intrusion detection system using Zeek flow logs and machine learning (IsolationForest). Detects threats with both signature-based and anomaly-based techniques trained on the CSE-CIC-IDS2018 dataset.

anomaly-detection cybersecurity flow-analysis isolation-forest machine-learning network-intrusion-detection nids scapy scikit-learn zeek

Last synced: 07 May 2026

https://github.com/gauravsingh9356/machine_learning

All my practical learning work in machine learning (from data processing to deep learning)

deep-learning jupyter-notebook machine-learning machine-learning-algorithms nlp-machine-learning python scikit-learn

Last synced: 30 Apr 2026