An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/samarjitsahoo/house-price-prediction

Explore my Machine Learning repository featuring a House Price Predictor project. Leveraging advanced algorithms, this project predicts house prices based on various features like location, size, amenities, and market trends. Dive into the world of predictive analytics and gain insights into the dynamic real estate market.

machine-learning matplot pandas scikit-learn

Last synced: 31 Jul 2025

https://github.com/ragibhasan894/phishing_website_detection

This project detects phishing, fraudulent, and malicious websites using Random Forest classification. It is implemented in Python with the Django framework.

cyber-security data-mining data-science django django-framework machine-learning phsihing python random-forest scikit-learn security

Last synced: 26 Oct 2025

https://github.com/howardyclo/kmeans-dbscan-tutorial

A clustering tutorial with scikit-learn for beginners.

clustering-algorithm dbscan ipython-notebook kmeans scikit-learn tutorial

Last synced: 01 Apr 2026

https://github.com/rubydamodar/loan-approval-prediction-

Loan approval prediction is a popular machine learning project, especially in the banking and finance industry. The goal of this project is to build a predictive model that can determine whether a loan application will be approved or not based on the applicant's information such as income, credit history, and loan amount.

ai-in-finance banking classification classification-internal credit-risk data-science exploratory-data-analysis feature-engineering financial-analytics loan-approval machine-learning matplotlib pandas predictive-modeling python scikit-learn seaborn visualization

Last synced: 17 Jul 2025

https://github.com/gauravpandeylab/eipy

Ensemble Integration: a customizable pipeline for generating multi-modal, heterogeneous ensembles

classification ensemble interpretation machine-learning multimodal nested-cross-validation predictive-modeling scikit-learn

Last synced: 24 Oct 2025

https://github.com/ax-va/python-machine-learning-recipes-gallatin-albon-2023

Machine learning recipes in Python with scikit-learn, OpenCV, PyTorch, and other libraries, covering classical machine learning and neural networks. Based on the book "Machine Learning with Python Cookbook: Practical Solutions from Preprocessing to Deep Learning", Second Edition, by Kyle Gallatin and Chris Albon, published by O'Reilly Media in 2023.

ax-va data-science deep-learning image-processing machine-learning neural-networks opencv opencv-python python pytorch scikit-learn

Last synced: 08 Oct 2025

https://github.com/smarie/python-m5p

An implementation of M5 and model trees in Python, compliant with scikit-learn.

m5 machine-learning model prime regression scikit-learn tree

Last synced: 26 Oct 2025

https://github.com/jakoch/jupyter-devbox

A Docker-based devcontainer for Jupyter Notebooks with a focus on Computer Vision, Machine Learning, Finance, Statistics and Visualization.

debian devcontainer docker imutils jupyter-notebook keras opencv pandas python3 scikit-learn scipy seaborn tensorflow

Last synced: 16 Jan 2026

https://github.com/pateash/kisanmitra

Crop Yield Prediction Web App Built using Sklearn and Laravel Web Framework

crop-yeild-prediction farmers laravel machinelearning scikit-learn

Last synced: 03 Mar 2026

https://github.com/mapbox/gabbar

Guarding OpenStreetMap from harmful edits using machine learning

banished jupyter-notebook machine-learning openstreetmap scikit-learn vandalism

Last synced: 12 Apr 2025

https://github.com/octalpixel/skin-extraction-from-image-and-finding-dominant-color

This project implements skin segmentation using OpenCV and dominant color extraction using scikit-learn.

image-processing kmeans kmeans-clustering machine-learning opencv python scikit-learn

Last synced: 12 Apr 2025

https://github.com/fbruzzesi/sklearn-smithy

Toolkit to forge scikit-learn compatible estimators

cli data-science machine-learning python scikit-learn webui

Last synced: 16 Sep 2025

https://github.com/scikit-multilearn-ng/scikit-multilearn-ng

A newly maintained "successor" to scikit-multilearn, a scikit-learn-based module for multi-label et al. classification

classification clustering label-prediction machine-learning multi-label multi-label-classification partitioning scikit scikit-learn scikit-multilearn

Last synced: 21 Aug 2025

https://github.com/san089/big_data_project

Fake news detection: feature extraction using vectorizers such as Count Vectorizer, TF-IDF Vectorizer, and Hash Vectorizer, followed by an ensemble model that classifies whether the news is fake or not.

classifiers ensemble-model fakenewsdetection machine-learning news-classification scikit-learn text-mining textclassification vectorization vectorizers

Last synced: 13 Mar 2026

https://github.com/kencyke/hopfield-mnist

A scikit-learn implementation of a Hopfield network for MNIST

hopfield-network mnist python scikit-learn

Last synced: 17 Jan 2026

https://github.com/trainingbypackt/machine-learning-fundamentals

Use Python and scikit-learn to get up and running with the hottest developments in machine learning

jupyter-notebook machine-learning python3 scikit-learn

Last synced: 08 Mar 2026

https://github.com/ozguraslank/flexml

Easy-to-use and flexible AutoML library for Python

automl data-science machine-learning python scikit-learn

Last synced: 18 Jul 2025

https://github.com/rohankishore/graphyte

A simple and responsive math graphing app powered by PyQt6, NumPy and Matplotlib. Create and analyze functions with ease.

equation-solver geogebra graph graph-algorithms matplotlib numpy pyqt6 python scikit-learn visualization

Last synced: 14 Mar 2026

https://github.com/dunnkers/fseval

Benchmarking framework for Feature Selection and Feature Ranking algorithms 🚀

automl benchmarking benchmarking-framework benchmarks feature-rankers feature-ranking feature-selection hydra machine-learning python scikit-learn wandb

Last synced: 13 Apr 2025

https://github.com/mauroluzzatto/explainy

explainy is a Python library for generating machine learning model explanations for humans

data-science explanation machine-learning machine-learning-explainability python scikit-learn

Last synced: 07 Oct 2025

https://github.com/curiousily/Reproducible-ML-with-DVC

Tutorial on experiment tracking and reproducibility for Machine Learning projects with DVC

deep-learning dvc experiment-tracking linear-regression machine-learning metrics python random-forest reproducibility scikit-learn tracking

Last synced: 29 Jul 2025

https://github.com/esvs2202/concrete-compressive-strength-prediction

The aim of this project is to develop a solution using data science and machine learning to predict the compressive strength of concrete with respect to its age and the quantity of ingredients used.

anaconda data-visualization flask gunicorn-web-server heroku-deployment html5 joblib jupyter-notebook machine-learning-algorithms matplotlib-pyplot numpy pandas pycharm-ide python3 randomizedsearchcv scikit-learn seaborn statsmodels xgboost-regression

Last synced: 12 Oct 2025

https://github.com/curiousily/reproducible-ml-with-dvc

Tutorial on experiment tracking and reproducibility for Machine Learning projects with DVC

deep-learning dvc experiment-tracking linear-regression machine-learning metrics python random-forest reproducibility scikit-learn tracking

Last synced: 07 Apr 2026

https://github.com/991o2o9/smart-cardiologist

Intelligent Python service with FastAPI for real-time heart disease predictions using machine learning. Features AI-assisted consultations, user authentication, analysis history, RESTful API, and comprehensive error handling. Secure and scalable solution for healthcare applications.

api artificial-intelligence data-science fastapi healthcare healthcare-technology heart-disease machine-learning medical-ai medical-diagnosis prediction predictive-analytics pydantic python rest-api scikit-learn swagger uvicorn

Last synced: 30 Aug 2025

https://github.com/younader/dnnr

The Python package for differential nearest neighbors regression (DNNR): raising KNN regression to the level of gradient boosting methods. Built on top of NumPy, scikit-learn, and Annoy.

annoy knn machine-learning machine-learning-algorithms numpy python3 scikit-learn tabular-data

Last synced: 15 Apr 2025

https://github.com/tpsatish95/indus-script-ocr

The Indus script optical grapheme recognition engine (from archaeological artifact images)

ancient-texts caffe computer-vision deep-learning digital-humanities epigraphy opencv optical-character-recognition pipeline scikit-image scikit-learn

Last synced: 14 Apr 2025

https://github.com/sudipbishwakarma/msc-dissertation-2021

Stock Prediction using LSTM, Linear Regression, ARIMA and GARCH models. Hyperparameter Optimization using Optuna framework for LSTM variants.

arima exploratory-data-analysis garch googlecolab hyperparameter-optimization jupyter-notebook kaggle linearregression lstm nepse optuna scikit-learn stock-price-prediction tensorflow time-series-analysis

Last synced: 14 Oct 2025

https://github.com/aifred-health/vulcanai

A high-level deep learning framework for quickly prototyping networks, with added tools for data visualisation, model interpretability and performance metrics

data-analysis data-cleaning data-science data-visualization deep-learning deep-neural-networks feature-engineering mental-health python3 pytorch scikit-learn

Last synced: 01 Aug 2025

https://github.com/manasvigoyal/gmail-classification

Extract emails from a Gmail account, convert them to an Excel file, and classify them using various classification algorithms.

beautifulsoup classification email-classification excel gmail jupyter-notebooks machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 28 Oct 2025

https://github.com/mantreshkhurana/twitter-toxicity-detection-flask

This is a simple Python program that uses a machine learning model to detect toxicity in tweets, developed in Flask.

flask hate-speech-detection hate-speech-detection-flask linear-regression ml python scikit-learn sklearn toxicity-detection twitter-api

Last synced: 05 Mar 2025

https://github.com/bgu-cs-vil/pdc-dp-means

"Revisiting DP-Means: Fast Scalable Algorithms via Parallelism and Delayed Cluster Creation" [Dinari and Freifeld, UAI 2022]

clustering dpmeans kmeans machine-learning minibatch scikit-learn

Last synced: 10 Apr 2025

https://github.com/fabianp/mash_2016_sklearn_intro

Material for the MASH course on introduction to scikit-learn

machine-learning notebooks scikit-learn tutorial

Last synced: 15 Apr 2025

https://github.com/balins/fuzzytree

A Fuzzy Decision Tree implementation for Python.

classification decision-trees fuzzy-logic machine-learning scikit-learn

Last synced: 14 Apr 2025

https://github.com/KarelZe/tclf

A scikit-learn compatible classifier to perform trade classification in Python.

empirical finance microstructure python rule-based-classifier scikit-learn trade-classification

Last synced: 06 Aug 2025

https://github.com/sa-y-an/omrnet

Neural Networks to evaluate OMR Sheets

computer-vision neural-network quantization scikit-learn tensorflow

Last synced: 15 Oct 2025

https://github.com/lucaangioloni/fit-covid19

A simple model that fits a logistic curve to COVID-19 data from Italy. Demo: https://fit-covid19.herokuapp.com

contagi-giornalieri coronavirus covid-19 covid-virus covid19 demo forecasting italy logistic prediction python regression scikit-learn totale-contagi

Last synced: 10 Sep 2025

https://github.com/aliktk/python_chilla

This repository contains practice materials on Python, used to deliver an online training course. The course was sponsored by codenics and Scholership Network, Pakistan.

course data-science eda machine-learning-algorithms pandas-python python scikit-learn training

Last synced: 10 Apr 2025

https://github.com/microsoft/python-sklearn-regression-cookiecutter

Cookiecutter template for testing Python scikit-learn regression learners.

cookiecutter machine-learning python scikit-learn template

Last synced: 27 Apr 2026

https://github.com/amirmardan/ml_course

This repository belongs to a machine learning with Python course that is being prepared for AUT

data-analysis-python data-science deep-learning keras machine-learning python pytorch scikit-learn tensorflow

Last synced: 28 Jul 2025

https://github.com/ccrvlh/autoclass

Script that automatically classifies your bank statement into pre-determined labels.

banking banking-applications machine-learning personal-finance python scikit-learn

Last synced: 11 Apr 2025

https://github.com/scikit-learn-contrib/sklearn-ann

Integration with (approximate) nearest neighbors libraries for scikit-learn, plus clustering based on kNN graphs.

approximate-nearest-neighbor-search clustering knn knn-graphs scikit-learn

Last synced: 10 Apr 2025

https://github.com/aianytime/machine-learning-models-implementation

Implementation of several ML models on real-world datasets with detailed explanation in notebooks.

eda machine-learning machine-learning-algorithms ml numpy pandas pycaret python scikit-learn scikitlearn-machine-learning sklearn

Last synced: 28 Jun 2025

https://github.com/karelze/tclf

A scikit-learn compatible classifier to perform trade classification in Python.

empirical finance microstructure python rule-based-classifier scikit-learn trade-classification

Last synced: 26 Oct 2025

https://github.com/huangcongqing/sklearn

Commonly used machine learning methods in scikit-learn (sklearn), including regression, dimensionality reduction, classification, and clustering

scikit-learn sklearn

Last synced: 01 Mar 2026

https://github.com/mohammadreza-mohammadi94/data-analysis-and-machine-learning-projects

A comprehensive collection of data analysis and machine learning projects, showcasing techniques and models for various data challenges. Dive in to explore code examples, analyses, and machine learning workflows.

data-analysis data-science dataframes deep-learning exploratory-data-analysis hyperparameter-tuning machine-learning machine-learning-algorithms pandas python scikit-learn visualization

Last synced: 06 Oct 2025

https://github.com/godfanmiao/pyai-github-2024

Complete data and open source code for the book "Python Artificial Intelligence Programming Practice (2024 Edition)" [published]

git paddlepaddle pandas pyspark python3 pytorch scikit-learn tensorflow

Last synced: 15 Apr 2025

https://github.com/soda-inria/sklearn-numba-dpex

Experimental plugin for scikit-learn to be able to run (some estimators) on Intel GPUs via numba-dpex.

gpu intel numba-dpex scikit-learn

Last synced: 24 Nov 2025

https://github.com/yashksaini-coder/floraloracle--iris-inference-hub

The objective is to combine prediction and classification scenarios of machine learning algorithms on a morphological flower dataset

classification data-science jupyter-notebook machine-learning machine-learning-algorithms machinelearning prediction-model python3 scikit-learn scikitlearn-machine-learning

Last synced: 11 Apr 2025

https://github.com/rclement/datasette-ml

A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models

ai datasette datasette-plugin machine-learning mlops python scikit-learn sql sqlite

Last synced: 26 Oct 2025

https://github.com/alexioannides/lime-interpretable-ml

An example of how the LIME algorithm can be used to provide real-world insight into the decision processes of a 'black-box' machine learning algorithm - in this case a Random Forest regressor.

data-science interpretability lime machine-learning numpy pandas pydata python scikit-learn

Last synced: 29 Oct 2025

https://github.com/ashishpatel26/MLflow_End_to_End_Example

MLflow is an open source platform for the machine learning lifecycle. Here you can learn MLflow through an end-to-end example with prediction.

mlflow mlflow-example mlflow-tracking scikit-learn xgboost

Last synced: 08 May 2025

https://github.com/ashishpatel26/sklearn-ranking

Sklearn-ranking is a package of ranking algorithms for recommendation systems. RANKSVM, RANKBOOST, and RANKNET are included in this package.

ranking ranking-algorithm ranking-methods scikit-learn sklearn sklearn-ranking

Last synced: 13 Apr 2025

https://github.com/aleximb/automl-streams

AutoML framework for implementing automated machine learning on data streams

automl data-streams scikit-learn

Last synced: 18 Mar 2025

https://github.com/hbldh/skboost

MILBoost and other boosting algorithms, compatible with scikit-learn

boosting boosting-algorithms machine-learning scikit-learn

Last synced: 14 Apr 2025

https://github.com/gjbex/machine-learning-with-python

Repository for participants of the "Machine learning with Python" training

deep-learning keras machine-learning python python-training scikit-learn training

Last synced: 13 Jul 2025

https://github.com/orehga/machine-learning-starter-pack

Heard of Machine Learning? What's it all about? 🤷🏾‍♀️ This repo will contain tutorials on different models required to give you an intro into the world of Machine Learning

machine-learning machine-learning-algorithms nextbillionusers scikit-learn

Last synced: 15 Jul 2025

https://github.com/centre-for-humanities-computing/stormtrooper

Zero/few shot learning components for scikit-learn pipelines with LLMs and transformers.

chatgpt few-shot-learning gpt-4 large-language-models llm scikit-learn transformer transformers zero-shot-learning

Last synced: 24 Oct 2025

https://github.com/sravb/nba-predictive-analytics

Being able to perform gameplay analysis of NBA players, NBA Predictive Analytics is a basketball coach's new best friend.

basketball data-mining data-science data-visualization decision-tree k-nearest-neighbors kaggle-dataset machine-learning matplotlib nba-analytics pandas predictive-analytics python scikit-learn scipy

Last synced: 07 May 2025

https://github.com/nathadriele/machine-learning-zoomcamp

The Machine Learning Zoomcamp teaches foundational and advanced ML concepts using tools like NumPy, Pandas, Scikit-Learn, TensorFlow, XGBoost, Flask, Docker, AWS, Kubernetes, and KServe. It covers regression, classification, evaluation metrics, neural networks, deployment strategies, and end-to-end projects to bridge theory and practice.

aws classification deployment docker flask kserve kubernetes metrics metrics-visualization mlops-project neural-networks numpy pandas scikit-learn tensorflow xgboost

Last synced: 20 Jun 2025

https://github.com/zeke/github-avatars

A machine learning model to detect whether a GitHub user has a custom or default avatar

cog machine-learning replicate scikit-learn

Last synced: 28 Apr 2025

https://github.com/krypty/trefle

Trefle is a scikit-learn compatible estimator implementing the FuzzyCoCo algorithm that uses a cooperative coevolution algorithm to find and build interpretable fuzzy systems.

data-science deap evolutionary-algorithm fuzzy-logic interpretability machine-learning python scikit-learn

Last synced: 29 Oct 2025