An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/sayakpaul/generating-categories-from-arxiv-paper-titles

This project takes the arXiv dataset and builds an automatic tag classifier from the arXiv article/paper titles

arxiv deep-learning gcp natural-language-processing scikit-learn tensorflow wandb

Last synced: 07 May 2025

https://github.com/jorgemunozl/physical_simulations

A set of physics/mathematics scripts for different concepts.

mathematics nn numpy physics python scikit-learn sympy

Last synced: 04 May 2026

https://github.com/snehankekre/streamlit-yellowbrick

Streamlit component for the Yellowbrick visualization and model diagnostics library

data-visualization machine-learning matplotlib python scikit-learn streamlit visualization yellowbrick

Last synced: 24 Oct 2025

https://github.com/veb-101/machine-learning-algorithms

One notebook to learn it all - Algorithms from scratch

matplotlib numpy pandas python-3-7 scikit-learn scipy seaborn

Last synced: 27 Aug 2025

https://github.com/zeke/github-avatars

A machine learning model to detect whether a GitHub user has a custom or default avatar

cog machine-learning replicate scikit-learn

Last synced: 28 Apr 2025

https://github.com/genfifth/cvopt

Machine learning's parameter search and feature selection module which is integrated log management and visualization.

bayesian-optimization deep-learning feature-selection hyperopt hyperparameter-optimization integrated-visualization keras logmanagement machine-learning python scikit-learn

Last synced: 10 Apr 2025

https://github.com/amirhosseinhonardoust/underwriting-decision-safety-lab

A decision-safety lab for loan approval: trains a baseline classifier, calibrates probabilities (ECE/Brier), sweeps confidence thresholds to build a coverage, quality frontier and outputs a defensible abstention policy (auto-decide vs review). Includes a Streamlit dashboard for report cards, triage UI, and data quality checks.

abstention calibration classification credit-risk data-quality data-science decision-policy loan-approval machine-learning mlops model-evaluation monitoring pandas reliability responsible-ai scikit-learn selective-classification streamlit uncertainty underwriting

Last synced: 10 Jun 2026

https://github.com/shukkkur/tennis-match-prediction

Predicting the winner of the Tennis match

machine-learning prediction python scikit-learn tennis

Last synced: 08 Sep 2025

https://github.com/kenlimmj/fightin-words

A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method.

bayesian-methods evaluation-metrics nlp scikit-learn

Last synced: 30 Oct 2025

https://github.com/chandraprakash-bathula/apparel-recommendations

This project implements a personalized apparel recommendation engine using content-based search with the Amazon API, NLTK, and Keras libraries.

boxplot cnn-keras data-analysis data-science deep-learning linear-regression machine-learning numpy pandas scatter-plot scikit-learn svm tensorflow xgboost

Last synced: 23 Mar 2025

https://github.com/mantreshkhurana/twitter-toxicity-detection-tkinter

This is a simple python program which uses a machine learning model to detect toxicity in tweets, GUI in Tkinter.

hate-speech-detection linear-regression ml python scikit-learn sklearn tkinter toxicity-detection twitter-api

Last synced: 05 Mar 2025

https://github.com/doubleml/doubleml-serverless

DoubleML-Serverless - Distributed Double Machine Learning with a Serverless Architecture

aws-lambda causal-inference data-science double-machine-learning econometrics machine-learning python scikit-learn serverless statistics

Last synced: 07 May 2025

https://github.com/anaclumos/heart-diagnosis-engine

2019년 민족사관고등학교 졸업 프로젝트

data-science machine-learning pandas python scikit-learn

Last synced: 22 Aug 2025

https://github.com/rasmusrynell/predicting-nhl

The project explores the idea of using different machine learning techniques to determine different stats in NHL games.

ai algorithms data-science database machine-learning ml nhl nhl-api python scikit-learn sports sports-analytics sports-stats sportsanalytics

Last synced: 14 Apr 2025

https://github.com/qoyyuum/forex-mt5-bot

To auto learn, analyze and predict the Forex Market and autotrade with Metatrader 5 API and Python

docker docker-compose metatrader5 python3 pytorch scikit-learn

Last synced: 18 Aug 2025

https://github.com/snoop2head/instagram_hashtag_analysis

📷 Crawl and Analyze Instagram Hashtag Data: KoNLPY to gensim word2Vec & scikit-learn TF-IDF

adjective gensim gensim-word2vec instagram-hashtag-analysis konlpy natural-language-processing noun scikit-learn scikitlearn tf-idf word2vec

Last synced: 03 Apr 2025

https://github.com/ashishpatel26/regressionmetrics

Regression Metrics Calculation Made easy for tensorflow2 and scikit-learn

keras metrics regression-metrics scikit-learn tensorflow2

Last synced: 27 Feb 2026

https://github.com/supercowpowers/scp-labs

SCP Labs (Open Source Team for SuperCowPowers)

data-analysis data-science pandas python scikit-learn security

Last synced: 06 May 2025

https://github.com/torkamanilab/zoish

Zoish is a Python package that streamlines machine learning by leveraging SHAP values for feature selection and interpretability, making model development more efficient and user-friendly

automl data-science feature-engineering feature-selection machine-learning python scikit-learn

Last synced: 10 Apr 2025

https://github.com/djeada/numerical-methods

Comprehensive library of numerical methods implemented in Python. It includes solutions to various mathematical problems, detailed explanations of each method, illustrative examples, and comparisons with prominent scientific libraries like Numpy, Scikit-Learn, and SciPy.

jupyter-notebook linear-algebra matplotlib numerical-methods numpy python scikit-learn scipy

Last synced: 13 Apr 2025

https://github.com/ncar/bridgescaler

Bridge your scikit-learn scaler parameters between Python sessions and users. Distribute your scaling across multiple processes and data subsets.

ai machine-learning scikit-learn

Last synced: 05 May 2026

https://github.com/mohd-faizy/career-track-data-scientist-with-python

This Repo contains tools that allow us to import, clean, manipulate, and visualize data —Includes Python libraries, like pandas, NumPy, Matplotlib, and many more to work with real-world datasets to learn the statistical and machine learning techniques.

data-science data-visualization datascience-machinelearning datasciencecoursera datascientist datascientisttraining decision-trees hypothesis hypothesis-testing machine-learning machine-learning-algorithms nlp-machine-learning numpy pandas python scikit-learn seaborn statistical

Last synced: 12 Jul 2025

https://github.com/axegon/sklite

Transpile scikit-learn models to Flutter

fluttter python36 scikit-learn

Last synced: 23 Jul 2025

https://github.com/edikedik/eboruta

Flexible and transparent Python Boruta implementation

ensemble-models feature-selection machine-learning python scikit-learn

Last synced: 10 Apr 2025

https://github.com/yu9824/kennard_stone

This is an algorithm for evenly partitioning.

kfold-cross-validation python scikit-learn train-test-split

Last synced: 16 Mar 2025

https://github.com/r-m-n/sklearn-deltatfidf

DeltaTfidfVectorizer for scikit-learn

delta-tf-idf python scikit-learn sentiment-analysis sklearn tf-idf

Last synced: 10 Sep 2025

https://github.com/starkblaze01/sentiment-analyzer

Model for Sentiment Analysis using Naive Bayes and CNN, and implementation of Model on Tweets and Web Application using React

flask-server keras-tensorflow python reactjs scikit-learn sentiment-analysis sklearn tweepy typescript

Last synced: 16 Mar 2025

https://github.com/chrislemke/sk-transformers

A collection of pandas & scikit-learn compatible transformers for preprocessing and feature engineering 🛠

data-science feature-engineering feature-selection machine-learning pandas preprocessing python scikit-learn scikit-learn-pipelines scikit-learn-transformer

Last synced: 17 Jun 2025

https://github.com/tsg405/applied-machine-learning-in-python

This Repo contains - Starter files, Coursework, Programming Assignments for the course --> Applied Machine Learning in Python, University of Michigan [COURSERA]

applied-machine-learning assignment classification coursera data-science fruit-dataset machine-learning matplotlib-pyplot numpy pandas python quiz regression scikit-learn scipy seaborn supervised-machine-learning university-of-michigan unsupervised-machine-learning

Last synced: 14 Apr 2025

https://github.com/jeffzi/pandas-select

Supercharged pandas indexing

pandas pandera python scikit-learn

Last synced: 22 Apr 2025

https://github.com/azure/mlops-starter-sklearn

Azure Machine Learning と GitHub を利用した MLOps のサンプルコード

azure azure-machine-learning devops machine-learning microsoft mlops mypy pytest python responsible-ai scikit-learn

Last synced: 04 Sep 2025

https://github.com/vuthanhhai2302/apply-machine-learning-on-data-analytics

My project of applied machine learning on data analytics, using pandas, numpy and scikit-learn to analyze data

data-analysis numpy pandas scikit-learn

Last synced: 28 Apr 2025

https://github.com/akoury/ml-helper

Python library with helpers to speed up and structure machine learning projects.

data data-visualization machine-learning ml python scikit-learn sklearn

Last synced: 24 Oct 2025

https://github.com/nossbigg/fyp_py

Source code for A Study on Rumour Detection on Online Social Networks final year research project

machine-learning nltk python scikit-learn sentiment-analysis twitter

Last synced: 17 Jun 2025

https://github.com/tushar50896/cuss_inspect

A basic and simple yet powerful Python library to detect toxicity/profanity of a review or list of reveiws.

abusive-language-detection cusswords logistic-regression profanity profanity-detection python review-checks scikit-learn swearing-detector toxic-comment-classification

Last synced: 05 Jul 2025

https://github.com/contextlab/data-wrangler

Wrangle messy numerical, image, and text data into consistent well-organized formats

data data-analysis data-science data-wrangling hugging-face image-data machine-learning nlp numpy pandas python scikit-learn

Last synced: 10 Apr 2025

https://github.com/sshh12/csgo-market-analysis

Some interesting stats from the CS:GO Steam community market.

csgo csgo-skins scikit-learn steam-market

Last synced: 19 Mar 2025

https://github.com/zainulmustafa/stock-prediction-rnn-lstm

Stock prediction done using RNN and LTSM to resolve vanishing gradient problem. Dataset used is obtained from Pakistan Stock Exchange

json keras-tensorflow matplotlib numpy python3 scikit-learn tensorflow

Last synced: 31 Jul 2025

https://github.com/lazarust/sklx

A scikit-learn compatible neural network library that wraps MLX.

mlx scikit-learn

Last synced: 07 May 2025

https://github.com/isala404/r2d2

Line Following Robot Powered By OpenCV and Machine Learing

ai linefollower machine-learing numpy opencv python3 rasberrypi scikit-learn tensorflow

Last synced: 07 Sep 2025

https://github.com/herrfeder/ai_cybersecurity_ids_poc

Winning Contribution of Michael Schwabe and David Lassig to BWI Data Analytics Hackathon 2020 in the Category Cyber Security. Proof of Concept Intrusion Detection using Zeek with selfmade MachineLearning in a nice WebApp.

circleci cloudformation cyber-security dash docker-container intrusion-detection keras kubernetes machine-learning plotly python scikit-learn tensorflow zeek

Last synced: 13 Apr 2025

https://github.com/gagolews/analiza_danych_w_jezyku_python

M. Gągolewski, M. Bartoszuk, A. Cena, Przetwarzanie i analiza danych w języku Python, PWN, 2016

data-science matplotlib numpy pandas polski python scikit-learn

Last synced: 14 Jul 2025

https://github.com/qiancao/hskl

A library for hyperspectral image analysis using scikit-learn.

hyperspectral image-analysis-toolbox machine-learning scikit-learn

Last synced: 13 Apr 2025

https://github.com/alexioannides/bodywork-mlops-demo

Demonstrating how Bodywork can be used to deploy a simulation of the lifecycle of a train-and-serve ML pipeline, responding to new data undergoing concept drift.

aws data-science docker kubernetes machine-learning mlops numpy python scikit-learn

Last synced: 29 Oct 2025

https://github.com/time-series-machine-learning/tsml-py

A toolkit for time series machine learning algorithms that don't fit in aeon. Use aeon instead if you can!

data-science machine-learning python scikit-learn time-series time-series-classification time-series-clustering time-series-regression

Last synced: 02 Apr 2026

https://github.com/bnediction/scboolseq

scBoolSeq: scRNA-Seq data binarisation and synthetic generation from Boolean dynamics

bioinformatics boolean-networks computational-biology machine-learning pandas python3 scikit-learn scrna-seq single-cell-rna-seq

Last synced: 09 Mar 2026

https://github.com/markdouthwaite/serverless-scikit-learn-demo

A repository providing demo code for deploying a lightweight Scikit-Learn based ML pipeline modelling heart disease data as a Google Cloud Function.

data-science google-cloud google-cloud-function machine-learning machine-learning-api machine-learning-projects scikit-learn serverless tutorial

Last synced: 25 Jul 2025

https://github.com/greed2411/plinearregression

Scikit-Learn's linear regression extended with p-values.

hypothesis-testing p-values python3 regression scikit-learn

Last synced: 24 Aug 2025

https://github.com/talha1503/stackoverflow_questions_tagger

Predicting tags for StackOverflow Questions

beautifulsoup scikit-learn scikit-multilearn

Last synced: 18 Jun 2025

https://github.com/wilmeragsgh/adabnn

Code related to thesis work: "AdaBnn: Binarized Neural Networks trained with adaptive structural learning"

deep-learning keras neural-networks scikit-learn tensorflow

Last synced: 11 Jun 2025

https://github.com/brackendev/scikit-learn-hy

An introduction to scikit-learn (machine learning in Python) and Hy (a Lisp dialect embedded in Python)

hy hylang machine-learning python scikit-learn tutorial

Last synced: 09 Oct 2025

https://github.com/ayrna/dlordinal

Open-source Python toolkit focused on deep learning with ordinal methodologies

deep-learning ordinal-classification python pytorch scikit-learn

Last synced: 26 Sep 2025

https://github.com/sayakpaul/dockerml

Contains my explorations of using Docker to automate ML workflows.

ci-cd docker scikit-learn tensorflow wandb

Last synced: 05 Sep 2025

https://github.com/jlgarridol/sslearn

The sslearn library is a Python package for machine learning over Semi-supervised datasets. It is an extension of scikit-learn.

classification-algorithm machine-learning scikit-learn scikit-learn-api semi-supervised semi-supervised-learning semisupervised-learning

Last synced: 30 Oct 2025

https://github.com/mrankitgupta/python-libraries-roadmap

I am sharing lessons in various Python Libraries from scratch to intermediate including practice sets which were useful into my journey of Data Science.

66daysofdata ai analytics ankitgupta artificial-intelligence data-science data-visualization libraries library machine-learning matplotlib mrankitgupta numpy pandas python python-libraries python-library pythonlib scikit-learn tensorflow

Last synced: 22 Apr 2025

https://github.com/smohiudd/pedestrian-collision-prediction

Pedestrian collision prediction using GeoVex embeddings and linear classifier

scikit-learn sdv srai

Last synced: 26 Oct 2025

https://github.com/lai-bluejay/diego

Diego: Data in, IntElliGence Out. A fast framework that supports the rapid construction of automated learning tasks. Simply create an automated learning study (Study) and generate correlated trials (Trial). Then run the code and get a machine learning model. Implemented using Scikit-learn API glossary, using Bayesian optimization and genetic algorithms for automated machine learning. Inspired by [Fast.ai](https://github.com/fastai/fastai).

automl autosklearn bayesian-optimization generation-algorithms hyperparameter-optimization machine-learning scikit-learn

Last synced: 13 Apr 2025

https://github.com/posit-dev/orbital

Turn SciKitLearn pipelines into SQL

machine-learning python scikit-learn sql

Last synced: 09 Feb 2026

https://github.com/claudiucreanga/hands-on-machine-learning-scikit-learn-tensorflow-oreilly-geron

Book Hands on Machine Learning with Scikit-Learn and Tensorflow from O'reilly - Geron

machine-learning python scikit-learn tensorflow

Last synced: 14 Oct 2025

https://github.com/adrienc21/vulpes

Vulpes: Test many classification, regression models and clustering algorithms to see which one is most suitable for your dataset

automl data-analysis data-science machine-learning models package python scikit-learn statistics

Last synced: 25 Oct 2025

https://github.com/ammirsm/automatic-pancake

Active learning agent-based-simulation for systematic reviews and other types of technology assisted review (TAR) which will include PDF documents and other meta-datas in itself and it's based on both fulltext-screening decisions and title-screening decisions.

active-learning agent-based-simulation machine-learning pdf-document-processor python scikit-learn systematic-review systematic-reviews technology-assisted-review

Last synced: 12 Apr 2025

https://github.com/learnables/torchml

Scikit-learn implemented with PyTorch

machine-learning pytorch scikit-learn

Last synced: 14 Apr 2025

https://github.com/gbolmier/sklearn-neighbors-benchmark

:bar_chart: Scikit-learn nearest neighbors algorithms benchmark

benchmark nearest-neighbors-algorithms scikit-learn

Last synced: 11 Apr 2025

https://github.com/integeralex/netflix-recommendation-system

This Netflix Recommendation System is a web application developed using Node.js and Express. It utilizes a recommendation engine written in Python

ai collaborate docker express netflix nodejs pandas recommendation-system scikit-learn

Last synced: 22 Apr 2025

https://github.com/cyberfantics/bitcoin-price-prediction

A deep learning-based web app for predicting future Bitcoin prices using historical data. Users can interactively select prediction days and view recent price data in real-time.

artificial-intelligence artificial-neural-networks bitcoin deep-learning machine-learning neural-network prediction-model scikit-learn tensorflow

Last synced: 13 Aug 2025

https://github.com/octo-technology/ddui

Airflow's plugin for Data Science pipeline visualisation

airflow airflow-plugin datadriver datascience ml pandas-python scikit-learn

Last synced: 24 Oct 2025

https://github.com/prakharchoudhary/mlworld

A collection of simple machine learning projects, that got me started in this wonderful domain!

classification clustering iris-dataset keras-neural-networks knn machine-learning neural-networks numpy pandas python scikit-learn

Last synced: 09 Apr 2025

https://github.com/tlapusan/woodpecker

A python library used for tree structure interpretation.

decision-trees machine-learning random-forest scikit-learn sklearn visualization

Last synced: 08 May 2025

https://github.com/yash22222/ibm-csrbox-internship-project

The objective of the Data Analytics internship at CSRBOX is to provide interns with hands-on experience in applying data analytics techniques to real-world projects in the field of corporate social responsibility (CSR). Interns will gain practical skills in data collection, cleaning, analysis, visualization, and reporting, while working on projects

data-mining data-preprocessing data-science exploratory-data-analysis feature-engineering lemmatization machine-learning pandas pos-tagging random-forest random-forest-classifier scikit-learn sentiment-analysis web-scraping wordcloud

Last synced: 22 Apr 2025

https://github.com/tatevkaren/deep-learning-for-data-science

Deep Learning Case Studies with Tensorflow and Keras for Beginners-Advanced: ANN, CNN, RNN, Self-Organizing Maps, Boltzmann Machines, Stacked Autoencoders

ann artificial-intelligence artificial-neural-networks data-preprocessing data-science deep-learning ds keras modelling modelling-framework neural-networks numpy pandas python scikit-learn sklearn tensorflow

Last synced: 10 Apr 2025

https://github.com/marty1885/scirknn

Convert and run scikit-learn MLPs on Rockchip NPU.

inference-acceleration npu rk3566 rk3588 rknn rknpu2 rockchip scikit-learn

Last synced: 31 Jul 2025

https://github.com/rebelosa/random-subgroups

A machine learning python package for learning ensembles of subgroups for predictive tasks.

interpretability interpretable-machine-learning pysubgroup python python-package python3 random-forest scikit-learn subgroup-discovery subgroups

Last synced: 30 Jul 2025