An open API service indexing awesome lists of open source software.

Machine learning

Machine learning is the practice of teaching a computer to learn. The concept uses pattern recognition, as well as other forms of predictive algorithms, to make judgments on incoming data. This field is closely related to artificial intelligence and computational statistics.

https://github.com/daturkel/learning-papers

Landmark Papers in Machine Learning

machine-learning papers

Last synced: 03 Feb 2026

https://github.com/rougier/ML-Recipes

A collection of stand-alone Python machine learning recipes

algorithm awesome machine-learning neural-network python recipes reinforcement-learning

Last synced: 30 Mar 2025

https://github.com/summergift/embeddedsystem

:books: 计算机体系架构、嵌入式系统基础与主流编程语言相关内容总结

c cnn computer-vision data-structures iot linux machine-learning network python

Last synced: 15 May 2025

https://github.com/argoproj-labs/hera

Hera makes Python code easy to orchestrate on Argo Workflows through native Python integrations. It lets you construct and submit your Workflows entirely in Python. ⭐️ Remember to star!

argo argo-events argo-workflows cloud-native hera kubernetes library machine-learning pypi python sdk workflow-automation workflow-management workflows

Last synced: 02 Apr 2026

https://github.com/rougier/ml-recipes

A collection of stand-alone Python machine learning recipes

algorithm awesome machine-learning neural-network python recipes reinforcement-learning

Last synced: 25 Mar 2025

https://github.com/hrnbot/Basic-Mathematics-for-Machine-Learning

The motive behind Creating this repo is to feel the fear of mathematics and do what ever you want to do in Machine Learning , Deep Learning and other fields of AI

ai algebra beginner calculus linear-algebra machine-learning machinelearning-python mathematical-analysis mathematical-functions mathematics maths notebook prerequisites probability python pytorch siraj siraj-raval statistics

Last synced: 26 Apr 2025

https://github.com/mackelab/sbi

sbi is a Python package for simulation-based inference, designed to meet the needs of both researchers and practitioners. Whether you need fine-grained control or an easy-to-use interface, sbi has you covered.

bayesian-inference likelihood-free-inference machine-learning parameter-estimation pytorch simulation-based-inference

Last synced: 05 Apr 2025

https://github.com/github/codespaces-jupyter

Explore machine learning and data science with Codespaces

codespaces data-science jupyter-notebook machine-learning

Last synced: 11 Apr 2025

https://github.com/doubangotelecom/ultimatealpr-sdk

World's fastest ANPR / ALPR implementation for CPUs, GPUs, VPUs and NPUs using deep learning (Tensorflow, Tensorflow lite, TensorRT, OpenVX, OpenVINO). Multi-Charset (Latin, Korean, Chinese) & Multi-OS (Jetson, Android, Raspberry Pi, Linux, Windows) & Multi-Arch (ARM, x86).

alpr amlogic-npu android anpr anpr-sdk artificial-intelligence deep-learning jetson jetson-nano khadas-vim3 khadas-vims-boards license-plate license-plate-detection license-plate-recognition linux machine-learning openvino raspberry-pi tensorflow windows

Last synced: 15 May 2025

https://github.com/perpetual-ml/perpetual

Perpetual is a high-performance gradient boosting machine. It delivers optimal accuracy in a single run without complex tuning through a simple budget parameter. It features out-of-the-box support for causal ML, continual learning, native calibration, and robust drift monitoring, along with Rust core and zero-copy bindings for Python and R

data-science gbdt gbm gradient-boosted-trees gradient-boosting gradient-boosting-decision-trees kaggle machine-learning python rust

Last synced: 02 Apr 2026

https://github.com/The-FinAI/PIXIU

This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).

aifinance chatgpt fintech gpt-4 large-language-models llama machine-learning named-entity-recognition natural-language-processing nlp pixiu question-answering sentiment-analysis stock-price-prediction text-classification

Last synced: 12 Mar 2025

https://github.com/stratosphereips/StratosphereLinuxIPS

Slips, a free software behavioral Python intrusion prevention system (IDS/IPS) that uses machine learning to detect malicious behaviors in the network traffic. Stratosphere Laboratory, AIC, FEL, CVUT in Prague.

ai docker endpoint-protection gsoc-2023 gsoc-2024 ids intrusion-detection-system intrusion-prevention-system ips machine-learning network-analysis network-security pcap stratosphere-ips zeek

Last synced: 30 Mar 2025

https://github.com/arunponnusamy/cvlib

A simple, high level, easy to use, open source Computer Vision library for Python.

computer-vision deep-learning image-processing machine-learning python

Last synced: 09 May 2025

https://github.com/motiwari/BanditPAM

BanditPAM C++ implementation and Python package

clustering machine-learning python

Last synced: 05 Mar 2026

https://github.com/BlackSamorez/tensor_parallel

Automatically split your PyTorch models on multiple GPUs for training & inference

deep-learning machine-learning natural-language-processing nlp python pytorch pytorch-transformers

Last synced: 22 Jul 2025

https://github.com/michaelthwan/searchGPT

Grounded search engine (i.e. with source reference) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search etc.

ai chatgpt grounded-api grounded-bot language-model llm machine-learning nlp nlp-machine-learning openai python retrieval retrieval-model

Last synced: 27 Apr 2025

https://github.com/fabsig/GPBoost

Combining tree-boosting with Gaussian process and mixed effects models

artificial-intelligence boosting cpp data-science gaussian-processes machine-learning mixed-effects python r

Last synced: 04 Feb 2026

https://github.com/blacksamorez/tensor_parallel

Automatically split your PyTorch models on multiple GPUs for training & inference

deep-learning machine-learning natural-language-processing nlp python pytorch pytorch-transformers

Last synced: 14 Dec 2025

https://github.com/qgallouedec/panda-gym

Set of robotic environments based on PyBullet physics engine and gymnasium.

artificial-intelligence deep-learning franka-emika machine-learning python reinforcement-learning robotics

Last synced: 15 May 2025

https://github.com/ch-sa/labelCloud

A lightweight tool for labeling 3D bounding boxes in point clouds.

3d-object-detection 6d-pose-estimation annotation bounding-boxes computer-vision labeling machine-learning point-clouds tool

Last synced: 20 Mar 2025

https://github.com/google/grain

Library for reading and processing ML training data.

data-pr jax machine-learning python

Last synced: 14 Jan 2026

https://github.com/fastai/fastai2

Temporary home for fastai v2 while it's being developed

data-science deep-learning fastai jupyter machine-learning nbdev python pytorch

Last synced: 19 Jul 2025

https://github.com/IntelPython/sdc

Numba extension for compiling Pandas data frames, Intel® Scalable Dataframe Compiler

big-data compilers machine-learning numpy pandas parallel-computing python

Last synced: 09 May 2025

https://github.com/blackhc/tfpyth

Putting TensorFlow back in PyTorch, back in TensorFlow (differentiable TensorFlow PyTorch adapters).

machine-learning pytorch tensorflow

Last synced: 04 Apr 2025

https://github.com/faktionai/awesome-ai-usecases

A list of awesome and proven Artificial Intelligence use cases and applications

data-science machine-learning

Last synced: 14 Mar 2025

https://github.com/Ranlot/single-parameter-fit

Real numbers, data science and chaos: How to fit any dataset with a single parameter

chaos-theory goodness-of-fit machine-learning

Last synced: 15 Mar 2025

https://github.com/BlackHC/tfpyth

Putting TensorFlow back in PyTorch, back in TensorFlow (differentiable TensorFlow PyTorch adapters).

machine-learning pytorch tensorflow

Last synced: 20 Mar 2025

https://github.com/wanmeihuali/taichi_3d_gaussian_splatting

An unofficial implementation of paper 3D Gaussian Splatting for Real-Time Radiance Field Rendering by taichi lang.

3d-reconstruction 3d-rendering computer-graphics computer-vision machine-learning nerf python pytorch real-time-rendering taichi

Last synced: 11 Apr 2025

https://github.com/rnchg/apt

AI Productivity Tool - Free and open source, improve user productivity, protect privacy and data security. Provide efficient and convenient AI solutions, built-in local exclusive ChatGPT, Phi, DeepSeek, one-click batch intelligent processing of pictures, videos, audio, etc.

ai ai-framework aigc audio-processing chatgpt computer-vision deep-learning deepseek generative-ai image-processing inference llm machine-learning machinelearning neural-network onnx onnxruntime video-processing

Last synced: 15 May 2025

https://github.com/yinboc/few-shot-meta-baseline

Meta-Baseline: Exploring Simple Meta-Learning for Few-Shot Learning, in ICCV 2021

few-shot-learning machine-learning pytorch

Last synced: 27 Oct 2025

https://github.com/gregversteeg/corex_topic

Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorEx

information-theory machine-learning python topic-modeling unsupervised-learning

Last synced: 10 Feb 2026

https://github.com/tessellate-imaging/monk_object_detection

A one-stop repository for low-code easily-installable object detection pipelines.

computervision deeplearning hacktoberfest machine-learning python3

Last synced: 08 Apr 2025

https://github.com/Tessellate-Imaging/Monk_Object_Detection

A one-stop repository for low-code easily-installable object detection pipelines.

computervision deeplearning hacktoberfest machine-learning python3

Last synced: 05 May 2025

https://github.com/Yuan-ManX/ai-audio-datasets

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.

aigc artificial-intelligence audio audio-effect audio-generation datasets deep-learning machine-learning music-generation

Last synced: 17 Mar 2025

https://github.com/ashishpatel26/real-time-ml-project

A curated list of applied machine learning and data science notebooks and libraries across different industries.

application deep-learning deeplearning dl keras machine-learning machine-learning-algorithms machinelearning ml ml-application project pytorch real-time real-time-data rl tensorflow theano

Last synced: 13 Apr 2025

https://github.com/ashishpatel26/Real-time-ML-Project

A curated list of applied machine learning and data science notebooks and libraries across different industries.

application deep-learning deeplearning dl keras machine-learning machine-learning-algorithms machinelearning ml ml-application project pytorch real-time real-time-data rl tensorflow theano

Last synced: 29 Mar 2025

https://github.com/vkosuri/courseramachinelearning

Coursera Machine Learning By Prof. Andrew Ng

coursera coursera-machine-learning machine-learning

Last synced: 04 Apr 2025

https://github.com/rbbrdckybk/ai-art-generator

For automating the creation of large batches of AI-generated artwork locally.

clip-guided-diffusion deep-learning generative-art image-generation machine-learning stable-diffusion vqgan-clip

Last synced: 10 Apr 2025

https://github.com/tensorflow/tcav

Code for the TCAV ML interpretability project

interpretability machine-learning tcav

Last synced: 08 Apr 2025

https://github.com/waikato/moa

MOA is an open source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regression, clustering, outlier detection, concept drift detection and recommender systems) and tools for evaluation.

clustering data-stream-mining java machine-learning machine-learning-algorithms moa streaming-algorithms

Last synced: 15 May 2025

https://github.com/rnchg/Apt

AI Productivity Tool - Free and open source, improve user productivity, protect privacy and data security. Provide efficient and convenient AI solutions, built-in local exclusive ChatGPT, Phi, DeepSeek, one-click batch intelligent processing of pictures, videos, audio, etc.

ai ai-framework aigc audio-processing chatgpt computer-vision deep-learning deepseek generative-ai image-processing inference llm machine-learning machinelearning neural-network onnx onnxruntime video-processing

Last synced: 24 Mar 2025

https://github.com/kubeflow-kale/kale

Kubeflow’s superfood for Data Scientists

jupyter-notebook kubeflow kubeflow-pipelines machine-learning

Last synced: 15 May 2025

https://github.com/perone/euclidesdb

A multi-model machine learning feature embedding database

cpp database deep-learning machine-learning pytorch search

Last synced: 04 Apr 2025

https://github.com/sforaidl/kd_lib

A Pytorch Knowledge Distillation library for benchmarking and extending works in the domains of Knowledge Distillation, Pruning, and Quantization.

algorithm-implementations benchmarking data-science deep-learning-library knowledge-distillation machine-learning model-compression pruning pytorch quantization

Last synced: 16 May 2025

https://github.com/bminixhofer/nlprule

A fast, low-resource Natural Language Processing and Text Correction library written in Rust.

grammar grammatical-error-correction machine-learning natural-language-processing nlp proofreading rust spellcheck style-checker

Last synced: 15 May 2025

https://github.com/cerndb/dist-keras

Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.

apache-spark data-parallelism data-science deep-learning distributed-optimizers hadoop keras machine-learning optimization-algorithms tensorflow

Last synced: 03 Oct 2025

https://github.com/lean-dojo/LeanDojo

Tool for data extraction and interacting with Lean programmatically.

lean lean4 machine-learning theorem-proving

Last synced: 27 Mar 2025

https://github.com/google/yggdrasil-decision-forests

A library to train, evaluate, interpret, and productionize decision forest models such as Random Forest and Gradient Boosted Decision Trees.

cart cli cpp decision-forest decision-trees distributed-computing go gradient-boosting interpretability javascript machine-learning ml pypi python random-forest tensorflow

Last synced: 12 Jan 2026

https://github.com/chakki-works/sumeval

Well tested & Multi-language evaluation framework for text summarization.

bleu machine-learning rouge text-summarization

Last synced: 16 May 2025

https://github.com/rstojnic/lazydata

Lazydata: Scalable data dependencies for Python projects

data-science datamanagement machine-learning python

Last synced: 26 Mar 2025

https://github.com/Jakobovski/free-spoken-digit-dataset

A free audio dataset of spoken digits. An audio version of MNIST.

audio dataset machine-learning mnist speech-recognition spoken-digits spoken-language

Last synced: 14 Apr 2025

https://github.com/d2l-ai/d2l-vi

Một cuốn sách về Học Sâu đề cập đến nhiều framework phổ biến, được sử dụng trên 300 trường Đại học từ 55 đất nước bao gồm MIT, Stanford, Harvard, và Cambridge.

computer-vision d2l deep-learning kaggle keras machine-learning mlbvn mxnet python pytorch tensorflow vietnamese-language

Last synced: 04 Apr 2025

https://github.com/farukalamai/advanced-machine-learning-engineer-roadmap-2024

A Full Stack ML (Machine Learning) Roadmap involves learning the necessary skills and technologies to become proficient in all aspects of machine learning, including data collection and preprocessing, model development, deployment, and maintenance.

aws computer-vision data-analysis data-science data-visualization deep-learning git-github machine-learning machine-learning-roadmap mlops natural-language-processing neural-network nlp opencv pandas python pytorch statistics tensorflow yolo

Last synced: 04 Apr 2025

https://github.com/michaelthwan/searchgpt

Grounded search engine (i.e. with source reference) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search etc.

ai chatgpt grounded-api grounded-bot language-model llm machine-learning nlp nlp-machine-learning openai python retrieval retrieval-model

Last synced: 21 Apr 2025

https://github.com/eugeneyan/ml-design-docs

📝 Design doc template & examples for machine learning systems (requirements, methodology, implementation, etc.)

design design-docs machine-learning

Last synced: 11 Feb 2026

https://github.com/meesho/bharatmlstack

BharatMLStack is an open-source, end-to-end machine learning infrastructure stack built at Meesho to support real-time and batch ML workloads at Bharat scale

ai feature-engineering feature-store-online machine-learning ml mlops

Last synced: 15 Apr 2026

https://github.com/ydli-ai/csl

[COLING 2022] CSL: A Large-scale Chinese Scientific Literature Dataset 中文科学文献数据集

chinese-nlp dataset machine-learning scientific-publications

Last synced: 05 Apr 2025

https://github.com/alankbi/detecto

Build fully-functioning computer vision models with PyTorch

computer-vision faster-rcnn machine-learning object-detection python pytorch

Last synced: 21 Oct 2025