An open API service indexing awesome lists of open source software.

Machine learning

Machine learning is the practice of teaching a computer to learn. The concept uses pattern recognition, as well as other forms of predictive algorithms, to make judgments on incoming data. This field is closely related to artificial intelligence and computational statistics.

https://github.com/nabla-ml/nabla

Build and train Neural Networks in Mojo

ai arrays compiler jax machine-learning modular mojo numpy pytorch

Last synced: 01 Apr 2026

https://github.com/modernatx/seqlike

Unified biological sequence manipulation in Python

biological-sequences biopython machine-learning sequence

Last synced: 21 Oct 2025

https://github.com/statusfailed/catgrad

a categorical deep learning compiler

deep-learning machine-learning python

Last synced: 27 Jan 2026

https://github.com/helmut-hoffer-von-ankershoffen/jetson

Helmut Hoffer von Ankershoffen experimenting with arm64 based NVIDIA Jetson (Nano and AGX Xavier) edge devices running Kubernetes (K8s) for machine learning (ML) including Jupyter Notebooks, TensorFlow Training and TensorFlow Serving using CUDA for smart IoT.

ansible archiconda cuda docker edge-devices hoffer-von-ankershoffen jupyter k8s kubeflow kubernetes kustomize machine-learning ml nvidia-jetson-nano nvidia-jetson-xavier skaffold smart-iot software-engineering tensorflow-serving virtualbox

Last synced: 14 Apr 2025

https://github.com/alexandrainst/danlp

DaNLP is a repository for Natural Language Processing resources for the Danish Language.

danish machine-learning named-entity-recognition natural-language-processing nlp nlp-library part-of-speech word-embeddings

Last synced: 22 Nov 2025

https://github.com/Ronak-59/Stock-Prediction

Smart Algorithms to predict buying and selling of stocks on the basis of Mutual Funds Analysis, Stock Trends Analysis and Prediction, Portfolio Risk Factor, Stock and Finance Market News Sentiment Analysis and Selling profit ratio. Project developed as a part of NSE-FutureTech-Hackathon 2018, Mumbai. Team : Semicolon

algorithms artificial-intelligence data-science lstm-neural-network machine-learning risk-analysis sentiment-analysis stock-prediction stock-price-prediction visualisation

Last synced: 02 Jun 2026

https://github.com/huggingface/obelics

Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M documents, 115B text tokens and 353M images.

dataset machine-learning multimodal

Last synced: 14 Oct 2025

https://github.com/mybridge/amazing-machine-learning-opensource-2019

Amazing Machine Learning Open Source Tools and Projects for the Past Year (v.2019)

artificial-intelligence deep-learning machine-learning neural-network reinforcement-learning

Last synced: 28 Oct 2025

https://github.com/flairox/kinetix

Reinforcement learning on general 2D physics environments in JAX. ICLR 2025 Oral.

machine-learning physics-engine reinforcement-learning

Last synced: 11 Sep 2025

https://github.com/LeapLabTHU/EfficientTrain

1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundation visual backbones.

computer-vision deep-learning efficient-training machine-learning pytorch

Last synced: 05 Apr 2025

https://github.com/owkin/FLamby

Cross-silo Federated Learning playground in Python. Discover 7 real-world federated datasets to test your new FL strategies and try to beat the leaderboard.

dataset deep-learning differential-privacy federated-learning healthcare machine-learning python

Last synced: 09 May 2025

https://github.com/awjuliani/neuro-nav

A library for neuroscience-inspired navigation and decision making research.

cognitive-science deep-reinforcement-learning gym-environment machine-learning reinforcement-learning

Last synced: 04 Apr 2025

https://github.com/albumentations-team/autoalbument

AutoML for image augmentation. AutoAlbument uses the Faster AutoAugment algorithm to find optimal augmentation policies. Documentation - https://albumentations.ai/docs/autoalbument/

augmentation automated-machine-learning automl computer-vision deep-learning image-augmentation machine-learning pytorch

Last synced: 07 Apr 2025

https://github.com/iterative/vscode-dvc

Machine learning experiment tracking and data versioning with DVC extension for VS Code

data data-science dvc machine-learning python visual-studio-code vscode vscode-extension

Last synced: 18 Jun 2025

https://github.com/oracle-samples/oci-data-science-ai-samples

This repo contains a series of tutorials and code examples highlighting different features of the OCI Data Science and AI services, along with a release vehicle for experimental programs.

ai conda data-science data-science-notebooks deep-learning jupyter-notebook machine-learning oci oracle-cloud-infrastructure python

Last synced: 15 May 2025

https://github.com/insight-platform/Similari

A framework for building high-performance real-time multiple object trackers

artificial-intelligence deepsort feature-matching machine-learning object-tracking rust sort

Last synced: 17 Apr 2025

https://github.com/netflix/metaflow-service

:rocket: Metadata tracking and UI service for Metaflow!

ai data-science machine-learning metaflow ml ml-infrastructure ml-platform productivity ui

Last synced: 01 Jul 2025

https://github.com/gantman/learn-tfjs

The code for the book Learning TensorFlow.js by Gant Laborde - Published by O'Reilly Media

hacktoberfest machine-learning tensorflow tensorflow-tutorials tensorflowjs tensorflowjs-tutorial

Last synced: 05 Oct 2025

https://github.com/fluxml/nnlib.jl

Neural Network primitives with multiple backends

deep-learning julia machine-learning

Last synced: 11 Apr 2026

https://github.com/coteries/cedille-ai

✒️ Cedille is a large French language model (6B), released under an open-source license

machine-learning nlg nlp

Last synced: 04 Apr 2025

https://github.com/greydanus/mnist1d

A 1D analogue of the MNIST dataset for measuring spatial biases and answering Science of Deep Learning questions.

convnet dataset machine-learning neural-networks pytorch research

Last synced: 15 Nov 2025

https://github.com/Laurae2/Laurae

Advanced High Performance Data Science Toolbox for R by Laurae

data-science laurae machine-learning r supervised-learning xgboost

Last synced: 20 Jul 2025

https://github.com/alrevuelta/cONNXr

Pure C ONNX runtime with zero dependancies for embedded devices

ai-framework embedded-devices machine-learning onnx protocol-buffers

Last synced: 11 Apr 2025

https://github.com/waikato/meka

Multi-label classifiers and evaluation procedures using the Weka machine learning framework.

machine-learning multi-label multi-target weka

Last synced: 15 May 2025

https://github.com/neuml/rag

🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.

large-language-models llm machine-learning nlp python rag retrieval-augmented-generation search txtai

Last synced: 06 Mar 2025

https://github.com/ahammadmejbah/machine-learning-book-collections

Machine learning is the study and development of data-driven strategies to enhance task performance. AI includes it.

data-science deep-learning machine-learning

Last synced: 05 Mar 2025

https://github.com/locuslab/e2e-model-learning

Task-based end-to-end model learning in stochastic optimization

deep-learning machine-learning optimization paper pytorch stochastic-optimizers

Last synced: 23 Apr 2025

https://github.com/alrevuelta/connxr

Pure C ONNX runtime with zero dependancies for embedded devices

ai-framework embedded-devices machine-learning onnx protocol-buffers

Last synced: 09 Apr 2025

https://github.com/imbrianj/switchboard

Control of Internet connected devices within a given network via web interface.

home-automation javascript machine-learning raspberry-pi switchboard

Last synced: 09 Apr 2025

https://github.com/kevin-hanselman/dud

A lightweight CLI tool for versioning data alongside source code and building data pipelines.

data-engineering data-pipelines data-science dataset dvcs machine-learning mlops

Last synced: 29 Dec 2025

https://github.com/danaugrs/go-tsne

t-Distributed Stochastic Neighbor Embedding (t-SNE) in Go

3d data-science dimensionality-reduction go machine-learning tsne unsupervised-learning visualization

Last synced: 30 Apr 2025

https://github.com/huangcongqing/deeplearning.ai-note

网易云课堂终于官方发布了吴恩达经过授权的汉化课程-“”深度学习专项课程“”,这是自己做的一些笔记以及代码。下为网易云学习链接

ai deep-learning machine-learning

Last synced: 08 Apr 2025

https://github.com/fcurella/django-recommends

A django app that builds item-based suggestions for users.

django machine-learning recommendation-system

Last synced: 27 Jun 2025

https://github.com/dsgissin/DiscriminativeActiveLearning

Code and website for DAL (Discriminative Active Learning) - a new active learning algorithm for neural networks in the batch setting. For the blog:

active-learning adversarial-active-learning bayesian-active-learning core-set deep-learning egl machine-learning uncertainty-sampling

Last synced: 05 Apr 2025

https://github.com/uber-research/atari-model-zoo

A binary release of trained deep reinforcement learning models trained in the Atari machine learning benchmark, and a software release that enables easy visualization and analysis of models, and comparison across training algorithms.

ai artificial-intelligence atari deep-learning deep-reinforcement-learning machine-learning machinelearning research training-algorithm uber

Last synced: 21 Aug 2025

https://github.com/zyfra/ebonite

machine learning lifecycle framework

ai ebonite machine-learning python

Last synced: 19 Jul 2025

https://github.com/acellera/moleculekit

MoleculeKit: Your favorite molecule manipulation kit

drug-discovery machine-learning molecular-modeling molecular-simulation molecule proteins

Last synced: 28 Apr 2026

https://github.com/analysiscenter/batchflow

BatchFlow helps you conveniently work with random or sequential batches of your data and define data processing and machine learning workflows even for datasets that do not fit into memory.

data-science machine-learning pipeline pipeline-framework python python3 workflow workflow-engine

Last synced: 25 Oct 2025

https://github.com/ethanrosenthal/skits

scikit-learn-inspired time series

machine-learning time-series

Last synced: 30 Dec 2025

https://github.com/franck-dernoncourt/pubmed-rct

PubMed 200k RCT dataset: a large dataset for sequential sentence classification.

corpus machine-learning medical nlp randomized-controlled-trials sentence-classification

Last synced: 06 Jan 2026

https://github.com/huangjia2019/let-us-machine-learning

极客时间:Machine Learning from Scratch(零基础实战机器学习)

dataanalytics deep-learning machine-learning

Last synced: 17 Apr 2025

https://github.com/dragen1860/maml-tensorflow

Faster and elegant TensorFlow Implementation of paper: Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks

machine-learning metalearning tensorflow

Last synced: 08 May 2025

https://github.com/yanndubs/neural-process-family

Code for the Neural Processes website and replication of 4 papers on NPs. Pytorch implementation.

conditional-neural-process deep-learning machine-learning meta-learning neural-processes pytorch stochastic-processes uncertainty-estimation

Last synced: 07 Apr 2025

https://github.com/Kivy-CN/ml-for-humans-zh

:book: [译] 写给人类的机器学习

machine-learning tutorial

Last synced: 13 Apr 2025

https://github.com/eugeneyan/applyingml

📌 Papers, guides, and mentor interviews on applying machine learning for ApplyingML.com—the ghost knowledge of machine learning.

applied-machine-learning gatsby machine-learning

Last synced: 07 Apr 2025

https://github.com/GantMan/learn-tfjs

The code for the book Learning TensorFlow.js by Gant Laborde - Published by O'Reilly Media

hacktoberfest machine-learning tensorflow tensorflow-tutorials tensorflowjs tensorflowjs-tutorial

Last synced: 26 Mar 2025

https://github.com/studiomoniker/Quickdraw-appendix

Dataset of 25k penises: an appendix to the Quick, Draw! Dataset

censorship dataset machine-learning penis quickdraw quickdraw-dataset

Last synced: 29 Apr 2025

https://github.com/kyegomez/visionmamba

Implementation of Vision Mamba from the paper: "Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model" It's 2.8x faster than DeiT and saves 86.8% GPU memory when performing batch inference to extract features on high-res images

ai machine-learning mamba pytorch recurrent-neural-network ssm

Last synced: 05 Apr 2025

https://github.com/shreyas-bk/U-2-Net-Demo

Demonstration using Google Colab to show how U-2-NET can be used for Background Removal, Changing Backgrounds, Bounding Box Creation, Salient Feature Highlighting and Salient Object Cropping.

background-removal bounding-boxes deep-learning image-cropping machine-learning python pytorch saliency-map tensorflow u2net

Last synced: 07 Mar 2025

https://github.com/dssg/triage

General Purpose Risk Modeling and Prediction Toolkit for Policy and Social Good Problems

artificial-intelligence dssg early-warning-systems inspection-prioritization machine-learning python tool triage

Last synced: 01 Mar 2026

https://github.com/shreyas-bk/u2netdemo

Demonstration using Google Colab to show how U-2-NET can be used for Background Removal, Changing Backgrounds, Bounding Box Creation, Salient Feature Highlighting and Salient Object Cropping.

background-removal bounding-boxes deep-learning image-cropping machine-learning python pytorch saliency-map tensorflow u2net

Last synced: 17 Jan 2026

https://github.com/giacbrd/ShallowLearn

An experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.

fasttext gensim machine-learning neural-network online-learning scikit-learn shallow-learning supervised-learning text-classification text-mining word-embeddings word2vec

Last synced: 19 Jul 2025

https://github.com/mjx-project/mjx

Mjx: A framework for Mahjong AI research

ai cpp game machine-learning mahjong python

Last synced: 05 Apr 2026

https://github.com/giacbrd/shallowlearn

An experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.

fasttext gensim machine-learning neural-network online-learning scikit-learn shallow-learning supervised-learning text-classification text-mining word-embeddings word2vec

Last synced: 08 Oct 2025

https://github.com/ayush1997/visualize_ML

Python package for consolidated and extensive Univariate,Bivariate Data Analysis and Visualization catering to both categorical and continuous datasets.

data-analysis machine-learning matplotlib python statisics visualization

Last synced: 14 Mar 2025

https://github.com/romeric/fastapprox

Approximate and vectorized versions of common mathematical functions

libm machine-learning math-functions simd vectorization

Last synced: 14 Sep 2025

https://github.com/krshrimali/no-reference-image-quality-assessment-using-brisque-model

Implementation of the paper "No Reference Image Quality Assessment in the Spatial Domain" by A Mittal et al. in OpenCV (using both C++ and Python)

computer-vision cpp image-processing image-quality image-quality-assessment libsvm machine-learning opencv python svm

Last synced: 24 Oct 2025

https://github.com/pkhungurn/talking-head-anime-4-demo

Demo Programs for the "Talking Head(?) Anime from a Single Image 4: Improved Models and Its Distillation" Project

animation image-processing machine-learning vtuber

Last synced: 17 Mar 2025

https://github.com/TensorLab/tensorfx

TensorFlow framework for training and serving machine learning models

machine-learning ml python tensorflow tensorfx

Last synced: 06 May 2025

https://github.com/teddykoker/blog

Source code for my personal blog

blog machine-learning

Last synced: 17 Jan 2026

https://github.com/tensorlab/tensorfx

TensorFlow framework for training and serving machine learning models

machine-learning ml python tensorflow tensorfx

Last synced: 02 Mar 2025

https://github.com/microsoft/presidio-research

This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.

deep-learning flair machine-learning named-entity-recognition natural-language-processing ner nlp pii privacy spacy transformers

Last synced: 12 Apr 2025

https://github.com/microsoft/finnts

Microsoft Finance Time Series Forecasting Framework (FinnTS) is a forecasting package that utilizes cutting-edge time series forecasting and parallelization on the cloud to produce accurate forecasts for financial data.

business data-science feature-selection finance finnts forecasting machine-learning microsoft r r-package rstats time-series

Last synced: 15 May 2025

https://github.com/stevenygd/NFGP

Pytorch implementation of NeurIPS 2021 paper: Geometry Processing with Neural Fields.

geometry-processing machine-learning neural-fields neural-network

Last synced: 11 Apr 2025