An open API service indexing awesome lists of open source software.

Machine learning

Machine learning is the practice of teaching a computer to learn from data. It uses pattern recognition and other predictive algorithms to make judgments about incoming data. The field is closely related to artificial intelligence and computational statistics.

https://github.com/trailofbits/fickling

A Python pickling decompiler and static analyzer

machine-learning python security

Last synced: 04 Mar 2026

https://github.com/facebookresearch/NeuralCompression

A collection of tools for neural compression enthusiasts.

compression deep-learning jax machine-learning neural-compression python pytorch

Last synced: 15 Jul 2025

https://github.com/GRAAL-Research/poutyne

A simplified framework and utilities for PyTorch

data-science deep-learning keras machine-learning neural-network python pytorch

Last synced: 27 Mar 2025

https://github.com/yassouali/ML-paper-notes

Notes and summaries of various ML, Computer Vision & NLP papers.

computer-vision deep-learning machine-learning natural-language-processing nlp summary

Last synced: 03 Apr 2025

https://github.com/l3p-cv/lost

Label Objects and Save Time (LOST) - Design your own smart Image Annotation process in a web-based environment.

annotation-framework annotation-process annotation-tool bounding-boxes computer-vision image-annotation machine-learning machine-vision polygon-annotations

Last synced: 11 May 2025

https://github.com/LearnDataSci/articles

A repository for the source code, notebooks, data, files, and other assets used in the data science and machine learning articles on LearnDataSci

data-analysis data-science data-visualization machine-learning machine-learning-algorithms machinelearning python

Last synced: 13 Apr 2025

https://github.com/RelevanceAI/vectorhub

Vector Hub - Library for easy discovery and consumption of state-of-the-art models that turn data into vectors (text2vec, image2vec, video2vec, graph2vec, bert, inception, etc.).

artificial-intelligence audio-processing deep-learning deeplearning embeddings encodings image2vec machine-learning neural-network python pytorch tensorflow tfhub transformers vector vector-similarity video-processing word2vec

Last synced: 27 Apr 2025

https://github.com/Maknee/minigpt4.cpp

Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)

c cpp deep-learning ggml machine-learning minigpt4 multimodal quantization

Last synced: 15 Apr 2025

https://github.com/roboflow/roboflow-python

The official Roboflow Python package. Manage your datasets, models, and deployments. Roboflow has everything you need to build a computer vision application.

computer-vision deep-learning machine-learning python

Last synced: 15 Apr 2026

https://github.com/inoryy/reaver

Reaver: Modular Deep Reinforcement Learning Framework. Focused on StarCraft II. Supports Gym, Atari, and MuJoCo.

actor-critic artificial-intelligence deep-learning deepmind machine-learning pysc2 reinforcement-learning starcraft-ii starcraft2 tensorflow

Last synced: 13 Apr 2025

https://github.com/educationaltestingservice/skll

SciKit-Learn Laboratory (SKLL) makes it easy to run machine learning experiments.

hacktoberfest machine-learning python scikit-learn

Last synced: 14 May 2025

https://github.com/hi-abhi/tensorflow-value-iteration-networks

TensorFlow implementation of the Value Iteration Networks (NIPS '16) paper

deep-learning machine-learning neural-networks reinforcement-learning tensorflow

Last synced: 19 Jul 2025

https://github.com/dabit3/gpt-travel-advisor

Reference architecture for building a travel application with GPT-3

artificial-intelligence gpt-3 machine-learning nextjs openai

Last synced: 17 Nov 2025

https://github.com/rushter/heamy

A set of useful tools for competitive data science.

data-science machine-learning stacking

Last synced: 16 May 2025

https://github.com/azure/mlops-v2

Azure MLOps (v2) solution accelerators. Enterprise ready templates to deploy your machine learning models on the Azure Platform.

azure azuremachinelearning azureml deep-learning devops machine-learning microsoft mlops mlops-environment mlops-project mlops-template mlops-workflow

Last synced: 12 Apr 2025

https://github.com/mmasana/facil

Framework for Analysis of Class-Incremental Learning with 12 state-of-the-art methods and 3 baselines.

continual-learning deep-learning framework incremental-learning lifelong-learning machine-learning reproducible-research survey

Last synced: 03 Oct 2025

https://github.com/cstjean/scikitlearn.jl

Julia implementation of the scikit-learn API https://cstjean.github.io/ScikitLearn.jl/dev/

julia machine-learning

Last synced: 16 May 2025

https://github.com/ydli-ai/CSL

[COLING 2022] CSL: A Large-scale Chinese Scientific Literature Dataset 中文科学文献数据集

chinese-nlp dataset machine-learning scientific-publications

Last synced: 14 Apr 2025

https://github.com/firmai/pandapy

PandaPy has the speed of NumPy and the usability of Pandas, and is described as 10x to 50x faster (by @firmai)

algorithmic-trading arrays data-science data-structures finance machine-learning numpy pandas structured-data

Last synced: 06 May 2025

https://github.com/logpai/drain3

A robust streaming log template miner based on the Drain algorithm

aiops anomaly-detection clustering drain log log-clustering machine-learning observability template-mining

Last synced: 15 May 2025

https://github.com/vanderschaarlab/synthcity

A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation.

data-augmentation fairness-ml generative-model machine-learning privacy pytorch synthetic-data tabular-data

Last synced: 16 May 2025

https://github.com/redis-developer/arxivchatguru

Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.

ai arxiv langchain machine-learning openai python question-answering rag redis retrieval retrieval-augmented-generation streamlit vector-database vector-search

Last synced: 15 May 2025

https://github.com/salesforce/logai

LogAI - An open-source library for log analytics and intelligence

ai aiops anomaly-detection benchmarking log-analysis log-intelligence machine-learning python

Last synced: 14 May 2025

https://github.com/databricks/mlops-stacks

This repo provides a customizable stack for starting new ML projects on Databricks that follow production best-practices out of the box.

databricks machine-learning mlops

Last synced: 15 May 2025

https://github.com/wcipriano/pretty-print-confusion-matrix

Confusion Matrix in Python: plot a pretty confusion matrix (like Matlab) in python using seaborn and matplotlib

confusion-matrix machine-learning machine-learning-algorithms machine-learning-library neural-network python

Last synced: 17 Mar 2026

https://github.com/WecoAI/aideml

AIDE: the state-of-the-art machine learning engineer agent, generating machine learning solution code from natural language descriptions.

ai data-science llm machine-learning

Last synced: 02 May 2025

https://github.com/linkedin/fasttreeshap

Fast SHAP value computation for interpreting tree-based models

explainable-ai interpretability lightgbm machine-learning random-forest shap xgboost

Last synced: 17 Aug 2025

https://github.com/tensorlayer/TensorLayerX

TensorLayerX: A Unified Deep Learning and Reinforcement Learning Framework for All Hardwares, Backends and OS.

deep-learning jittor machine-learning mindspore neural-network oneflow paddlepaddle python pytorch tensorflow tensorlayer tensorlayerx

Last synced: 13 May 2025

https://github.com/mozilla/bugbug

Platform for Machine Learning projects on Software Engineering

ai developer-tools llm machine-learning ml python software-engineering

Last synced: 14 May 2025

https://github.com/waldo-vision/optical.flow.demo

A project that uses optical flow and machine learning to detect aimhacking in video clips.

anti-cheat anticheat deep-learning fps fps-shooter gaming machine-learning opencv opencv-python optical-flow

Last synced: 17 Jul 2025

https://github.com/OpenLemur/Lemur

[ICLR 2024] Lemur: Open Foundation Models for Language Agents

code-generation language-model machine-learning natural-language-processing nlp text-reasoning

Last synced: 07 May 2025

https://github.com/AutoViML/Auto_ViML

Automatically Build Multiple ML Models with a Single Line of Code. Created by Ram Seshadri. Collaborators Welcome. Permission Granted upon Request.

auto-viml autokeras automated-machine-learning automl automl-algorithms autosklearn machine-learning python python3 scikit-learn tpot xgboost

Last synced: 01 Apr 2025

https://github.com/davidbau/rewriting

Rewriting a Deep Generative Model, ECCV 2020 (oral). Interactive tool to directly edit the rules of a GAN to synthesize scenes with objects added, removed, or altered. Change StyleGANv2 to make extravagant eyebrows, or horses wearing hats.

deep-learning gans graphics hci machine-learning research vision

Last synced: 04 Apr 2025

https://github.com/neonwatty/meme-search

The open source Meme Search Engine and Finder. Free and built to self-host locally with Python, Ruby, and Docker.

docker machine-learning python ruby-on-rails self-hosted vector-database vision-language-model

Last synced: 15 May 2025

https://github.com/henripal/labnotebook

LabNotebook is a tool that allows you to flexibly monitor, record, save, and query all your machine learning experiments.

experiment-manager experimental-data machine-learning postgres postgresql python reproducibility reproducible-research vuejs webapp

Last synced: 18 Apr 2025

https://github.com/magnivorg/prompt-layer-library

🍰 PromptLayer - Maintain a log of your prompts and OpenAI API requests. Track, debug, and replay old completions.

gpt machine-learning openai prompt prompt-engineering python

Last synced: 12 Sep 2025

https://github.com/mpaepper/content-chatbot

Build a chatbot or Q&A bot of your website's content

deep-learning llm machine-learning

Last synced: 05 Apr 2025

https://github.com/bradleyboehmke/data-science-learning-resources

A collection of data science and machine learning resources that I've found helpful (I only post what I've read!)

data-science machine-learning

Last synced: 07 Apr 2025

https://github.com/neuml/codequestion

🔎 Semantic search for developers

machine-learning nlp python search txtai

Last synced: 24 Mar 2025

https://github.com/Toni-SM/skrl

Modular reinforcement learning library (on PyTorch and JAX) with support for NVIDIA Isaac Gym, Omniverse Isaac Gym and Isaac Lab

deep-learning deepmind gym gymnasium isaac-gym isaac-lab isaac-orbit isaac-sim isaaclab jax machine-learning nvidia-omniverse openai-gym python pytorch reinforcement-learning rl robosuite robotics skrl

Last synced: 02 Apr 2025

https://github.com/google/tacotron

Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.

audio machine-learning prosody speech tacotron tts

Last synced: 15 Apr 2025

https://github.com/mrdbourke/m1-machine-learning-test

Code for testing various M1 Chip benchmarks with TensorFlow.

machine-learning metal tensorflow tensorflow-macos

Last synced: 04 Apr 2025

https://github.com/microsoft/ocr-form-tools

A set of tools to use in Microsoft Azure Form Recognizer and OCR services.

form-recognizer labeling-tool machine-learning machine-learning-algorithms ocr-form-labeling rpa typescript

Last synced: 05 May 2025

https://github.com/philipperemy/fx-1-minute-data

HISTDATA - Dataset composed of all FX trading pairs, crude oil, and stock indexes. Simple API to retrieve 1-minute data (and tick data) for historical FX prices (up to date).

dataset deep-learning financial-data financial-markets fx machine-learning trading

Last synced: 15 May 2025

https://github.com/wq2012/spectralcluster

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

auto-tune clustering constrained-clustering machine-learning python speaker-diarization spectral-clustering unsupervised-clustering unsupervised-learning

Last synced: 16 May 2025

https://hdi-project.github.io/ATM/

Auto Tune Models - A multi-tenant, multi-data system for automated machine learning (model selection and tuning).

automl data-science distributed-computing hyperparameter-optimization machine-learning

Last synced: 12 May 2025

https://github.com/x-datainitiative/tick

Module for statistical learning, with a particular emphasis on time-dependent modelling

machine-learning modelling optimization point-process python statistics

Last synced: 14 Jan 2026

https://github.com/HDI-Project/ATM

Auto Tune Models - A multi-tenant, multi-data system for automated machine learning (model selection and tuning).

automl data-science distributed-computing hyperparameter-optimization machine-learning

Last synced: 18 Jul 2025

https://github.com/philipperemy/deep-learning-bitcoin

Exploiting Bitcoin price patterns with deep learning.

artificial-intelligence bitcoin deep-learning machine-learning

Last synced: 05 Apr 2025

https://github.com/mdozmorov/MachineLearning_notes

Machine learning and deep learning resources

deep-learning machine-learning

Last synced: 01 Apr 2025

https://github.com/locuslab/optnet

OptNet: Differentiable Optimization as a Layer in Neural Networks

deep-learning machine-learning optimization paper pytorch

Last synced: 05 Apr 2025

https://github.com/dmitryryumin/aaai-2024-papers

AAAI 2024 Papers: Explore a comprehensive collection of innovative research papers presented at one of the premier artificial intelligence conferences. Seamlessly integrate code implementations for better understanding. ⭐ experience the forefront of progress in artificial intelligence with this repository!

aaai aaai2024 application-domains artificial-intelligence cognitive-systems computer-vision computer-vison deep-learning expert-systems human-computer-interaction knowledge-representation machine-learning multi-agent-systems neural-networks reinforcement-learning sentiment-analysis

Last synced: 15 May 2025

https://github.com/4paradigm/autox

AutoX is an efficient AutoML tool aimed mainly at data mining tasks with tabular data.

kaggle machine-learning python

Last synced: 16 May 2025

https://github.com/curiousily/ai-bootcamp

Self-paced bootcamp on Generative AI. Tutorials on ML fundamentals, LLMs, RAGs, LangChain, LangGraph, Fine-tuning Llama 3 & AI Agents (CrewAI)

artificial-intelligence chatgpt crewai langchain langgraph large-language-models llama machine-learning prompt-engineering rag

Last synced: 15 May 2025

https://github.com/openhackathons-org/gpubootcamp

This repository consists of GPU bootcamp material for HPC and AI

ai4hpc cuda data-science deep-learning deepstream gpu hpc machine-learning mpi openacc openmp rapidsai

Last synced: 27 Mar 2025

https://github.com/RunLLM/aqueduct

Aqueduct is no longer being maintained. Aqueduct allows you to run LLM and ML workloads on any cloud infrastructure.

ai data data-science kubernetes llm llms machine-learning ml ml-infrastructure ml-monitoring mlops orchestration python python3

Last synced: 18 Apr 2025

https://github.com/ayaka14732/tpu-starter

Everything you want to know about Google Cloud TPU

cloud-tpu deep-learning gcp google-cloud-platform jax machine-learning tpu

Last synced: 25 Oct 2025