An open API service indexing awesome lists of open source software.

Machine learning

Machine learning is the practice of teaching a computer to learn. The concept uses pattern recognition, as well as other forms of predictive algorithms, to make judgments on incoming data. This field is closely related to artificial intelligence and computational statistics.

https://github.com/leelabcnbc/book-notes

a bunch of notes about machine learning, image statistics, theoretical neuroscience, etc.

machine-learning notes

Last synced: 17 Jan 2026

https://github.com/okfn-brasil/whistleblower

🚨A Twitter bot for publicly reporting suspicions found by Rosie, Serenata de Amor's AI

data-science facebook-messenger-bot machine-learning twitter-bot

Last synced: 28 Mar 2025

https://github.com/hewlettpackard/dc-rl

SustainDC is a set of Python environments for Data Center simulation and control using Heterogeneous Multi Agent Reinforcement Learning. Includes customizable environments for workload scheduling, cooling optimization, and battery management, with integration into Gymnasium.

battery-management benchmark benchmarking benchmarking-framework carbon-footprint cooling-optimization data-center data-center-management energy-efficiency gymnasium gymnasium-environment heterogeneous-reinforcement-learning machine-learning multi-agent-reinforcement-learning python reinforcement-learningmulti-agent-systems sustainability sustainable-computing workload-scheduling

Last synced: 26 Apr 2025

https://github.com/llnl/djinn

Deep jointly-informed neural networks -- as easy-to-use algorithm for designing/initializing neural nets

deep-learning machine-learning neural-network

Last synced: 29 Apr 2025

https://github.com/topazape/molecular-vae

Implementation of the paper - Automatic chemical design using a data-driven continuous representation of molecules

cheminformatics chemoinformatics denovo machine-learning python3 pytorch vae

Last synced: 20 Jul 2025

https://github.com/jvalegre/robert

Automated machine learning protocols that start from CSV databases of descriptors or SMILES and produce publication-quality results in Chemistry studies with only one command line.

automation cheminformatics machine-learning python reproducibility scikit-learn workflows

Last synced: 13 Apr 2025

https://github.com/glouppe/recnn

Repository for the code of "QCD-Aware Recursive Neural Networks for Jet Physics"

deep-learning jet-clustering machine-learning particle-physics

Last synced: 22 Sep 2025

https://github.com/xviniette/asteroidslearning

Program that learns to avoid asteroids by machine learning (Neuroevolution)

avoid-asteroids machine-learning

Last synced: 03 May 2025

https://github.com/microsoft/LLF-Bench

A benchmark for evaluating learning agents based on just language feedback

large-language-models llm llm-training llms machine-learning natural-language-processing reinforcement-learning

Last synced: 18 Apr 2025

https://github.com/pliang279/lm_bias

[ICML 2021] Towards Understanding and Mitigating Social Biases in Language Models

fairness-ai language-model machine-learning natural-language-processing

Last synced: 05 May 2025

https://github.com/zeno-ml/zeno-hub

AI Evaluation Platform

ai evaluation machine-learning visualization

Last synced: 04 Apr 2025

https://github.com/dabit3/real-time-image-tracking

Real-time image tracking with React, GraphQL, and AWS AppSync

amplify aws graphql javascript machine-learning react rekognition

Last synced: 17 Nov 2025

https://github.com/osdg-ai/osdg-tool

OSDG is an open-source tool that maps and connects activities to the UN Sustainable Development Goals (SDGs) by identifying SDG-relevant content in any text. The tool is available online at www.osdg.ai. API access available for research purposes.

machine-learning machine-learning-algorithms machine-translation ml open-source osdg sdg sdg-data sdgs sustainability sustainability-score sustainable-development sustainable-development-goals united-nations

Last synced: 05 Dec 2025

https://github.com/superdoc-dev/docx-corpus

The largest open corpus of .docx files for document processing research

bun common-crawl corpus dataset document-processing docx machine-learning nlp typescript word-documents

Last synced: 12 Mar 2026

https://github.com/neuropoly/gmseg

Spinal cord gray matter segmentation using deep dilated convolutions.

deep-learning machine-learning research segmentation spinal-cord

Last synced: 12 Mar 2026

https://github.com/mukhlishga/gnn-powerflow

Graph Neural Network application in predicting AC Power Flow calculation. Developed with Pytorch Geometric framework. My Master Thesis at Eindhoven University of Technology

artificial-intelligence electrical electrical-engineering graph-neural-network machine-learning neural-network power-flow python pytorch pytorch-geometric

Last synced: 10 Apr 2025

https://github.com/greenelab/multi-plier

An unsupervised transfer learning approach for rare disease transcriptomics

analysis bioinformatics-analysis dataset gene-expression-signatures machine-learning methodology multiplier plier rare-diseases

Last synced: 16 Feb 2026

https://github.com/protosec-research/pwnbert

A project based on Fine-tuned BERT to detect GLIBC vulnerabilities.

bert-fine-tuning classification machine-learning openai-api pwn vulnerability-detection

Last synced: 04 Jul 2025

https://github.com/dragen1860/capsnet-pytorch

Pytorch version of Hinton's Capsule Theory paper: Dynamic Routing Between Capsules

capsnet-pytorch deep-learning machine-learning pytorch

Last synced: 30 Apr 2025

https://github.com/avik-jain/school-of-ai

Repository for Resources and Code used in Dehradun School of AI Workshops and Meetups

artificial-intelligence machine-learning school-of-ai workshop

Last synced: 02 Aug 2025

https://github.com/esa/dsgp4

dSGP4: differentiable SGP4. Supports differentiability, ML integration & embarassingly parallel computations

astrodynamics differentiable-programming embarassingly-parallel machine-learning orbital-dynamics orbital-mechanics orbital-propagation sgp4 space-debris

Last synced: 06 Apr 2025

https://github.com/hrolive/applications-of-ai-for-anomaly-detection

Nvidia DLI workshop on AI-based anomaly detection techniques using GPU-accelerated XGBoost, deep learning-based autoencoders, and generative adversarial networks (GANs) and then implement and compare supervised and unsupervised learning techniques.

anomaly-detection autoencoders deep-learning generative-adversarial-network keras machine-learning notebook nvidia-gpu pandas python rapids tensorflow xgboost

Last synced: 12 May 2025

https://github.com/flintml/flint

A self-contained, lightweight and OOB research platform for modern ML

data-science deltalake jupyter machine-learning mlops polars

Last synced: 09 May 2025

https://github.com/dlmacedo/distinction-maximization-loss

A project to improve out-of-distribution detection (open set recognition) and uncertainty estimation by changing a few lines of code in your project! Perform efficient inferences (i.e., do not increase inference time) without repetitive model training, hyperparameter tuning, or collecting additional data.

ai-safety anomaly-detection classification deep-learning machine-learning novelty-detection ood ood-detection open-set open-set-recognition osr out-of-distribution out-of-distribution-detection pytorch robust-machine-learning trustworthy-ai trustworthy-machine-learning uncertainty-estimation

Last synced: 10 Oct 2025

https://github.com/AidanCooper/shap-analysis-guide

How to Interpret SHAP Analyses: A Non-Technical Guide

data-science machine-learning shap tutorial

Last synced: 01 May 2025

https://github.com/akimach/tfgraphviz

A visualization tool to show a TensorFlow's graph like TensorBoard

dataflow-programming deep-learning graphviz machine-learning neural-network python tensorboard tensorflow visualization

Last synced: 21 Apr 2025

https://github.com/uetchy/dockerfile-machinelearning

🐳🤖 Dockerfile for ML researchers.

docker-image dockerfile machine-learning

Last synced: 19 Apr 2025

https://github.com/spring-epfl/trickster

Library and experiments for attacking machine learning in discrete domains

adversarial-machine-learning graph-algorithms machine-learning

Last synced: 20 Apr 2025

https://github.com/popcornell/keras-triplet-center-loss

Simple Keras implementation of Triplet-Center Loss on the MNIST dataset

center-loss keras machine-learning mnist tensorflow triplet-loss

Last synced: 19 Oct 2025

https://github.com/alexhraber/flowhawk

Real-time eBPF-powered network security monitor with AI-driven threat detection. Surfaces port scans, DDoS attacks, botnet activity, and anomalies at 100Gbps+ speeds with sub-microsecond latency (~150 million packets/sec).

anomaly-detection cybersecurity ddos-protection ebpf golang intrusion-detection machine-learning network-analysis network-security packet-processing real-time-monitoring threat-detection xdp zero-day-detection

Last synced: 12 Mar 2026

https://github.com/servicenow/doomarena

DoomArena is a Framework for Testing AI Agents Against Evolving Security Threats

ai ai-safety attack browsergym defense llm machine machine-learning red-teaming security taubench web-agents

Last synced: 09 Oct 2025

https://github.com/maxhumber/mummify

Version Control for Machine Learning

git machine-learning version-control

Last synced: 19 Aug 2025

https://github.com/ahoylabs/gguf.js

A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.

ggml gguf large-language-models llamacpp llm machine-learning

Last synced: 06 Oct 2025

https://github.com/google-research-datasets/swim-ir

SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 languages, generated using PaLM 2 and summarize-then-ask prompting.

cross-lingual datasets deep-learning information-retrieval machine-learning multilingual natural-language-processing neural-information-retrieval nlp training-data

Last synced: 01 Apr 2026

https://github.com/msusazureaccelerators/ai-powered-call-center-intelligence-accelerator

Automate call transcriptions for real-time and previously recorded calls by using custom speech models, text analytics, and industry-specific natural language processing with the Microsoft Call Center Intelligence Accelerator.

ai artificial-intelligence azure call-center human-resources machine-learning microsoft ml solution-accelerator

Last synced: 26 Oct 2025

https://github.com/pedrol2b/react-native-vision-camera-mlkit

A powerful React Native Vision Camera plugin delivering high-performance Google ML Kit frame processor features—including text recognition (OCR), face detection, barcode scanning, pose detection, and more. Seamlessly bridges native ML Kit capabilities for real-time, on-device computer vision in your React Native apps.

android barcode barcode-scanner camera computer-vision face-detection frame-processor google-ml-kit image-processing ios machine-learning mlkit native-module ocr pose-detection react-native react-native-module text-detection vision vision-camera

Last synced: 01 May 2026

https://github.com/yujiabao/ls

Learning to Split for Automatic Bias Detection

bias-detection data-split label-noise machine-learning

Last synced: 01 May 2026

https://github.com/ploomber/soopervisor

☁️ Export Ploomber pipelines to Kubernetes (Argo), Airflow, AWS Batch, SLURM, and Kubeflow.

airflow argo argo-workflows aws data-science kubeflow kubeflow-pipelines kubernetes machine-learning slurm workflow

Last synced: 21 Aug 2025

https://github.com/asem000/pytreeclass

Visualize, create, and operate on pytrees in the most intuitive way possible.

data dataclasses deep-learning jax machine-learning pipelines pytorch pytree tensorflow

Last synced: 07 Apr 2025

https://github.com/xd-deng/spark-ml-intro

PySpark Machine Learning Examples

machine-learning spark

Last synced: 15 Apr 2025

https://github.com/citoverse/cito

Building and Training Neural Networks with an R interface

machine-learning neural-network r r-package

Last synced: 22 Jun 2025

https://github.com/SciML/DeepEquilibriumNetworks.jl

Implicit Layer Machine Learning via Deep Equilibrium Networks, O(1) backpropagation with accelerated convergence.

deep-equilibrium-models deep-learning implicit-deep-learning julia machine-learning neural-networks nonlinear-equations nonlinear-solve

Last synced: 04 May 2025

https://github.com/mundipagg/amora-data-build-tool

Amora Data Build Tool enables analysts and engineers to transform data on the data warehouse (BigQuery) by writing Amora Models that describe the data schema using Python's "PEP484 - Type Hints" and select statements with SQLAlchemy. Amora is able to transform Python code into SQL data transformation jobs that run inside the warehouse.

analytics analytics-dashboard analytics-engineering bigquery business-intelligence data-engineering data-modeling datacleaning dataquality elt machine-learning python transformation

Last synced: 08 Sep 2025

https://github.com/spidy20/data-scince-ml-project

In this repository i created many data scince - machine learning projects like(Deep dream,weather prediction,Movie recommender system etc) with code & datasets

data-analysis data-analytics deep-dream machine-learning matplotlib number-recognition python recommender-system songs songs-data-analysis stock-market-prediction

Last synced: 12 Apr 2025

https://github.com/ethz-asl/autolabel

A project for computing high-quality ground truth training examples for RGB-D data.

computer-vision labeling-tool machine-learning nerf rgb-d robotics

Last synced: 11 Apr 2025

https://github.com/imgcook/datacook

Machine Learning and Data Analysis in JavaScript.

data-science feature-engineering javascript machine-learning

Last synced: 24 Jun 2025

https://github.com/matteocourthoud/Machine-Learning-for-Economic-Analysis

Material for the exercise sessions of master course Machine Learning for Economic Analysis @UZH

course data-science econometrics economics machine-learning phd python statistics

Last synced: 15 Jun 2026

https://github.com/ramadis/delitos-caba

🚓 Crime dataset for the City of Buenos Aires, Argentina

argentina crime-data crime-prediction datasets machine-learning

Last synced: 23 Apr 2025

https://github.com/fabridamicelli/echoes

Machine Learning with Echo State Networks, a scikit-learn compatible package.

echo-state-networks esn esnetwork machine-learning neural-network python recurrent-neural-networks reservoir-computing shallow-learning

Last synced: 02 Apr 2026

https://github.com/ibm/mlapp

MLApp is a Python library for building scalable data science solutions that meet modern software engineering standards.

ai artificial-intelligence machine-learning ml python

Last synced: 14 Jan 2026

https://github.com/duyongan/nlp-is-so-easy

自然语言处理、深度学习、机器学习的一些个人博客

deep-learning machine-learning nlp

Last synced: 08 Oct 2025

https://github.com/charliegerard/fem-ml-workshop

Repository for my FrontEnd Masters workshop on Machine Learning in JavaScript

ai javascript machine-learning tensorflowjs

Last synced: 05 Jul 2025

https://github.com/sacdallago/biotrainer

Biological prediction models made simple.

deep-learning language-model machine-learning protein proteins

Last synced: 16 Jan 2026

https://github.com/binds-lab-umass/bindsnet

Simulation of spiking neural networks (SNNs) using PyTorch.

machine-learning pytorch reinforcement-learning spiking-neural-networks

Last synced: 31 Oct 2025

https://github.com/mj-will/nessai

nessai: Nested Sampling with Artificial Intelligence

bayesian-inference bilby machine-learning nested-sampling normalizing-flows python pytorch

Last synced: 20 Feb 2026

https://github.com/chatopera/chatopera.feishu

通过 Feishu 开放平台和 Chatopera 机器人平台上线智能对话机器人服务, 聊天机器人,飞书,lark

ai bot chatbot chatopera dialog feishu lark machine-learning nlp nlu python python3

Last synced: 20 Mar 2025

https://github.com/kupynorest/instance_augmentation

[ECCV 2024] Official Repo for: Dataset Enhancement with Instance-Level Augmentations

augmentation computer-vision diffusion-models eccv2024 image-generation image-processing machine-learning pytorch

Last synced: 03 Aug 2025

https://github.com/nlpodyssey/goslide

SLIDE (Sub-LInear Deep learning Engine) written in Go

artificial-intelligence deep-learning lsh machine-learning sparse-learning

Last synced: 13 Apr 2025

https://github.com/novartis/scar

scAR (single-cell Ambient Remover) is a deep learning model for removal of the ambient signals in droplet-based single cell omics

cite-seq crispr-screen denoising-algorithm generative-model machine-learning probabilistic-graphical-models pytorch single-cell-rna-seq variational-autoencoder

Last synced: 21 Jul 2025

https://github.com/joergmlpts/nature-id

Identify plants, birds, and insects in photos. Should an identification to species be unsuccessful, an identification to a higher taxonomic level - like genus, family, or order - is made.

birds identification identifying-plants image-classification inaturalist insects machine-learning plants python taxonomy

Last synced: 25 Jun 2025

https://github.com/eBay/AutoOpt

Automatic and Simultaneous Adjustment of Learning Rate and Momentum for Stochastic Gradient Descent

hyperparameters learning-rate machine-learning momentum optimization pytorch sgd

Last synced: 08 May 2025

https://github.com/ottogroup/dstoolbox

Tools that make working with scikit-learn and pandas easier.

machine-learning pandas scikit-learn

Last synced: 13 Apr 2025

https://github.com/isair/tensorflow-load-csv

🤖 TensorFlow.js CSV loading on steroids. Clean up, normalise, transform, shuffle, and split your data all in a handful of lines and dive right into the fun parts of ML.

browser csv csv-files javascript machine-learning node tensorflow typescript

Last synced: 23 Jan 2026

https://github.com/crlandsc/torch-log-wmse

logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source separation systems.

ai audio audio-denoising audio-quality audio-quality-assessment audio-separation audio-source-separation bss deep-learning loss-functions machine-learning mss music music-source-separation python sound-processing sound-separation sound-source-separation speech-denoising torch

Last synced: 30 Jan 2026

https://github.com/o19s/rankymcrankface

Hardened Fork of Ranklib learning to rank library

information-retrieval machine-learning search

Last synced: 10 Jul 2025

https://github.com/tri-ml/rap

This is the official code for the paper RAP: Risk-Aware Prediction for Robust Planning: https://arxiv.org/abs/2210.01368

autonomous-driving machine-learning risk-modelling trajectory-prediction

Last synced: 05 May 2025

https://github.com/born-ml/born

Production-ready ML framework for Go with zero dependencies. Train and deploy neural networks as single binaries. PyTorch-like API, type-safe tensors, automatic differentiation.

autodiff automatic-differentiation cpu-backend cross-platform deep-learning go golang high-performance machine-learning neural-networks pure-go tensor type-safety

Last synced: 04 Mar 2026

https://github.com/fedml-ai/fediot

Federated Learning for Internet of Things: A Federated Learning Framework for On-device Anomaly Data Detection, backed by FedML, Inc.

anomaly-detection autoencoder cybersecurity federated-learning iot iot-application machine-learning pytorch raspberry-pi

Last synced: 22 Apr 2025

https://github.com/joaopaulolndev/my-data-scientist-roadmap

Description about my roadmap to become Data Scientist and Engineer Machine Learning

artificial-intelligence data-science deep-learning machine-learning python python3

Last synced: 23 Apr 2025

https://github.com/google-research/pisac

Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)

deep-learning deep-reinforcement-learning information-theory machine-learning reinforcement-learning robotics vision

Last synced: 23 Jun 2025

https://github.com/Novartis/scar

scAR (single-cell Ambient Remover) is a deep learning model for removal of the ambient signals in droplet-based single cell omics

cite-seq crispr-screen denoising-algorithm generative-model machine-learning probabilistic-graphical-models pytorch single-cell-rna-seq variational-autoencoder

Last synced: 28 Sep 2025

https://github.com/shreyansh26/ml-optimizers-jax

Toy implementations of some popular ML optimizers using Python/JAX

adam adam-optimizer gradient-descent jax machine-learning momentum optimization-algorithms optimizers

Last synced: 10 Apr 2025

https://github.com/ianhi/ac295-final-project-jwi

manual image labelling and transfer learning for segmentation

jupyter-widget machine-learning transfer-learning unet-segmentation

Last synced: 20 Mar 2025

https://github.com/aralroca/react-text-toxicity

Detect text toxicity in a simple way, using React. Based in a Keras model, loaded with Tensorflow.js.

javascript machine-learning preact react tensorflow text toxicity

Last synced: 19 Apr 2025