An open API service indexing awesome lists of open source software.

Machine learning

Machine learning is the practice of teaching a computer to learn. The concept uses pattern recognition, as well as other forms of predictive algorithms, to make judgments on incoming data. This field is closely related to artificial intelligence and computational statistics.

https://github.com/konst-int-i/healnet

Multimodal fusion for heterogeneous biomedical data. NeurIPS 2024.

computational-pathology machine-learning multimodal representation-learning

Last synced: 10 Apr 2025

https://github.com/rickiepark/intro_ml_with_python_2nd_revised

<파이썬 라이브러리를 활용한 머신러닝 (번역개정2판)>의 코드 저장소

machine-learning numpy pandas python scikit-learn

Last synced: 10 Apr 2025

https://github.com/bhavik-jikadara/ai-ml-roadmap

Welcome to the ultimate guide for starting your journey in Artificial Intelligence and Machine Learning in 2025! This roadmap provides a step-by-step approach to mastering AI and ML, from fundamentals to advanced topics.

artificial-intelligence computer-vision deep-learning deployment fundamentals-of-programming keras libraries machine-learning mathematics mlops natural-language-processing production-code pytorch reinforcement-learning roadmap scikit-learn tensorflow tools

Last synced: 30 Apr 2025

https://github.com/eric-bradford/sdd-gp-mpc

This repository contains the source code for "Stochastic data-driven model predictive control using Gaussian processes" (SDD-GP-MPC).

casadi chemical-engineering constraints differential-equations gaussian-processes machine-learning model-predictive-control monte-carlo-simulation optimization-algorithms python3 state-space-model stochastic-processes

Last synced: 16 Jun 2025

https://github.com/d4l3k/go-bayesopt

A library for doing Bayesian Optimization using Gaussian Processes (blackbox optimizer) in Go/Golang.

bayesianoptimization bayesopt blackbox-optimizer gaussian-processes go hyperparameter-optimization machine-learning optimization

Last synced: 12 Apr 2025

https://github.com/dipanjans/adv_nlp_workshop_odsc_europe22

Extensive tutorials for the Advanced NLP Workshop in Open Data Science Conference Europe 2020. We will leverage deep learning and deep transfer learning to solve popular tasks in NLP including Classification, Information Retrieval, Sentiment Analysis, Search Engines, Clustering, Paraphrase Mining, Summarization, Language Translation, Q&A systems

deep-learning gensim huggingface jupyter-notebook machine-learning natural-language-processing python pytorch tensorflow transfer-learning transformers

Last synced: 24 Aug 2025

https://github.com/kaggledatasets/kaggledatasets

Collection of Kaggle Datasets ready to use for Everyone (Looking for contributors)

data-science datasets deep-learning kaggle keras machine-learning python pytorch scikit-learn tensorflow

Last synced: 20 Jun 2025

https://github.com/nzw0301/lightLDA

fast sampling algorithm based on CGS

lda machine-learning nlp python topic-modeling

Last synced: 03 Apr 2025

https://github.com/onesuper/HuggingFace-Datasets-Text-Quality-Analysis

Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in dataset using pandas

dataset huggingface-datasets llm machine-learning nlp streamlit text-processing

Last synced: 22 Jul 2025

https://github.com/jeffthompson/word2vecandtsne

Scripts demo-ing how to train a Word2Vec model and reduce its vector space

gensim language machine-learning python sklearn tsne word2vec words

Last synced: 12 May 2025

https://github.com/googleforgames/clean-chat

Disruptive Behavior Mitigation Framework for Games

game-development machine-learning machine-learning-games multiplayer python

Last synced: 11 Apr 2025

https://github.com/nzw0301/lightlda

fast sampling algorithm based on CGS

lda machine-learning nlp python topic-modeling

Last synced: 30 Apr 2025

https://github.com/spaceml-org/starcop

Official code for STARCOP: Semantic Segmentation of Methane Plumes with Hyperspectral Machine Learning models :rainbow::artificial_satellite:

aviris aviris-ng emit hyperspectral hyperspectral-datasets machine-learning methane methane-detection

Last synced: 14 Apr 2025

https://github.com/tshrjn/env-zoo

A curated list of reinforcement learning environments and frameworks.

deep-learning machine-learning reinforcement-learning reinforcement-learning-playground

Last synced: 02 Apr 2025

https://github.com/lddl/cnns

Convolutional Neural Networks in Go

cnn convolutional-neural-networks machine-learning mlp neural-networks

Last synced: 10 Jul 2025

https://github.com/rvandewater/yaib

🧪Yet Another ICU Benchmark: a holistic framework for the standardization of clinical prediction model experiments. Provide custom datasets, cohorts, prediction tasks, endpoints, preprocessing, and models. Paper: https://arxiv.org/abs/2306.05109

amsterdamumcdb benchmark clinical-data clinical-ml deep-learning ehr eicu-crd framework hirid-dataset icu machine-learning mimic-iii mimic-iv patient-monitoring time-series

Last synced: 06 Apr 2025

https://github.com/madhurimarawat/semester-notes

A comprehensive, well-structured repository of B.Tech (Hons) CSE notes and learning resources, specializing in Artificial Intelligence and Data Science. Includes semester-wise notes, question papers, curated study guides, and indexed materials designed for efficient learning, revision, and academic reference.

artificial-intelligence btech-notes computer-networks computer-organization-architecture computer-science cse-notes data-science data-visualization database-management-system engineering-mathematics engineering-notes learning-resources machine-learning object-oriented-programming operating-systems probability-and-statistics python-for-data-science semester-notes study-materials theory-of-computation

Last synced: 07 Mar 2026

https://github.com/aldro61/kover

Learn interpretable computational phenotyping models from k-merized genomic data

biomarker-discovery genomics k-mer machine-learning phenotypes

Last synced: 11 Oct 2025

https://github.com/dselivanov/ftrl

R/Rcpp implementation of the 'Follow-the-Regularized-Leader' algorithm

ftrl logistic-regression machine-learning r sgd

Last synced: 26 Jun 2025

https://github.com/benedekrozemberczki/feather

The reference implementation of FEATHER from the CIKM '20 paper "Characteristic Functions on Graphs: Birds of a Feather, from Statistical Descriptors to Parametric Models".

data-mining deep-learning deep-neural-networks deepwalk graph graph-classification graph-convolution graph-embedding graph-kernel graph2vec machine-learning network-embedding networkx neural-network node-classification node-embedding node2vec pytorch representation-learning tensorflow

Last synced: 11 Apr 2025

https://github.com/lexiestleszek/namegen

Self-contained, minimalistic implementation of a language model that generates coherent and normal sounding names. It uses an input dataset of names and probability distribution to generate new names based on the sequences of four characters.

language-model machine-learning markov-chain name-generation natural-language-processing nlp

Last synced: 30 Oct 2025

https://github.com/metatensor/metatrain

Train, fine-tune, and manipulate machine learning models for atomistic systems

atomistic-simulations machine-learning molecular-dynamics torch

Last synced: 12 Jan 2026

https://github.com/alanlaboratory/unrealmlagents

The Unreal ML Agents Toolkit is an open-source project that enables Unreal Engine games and simulations to serve as environments for training intelligent agents using deep reinforcement learning. This project is a port of Unity ML-Agents, adapted to work within Unreal Engine.

artificial-intelligence deep-learning deep-reinforcement-learning machine-learning neural-network reinforcement-learning unreal-engine unreal-engine-5 unreal-engine-plugin

Last synced: 11 Sep 2025

https://github.com/sanity/pairadjacentviolators

A JVM implementation of the Pair Adjacent Violators algorithm for isotonic regression

isotonic-regression java jvm-languages kotlin machine-learning regression

Last synced: 20 Jun 2025

https://github.com/pietrobarbiero/logic_explained_networks

Logic Explained Networks is a python repository implementing explainable-by-design deep learning models.

deep-learning explainable-ai logic machine-learning neural-networks xai

Last synced: 10 Oct 2025

https://github.com/plainerman/variational-doob

Lagrangian formulation of Doob's h-transform allowing for efficient rare event sampling

machine-learning neural-networks transition-paths variational-method

Last synced: 02 Mar 2026

https://github.com/lsjsj92/keras_basic

keras를 이용한 딥러닝 기초 학습

cnn deep-learning deeplearning example keras lstm machine-learning python

Last synced: 07 Mar 2026

https://github.com/Piyushi-0/ACE

Code for our ICML '19 paper: Neural Network Attributions: A Causal Perspective.

attribution-methods causality deep-learning icml machine-learning neural-network

Last synced: 17 Sep 2025

https://github.com/bearloga/maltese

Little R utility package for making time series data more machine learning-friendly

forecasting machine-learning r r-package rstats time-series

Last synced: 21 Sep 2025

https://github.com/faceplugin-ltd/id-document-liveness-detection

The world's 1st completely free, open-source ID Document Liveness Detection SDK which can detect fake ID cards, Driver Licenses and Passports.Ideal for developers looking for robust, fraud-prevention tools.

anti-spoofing deep-learning document-liveness fraud-detection fraud-prevention id-document-liveness identity-verification liveness-detection liveness-detection-sdk machine-learning

Last synced: 03 Aug 2025

https://github.com/kennethleungty/end-to-end-automl-insurance

An End-to-End Implementation of AutoML with H2O, MLflow, FastAPI, and Streamlit for Insurance Cross-Sell

automl data-science fastapi h2o h2o-automl machine-learning mlflow mlops python streamlit

Last synced: 12 Jul 2025

https://github.com/SAP-samples/btp-ai-sustainability-bootcamp

This github repository contains the sample code and exercises of btp-ai-sustainability-bootcamp, which showcases how to build Intelligence and Sustainability into Your Solutions on SAP Business Technology Platform with SAP AI Core and SAP Analytics Cloud for Planning.

computer-vision condition-monitoring deep-learning defect-detection image-segmentation machine-learning predictive-maintenance sac-planning sample sample-code sap-ai-core sap-ai-launchpad sap-analytics-cloud sound-classification sustainability

Last synced: 07 May 2025

https://github.com/tusharsarkar3/tla

A comprehensive tool for linguistic analysis of communities

hacktoberfest machine-learning nlp pytorch sentiment-analysis text-classification

Last synced: 14 Apr 2025

https://github.com/misaogura/mrnet

PyTorch implementation of the MRNet paper, developed for the MRNet Competition hosted by the Stanford ML Group

convolutional-neural-networks deep-learning deep-neural-networks machine-learning paper-implementations pytorch pytorch-implementation

Last synced: 21 Aug 2025

https://github.com/m-jovanovic/digit-recognizer

Small neural network framework developed in C#, specialized in digit classification (MNIST dataset)

machine-learning mnist-classification neural-networks

Last synced: 29 Jun 2025

https://github.com/fridiculous/django-estimators

a django app to persist and retrieve scikit learn machine learning models

django machine-learning scikit-learn

Last synced: 26 Oct 2025

https://github.com/donny-hikari/viola-jones

A face detection program in python using Viola-Jones algorithm.

adaboost boosting computer-vision emsembling face-detection haar-cascade machine-learning viola-jones

Last synced: 12 Apr 2025

https://github.com/kalininalab/datasail

DataSAIL is a tool to split datasets while reducing information leakage.

dataset-split ilp ilp-problem machine-learning optimization scip

Last synced: 08 Apr 2026

https://github.com/loaiabdalslam/fc

Face enhancer‏ - Denoising Auto Encoder by Tensorflow and Keras and skimage

auto-encoder deep-learning image-processing machine-learning remini tensorflow

Last synced: 16 Oct 2025

https://github.com/ekinakyurek/gan-70-lines-of-julia

A Knet implementation of MLP GAN for MNIST data.

adversarial-networks gan knet machine-learning mlp-gan

Last synced: 13 Apr 2025

https://github.com/ml-tooling/lazycluster

🎛 Distributed machine learning made simple.

cluster dask distributed-computing hyperopt machine-learning python ssh

Last synced: 30 Dec 2025

https://github.com/fkie-cad/comidds

A comprehensive survey of datasets for research in host-based and/or network-based intrusion detection, with a focus on enterprise networks

cybersecurity datasets events intrusion-detection logs machine-learning netflow

Last synced: 06 Mar 2026

https://github.com/ndrplz/semiparametric

[TPAMI 2020] Generating Novel Views of Vehicles via Semi-parametric Guidance. A semi-parametric approach for synthesizing novel views of a rigid object from a single monocular image.

computer-vision convolutional-neural-networks deep-learning image-synthesis machine-learning novel-viewpoint-synthesis pascal3d semi-parametric semi-parametric-learning

Last synced: 10 Jul 2025

https://github.com/ctuavastlab/jsongrinder.jl

Machine learning with Mill.jl for JSON documents

flux hierarchical-data json julia machine-learning multi-instance-learning

Last synced: 09 Apr 2025

https://github.com/caraml-dev/mlp

A platform for developing and operating the machine learning systems at the various stages of machine learning life cycle.

machine-learning

Last synced: 04 Feb 2026

https://github.com/bigd4/PyNEP

A python interface of NEP

machine-learning python

Last synced: 04 May 2025

https://github.com/wangz10/class_imbalance

Jupyter Notebook presentation for class imbalance in binary classification

classification imbalanced-data machine-learning tutorial

Last synced: 11 May 2025

https://github.com/andi611/conditional-seqgan-tensorflow

Conditional Sequence Generative Adversarial Network trained with policy gradient, Implementation in Tensorflow

chatbot conditional-gan gan machine-learning nlp nlp-machine-learning seqgan tensorflow

Last synced: 13 Apr 2025

https://github.com/favstats/perspective

An R wrapper for Conversation AI's Perspective API

machine-learning perspective-api rstats toxic-comment-classification

Last synced: 15 Jan 2026

https://github.com/LdDl/cnns

Convolutional Neural Networks in Go

cnn convolutional-neural-networks machine-learning mlp neural-networks

Last synced: 08 Apr 2025

https://github.com/roboticsclubiitj/ml-dl-implementation

An implementation of ML and DL algorithms from scratch in python using nothing but NumPy and Matplotlib.

deep-learning hacktoberfest machine-learning matplotlib numpy nwoc python statistics woc

Last synced: 07 May 2025

https://github.com/cyberlife-coder/velesdb

VelesDB is a local‑first AI data engine written in Rust that unifies vectors, full‑text and graph in a single file with a familiar SQL‑like language. Instead of sending every RAG or semantic search query to a remote cluster, VelesDB runs directly on your server, laptop, browser, mobile or edge device — no cloud dependency, no external services, ..

ai ai-memory all-in-one-databse columnstore-database embeddings graph-database hnsw local-first machine-learning rag rust search-engine vector-database

Last synced: 30 Apr 2026

https://github.com/jacksonburns/astartes

Better Data Splits for Machine Learning

ai data-science machine-learning ml python sampling

Last synced: 21 Aug 2025

https://github.com/praktiskt/featuretoolsR

An R interface to the Python module Featuretools

feature-engineering featuretools machine-learning r-package rstats

Last synced: 13 Jul 2025

https://github.com/oracle-samples/pgx-samples

Applications using Parallel Graph AnalytiX (PGX) from Oracle Labs

graph graph-algorithms graph-analytics graph-machine-learning machine-learning

Last synced: 07 Apr 2025

https://github.com/lukasmosser/stochastic_seismic_waveform_inversion

Official Implementation of "Stochastic seismic waveform inversion using generative adversarial networks as a geological prior"

bayesian-inference generative-adversarial-network geophysics machine-learning

Last synced: 15 Apr 2025

https://github.com/cambricon/cnstream

CNStream is a streaming framework for building Cambricon machine learning pipelines http://forum.cambricon.com https://gitee.com/SolutionSDK/CNStream

c-plus-plus cambricon cambricon-cnstream computer-vision graph-framework inference machine-learning mlu pipeline-framework

Last synced: 26 Dec 2025

https://github.com/soniccodes/lucid-v1

realtime latent world model inference demo

diffusion-models generative-model machine-learning minecraft

Last synced: 13 Jun 2026

https://github.com/davisidarta/dbmap

A fast, accurate, and modularized dimensionality reduction approach based on diffusion harmonics and graph layouts. Escalates to millions of samples on a personal laptop. Adds high-dimensional big data intrinsic structure to your clustering and data visualization workflow.

denoising diffusion-process dimensionality-reduction graph-layout high-dimensional machine-learning nearest-neighbors single-cell umap visualization

Last synced: 06 Mar 2026

https://github.com/anthonymrios/multi-label-zero-shot

Few- and Zero-shot Multi-Label Learning for Structured Label Spaces

biomedical-informatics machine-learning natural-language-processing neural-networks

Last synced: 09 Jul 2025

https://github.com/junlulocky/airticketpredicting

Machine Learning modeling for Air Ticket Predicting

air-ticket-price-prediction machine-learning machine-learning-modeling

Last synced: 12 Apr 2025

https://github.com/daun-io/study-data-science

Practical data science notebooks that I used to study at 2016

data-science jupyter-notebook machine-learning tensorflow

Last synced: 13 May 2025

https://github.com/argosopentech/metaltranslate

Customizable machine translation in C++

machine-learning nlp nlp-machine-learning translation

Last synced: 14 Apr 2025

https://github.com/vsergeyev/loudml-grafana-app

Visualization panel and datasource for Grafana to connect with Loud ML AI solution for ICT and IoT automation

ai anomaly-detection baseline datasource docker donut grafana graph loudml machine-learning ml model monitoring panel plugin prediction

Last synced: 27 Jan 2026

https://github.com/center-for-threat-informed-defense/technique-inference-engine

TIE is a machine learning model for inferring associated MITRE ATT&CK techniques from previously observed techniques.

ctid cyber-threat-intelligence cybersecurity machine-learning mitre-attack threat-informed-dense

Last synced: 12 Apr 2025