An open API service indexing awesome lists of open source software.

Machine learning

Machine learning is the practice of teaching a computer to learn. The concept uses pattern recognition, as well as other forms of predictive algorithms, to make judgments on incoming data. This field is closely related to artificial intelligence and computational statistics.

https://github.com/kaggledatasets/kaggledatasets

Collection of Kaggle Datasets ready to use for Everyone (Looking for contributors)

data-science datasets deep-learning kaggle keras machine-learning python pytorch scikit-learn tensorflow

Last synced: 20 Jun 2025

https://github.com/s-ol/gpwfc

OpenCL-accelerated Python implementation of the Wave Function Collapse procgen algorithm

algorithm gamedev machine-learning opencl procedural-generation python wfc

Last synced: 21 Mar 2025

https://github.com/codingbeard/tfkg

Create, train, and save Tensorflow Keras models all in Golang

deep-learning go golang gpu-acceleration keras machine-learning tensorflow tf

Last synced: 11 Sep 2025

https://github.com/alanlaboratory/unrealmlagents

The Unreal ML Agents Toolkit is an open-source project that enables Unreal Engine games and simulations to serve as environments for training intelligent agents using deep reinforcement learning. This project is a port of Unity ML-Agents, adapted to work within Unreal Engine.

artificial-intelligence deep-learning deep-reinforcement-learning machine-learning neural-network reinforcement-learning unreal-engine unreal-engine-5 unreal-engine-plugin

Last synced: 11 Sep 2025

https://github.com/mccorby/PhotoLabellerServer

Federated Learning: Parameter Server doing aggregation of updates to a model coming from clients participating in a Federated Learning setup. See also the Android application companion at https://github.com/mccorby/PhotoLabeller

cifar-10 deeplearning4j dl4j federated-learning kotlin machine-learning

Last synced: 10 May 2025

https://github.com/dipanjans/adv_nlp_workshop_odsc_europe22

Extensive tutorials for the Advanced NLP Workshop in Open Data Science Conference Europe 2020. We will leverage deep learning and deep transfer learning to solve popular tasks in NLP including Classification, Information Retrieval, Sentiment Analysis, Search Engines, Clustering, Paraphrase Mining, Summarization, Language Translation, Q&A systems

deep-learning gensim huggingface jupyter-notebook machine-learning natural-language-processing python pytorch tensorflow transfer-learning transformers

Last synced: 24 Aug 2025

https://github.com/konst-int-i/healnet

Multimodal fusion for heterogeneous biomedical data. NeurIPS 2024.

computational-pathology machine-learning multimodal representation-learning

Last synced: 10 Apr 2025

https://github.com/thecodrr/vspeech

📒 Complete V bindings for Mozilla's DeepSpeech TensorFlow-based Speech-to-Text library. 📜

deepspeech machine-learning mozilla speech-to-text tensorflow v

Last synced: 06 Mar 2026

https://github.com/rickiepark/intro_ml_with_python_2nd_revised

Code repository for the book "Machine Learning with Python Libraries" (translated edition, revised 2nd edition)

machine-learning numpy pandas python scikit-learn

Last synced: 10 Apr 2025

https://github.com/takashiishida/pconf

[NeurIPS 2018] Code for the paper "Binary Classification from Positive-Confidence Data"

deep-learning machine-learning

Last synced: 11 Apr 2025

https://github.com/IBM/MAX-OCR

MAX Optical Character Recognition

docker-image machine-learning tesseract-ocr-engine

Last synced: 13 May 2025

https://github.com/nzw0301/lightLDA

fast sampling algorithm based on CGS

lda machine-learning nlp python topic-modeling

Last synced: 03 Apr 2025

https://github.com/icanerdogan/google-mlkit-android-apps

This repository contains a collection of Android applications developed using Google ML Kit, demonstrating the power and versatility of machine learning features in mobile development. Each project is crafted using Java and Kotlin, showcasing various use cases and practical implementations.

android android-app android-application barcode-scanner face-detection firebase google google-api image-labeling java java-android kotlin kotlin-android machine-learning mlkit mlkit-face-detection mlkit-pose-detection mlkit-text-recognition pose-detection text-recognition

Last synced: 28 Apr 2025

https://github.com/tshrjn/env-zoo

A curated list of reinforcement learning environments and frameworks.

deep-learning machine-learning reinforcement-learning reinforcement-learning-playground

Last synced: 02 Apr 2025

https://github.com/lddl/cnns

Convolutional Neural Networks in Go

cnn convolutional-neural-networks machine-learning mlp neural-networks

Last synced: 10 Jul 2025

https://github.com/praktiskt/featuretoolsr

An R interface to the Python module Featuretools

feature-engineering featuretools machine-learning r-package rstats

Last synced: 11 Mar 2026

https://github.com/dls5-omics/multimolecule

Accelerate Molecular Biology Research with Machine Learning

ai4science machine-learning molecular-biology

Last synced: 28 Feb 2026

https://github.com/allenai/aspire

Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.

document-similarity information-retrieval machine-learning natural-language-processing

Last synced: 13 Oct 2025

https://github.com/masatoi/cl-online-learning

A collection of machine learning algorithms for online linear classification written in Common Lisp

classifier common-lisp machine-learning perceptron

Last synced: 20 Jan 2026

https://github.com/metatensor/metatrain

Train, fine-tune, and manipulate machine learning models for atomistic systems

atomistic-simulations machine-learning molecular-dynamics torch

Last synced: 12 Jan 2026

https://github.com/d4l3k/go-bayesopt

A library for doing Bayesian Optimization using Gaussian Processes (blackbox optimizer) in Go/Golang.

bayesianoptimization bayesopt blackbox-optimizer gaussian-processes go hyperparameter-optimization machine-learning optimization

Last synced: 12 Apr 2025

https://github.com/lexiestleszek/namegen

Self-contained, minimalistic implementation of a language model that generates coherent and normal sounding names. It uses an input dataset of names and probability distribution to generate new names based on the sequences of four characters.

language-model machine-learning markov-chain name-generation natural-language-processing nlp

Last synced: 30 Oct 2025

https://github.com/the-database/trainner-redux

Deep learning training framework for image super resolution and restoration.

deep-learning image-restoration machine-learning neural-network python pytorch super-resolution upscale

Last synced: 20 Mar 2025

https://github.com/dselivanov/ftrl

R/Rcpp implementation of the 'Follow-the-Regularized-Leader' algorithm

ftrl logistic-regression machine-learning r sgd

Last synced: 26 Jun 2025

https://github.com/eric-bradford/sdd-gp-mpc

This repository contains the source code for "Stochastic data-driven model predictive control using Gaussian processes" (SDD-GP-MPC).

casadi chemical-engineering constraints differential-equations gaussian-processes machine-learning model-predictive-control monte-carlo-simulation optimization-algorithms python3 state-space-model stochastic-processes

Last synced: 16 Jun 2025

https://github.com/bsc-wdc/dislib

The Distributed Computing library for Python, implemented using the PyCOMPSs programming model for HPC.

big-data distributed-computing hpc machine-learning python

Last synced: 14 Aug 2025

https://github.com/onesuper/HuggingFace-Datasets-Text-Quality-Analysis

Retrieves Parquet files from Hugging Face, then identifies and quantifies junk data, duplication, contamination, and biased content in the dataset using pandas

dataset huggingface-datasets llm machine-learning nlp streamlit text-processing

Last synced: 22 Jul 2025

https://github.com/bhavik-jikadara/ai-ml-roadmap

Welcome to the ultimate guide for starting your journey in Artificial Intelligence and Machine Learning in 2025! This roadmap provides a step-by-step approach to mastering AI and ML, from fundamentals to advanced topics.

artificial-intelligence computer-vision deep-learning deployment fundamentals-of-programming keras libraries machine-learning mathematics mlops natural-language-processing production-code pytorch reinforcement-learning roadmap scikit-learn tensorflow tools

Last synced: 30 Apr 2025

https://github.com/oracle-samples/pgx-samples

Applications using Parallel Graph AnalytiX (PGX) from Oracle Labs

graph graph-algorithms graph-analytics graph-machine-learning machine-learning

Last synced: 07 Apr 2025

https://github.com/kalininalab/datasail

DataSAIL is a tool to split datasets while reducing information leakage.

dataset-split ilp ilp-problem machine-learning optimization scip

Last synced: 08 Apr 2026

https://github.com/bearloga/maltese

Little R utility package for making time series data more machine learning-friendly

forecasting machine-learning r r-package rstats time-series

Last synced: 21 Sep 2025

https://github.com/tusharsarkar3/tla

A comprehensive tool for linguistic analysis of communities

hacktoberfest machine-learning nlp pytorch sentiment-analysis text-classification

Last synced: 14 Apr 2025

https://github.com/donny-hikari/viola-jones

A face detection program in Python using the Viola-Jones algorithm.

adaboost boosting computer-vision emsembling face-detection haar-cascade machine-learning viola-jones

Last synced: 12 Apr 2025

https://github.com/fkie-cad/comidds

A comprehensive survey of datasets for research in host-based and/or network-based intrusion detection, with a focus on enterprise networks

cybersecurity datasets events intrusion-detection logs machine-learning netflow

Last synced: 06 Mar 2026

https://github.com/bigd4/PyNEP

A python interface of NEP

machine-learning python

Last synced: 04 May 2025

https://github.com/roboticsclubiitj/ml-dl-implementation

An implementation of ML and DL algorithms from scratch in python using nothing but NumPy and Matplotlib.

deep-learning hacktoberfest machine-learning matplotlib numpy nwoc python statistics woc

Last synced: 07 May 2025

https://github.com/ctuavastlab/jsongrinder.jl

Machine learning with Mill.jl for JSON documents

flux hierarchical-data json julia machine-learning multi-instance-learning

Last synced: 09 Apr 2025

https://github.com/ekinakyurek/gan-70-lines-of-julia

A Knet implementation of MLP GAN for MNIST data.

adversarial-networks gan knet machine-learning mlp-gan

Last synced: 13 Apr 2025

https://github.com/Piyushi-0/ACE

Code for our ICML '19 paper: Neural Network Attributions: A Causal Perspective.

attribution-methods causality deep-learning icml machine-learning neural-network

Last synced: 17 Sep 2025

https://github.com/jacksonburns/astartes

Better Data Splits for Machine Learning

ai data-science machine-learning ml python sampling

Last synced: 21 Aug 2025

https://github.com/loaiabdalslam/fc

Face enhancer - Denoising Auto Encoder by Tensorflow and Keras and skimage

auto-encoder deep-learning image-processing machine-learning remini tensorflow

Last synced: 16 Oct 2025

https://github.com/lsjsj92/keras_basic

Learning the basics of deep learning with Keras

cnn deep-learning deeplearning example keras lstm machine-learning python

Last synced: 07 Mar 2026

https://github.com/misaogura/mrnet

PyTorch implementation of the MRNet paper, developed for the MRNet Competition hosted by the Stanford ML Group

convolutional-neural-networks deep-learning deep-neural-networks machine-learning paper-implementations pytorch pytorch-implementation

Last synced: 21 Aug 2025

https://github.com/andi611/conditional-seqgan-tensorflow

Conditional Sequence Generative Adversarial Network trained with policy gradient, Implementation in Tensorflow

chatbot conditional-gan gan machine-learning nlp nlp-machine-learning seqgan tensorflow

Last synced: 13 Apr 2025

https://github.com/m-jovanovic/digit-recognizer

Small neural network framework developed in C#, specialized in digit classification (MNIST dataset)

machine-learning mnist-classification neural-networks

Last synced: 29 Jun 2025

https://github.com/caraml-dev/mlp

A platform for developing and operating machine learning systems at the various stages of the machine learning life cycle.

machine-learning

Last synced: 04 Feb 2026

https://github.com/ndrplz/semiparametric

[TPAMI 2020] Generating Novel Views of Vehicles via Semi-parametric Guidance. A semi-parametric approach for synthesizing novel views of a rigid object from a single monocular image.

computer-vision convolutional-neural-networks deep-learning image-synthesis machine-learning novel-viewpoint-synthesis pascal3d semi-parametric semi-parametric-learning

Last synced: 10 Jul 2025

https://github.com/pietrobarbiero/logic_explained_networks

Logic Explained Networks is a python repository implementing explainable-by-design deep learning models.

deep-learning explainable-ai logic machine-learning neural-networks xai

Last synced: 10 Oct 2025

https://github.com/cyberlife-coder/velesdb

VelesDB is a local-first AI data engine written in Rust that unifies vectors, full-text and graph in a single file with a familiar SQL-like language. Instead of sending every RAG or semantic search query to a remote cluster, VelesDB runs directly on your server, laptop, browser, mobile or edge device: no cloud dependency, no external services, ..

ai ai-memory all-in-one-databse columnstore-database embeddings graph-database hnsw local-first machine-learning rag rust search-engine vector-database

Last synced: 30 Apr 2026

https://github.com/lukasmosser/stochastic_seismic_waveform_inversion

Official Implementation of "Stochastic seismic waveform inversion using generative adversarial networks as a geological prior"

bayesian-inference generative-adversarial-network geophysics machine-learning

Last synced: 15 Apr 2025

https://github.com/cambricon/cnstream

CNStream is a streaming framework for building Cambricon machine learning pipelines http://forum.cambricon.com https://gitee.com/SolutionSDK/CNStream

c-plus-plus cambricon cambricon-cnstream computer-vision graph-framework inference machine-learning mlu pipeline-framework

Last synced: 26 Dec 2025

https://github.com/ml-tooling/lazycluster

🎛 Distributed machine learning made simple.

cluster dask distributed-computing hyperopt machine-learning python ssh

Last synced: 30 Dec 2025

https://github.com/SAP-samples/btp-ai-sustainability-bootcamp

This github repository contains the sample code and exercises of btp-ai-sustainability-bootcamp, which showcases how to build Intelligence and Sustainability into Your Solutions on SAP Business Technology Platform with SAP AI Core and SAP Analytics Cloud for Planning.

computer-vision condition-monitoring deep-learning defect-detection image-segmentation machine-learning predictive-maintenance sac-planning sample sample-code sap-ai-core sap-ai-launchpad sap-analytics-cloud sound-classification sustainability

Last synced: 07 May 2025

https://github.com/wangz10/class_imbalance

Jupyter Notebook presentation for class imbalance in binary classification

classification imbalanced-data machine-learning tutorial

Last synced: 11 May 2025

https://github.com/sanity/pairadjacentviolators

A JVM implementation of the Pair Adjacent Violators algorithm for isotonic regression

isotonic-regression java jvm-languages kotlin machine-learning regression

Last synced: 20 Jun 2025

https://github.com/favstats/perspective

An R wrapper for Conversation AI's Perspective API

machine-learning perspective-api rstats toxic-comment-classification

Last synced: 15 Jan 2026

https://github.com/plainerman/variational-doob

Lagrangian formulation of Doob's h-transform allowing for efficient rare event sampling

machine-learning neural-networks transition-paths variational-method

Last synced: 02 Mar 2026

https://github.com/fridiculous/django-estimators

A Django app to persist and retrieve scikit-learn machine learning models

django machine-learning scikit-learn

Last synced: 26 Oct 2025

https://github.com/faceplugin-ltd/id-document-liveness-detection

The world's 1st completely free, open-source ID Document Liveness Detection SDK which can detect fake ID cards, Driver Licenses and Passports. Ideal for developers looking for robust fraud-prevention tools.

anti-spoofing deep-learning document-liveness fraud-detection fraud-prevention id-document-liveness identity-verification liveness-detection liveness-detection-sdk machine-learning

Last synced: 03 Aug 2025

https://github.com/kennethleungty/end-to-end-automl-insurance

An End-to-End Implementation of AutoML with H2O, MLflow, FastAPI, and Streamlit for Insurance Cross-Sell

automl data-science fastapi h2o h2o-automl machine-learning mlflow mlops python streamlit

Last synced: 12 Jul 2025

https://github.com/seeed-studio/sensecraft-ai

An application suite including an open-source inference server and web UI to deploy any YOLOv8 model to NVIDIA Jetson devices and visualize captured streams, with one line of code.

computer-vision deep-learning instance-segmentation jetpack jetson-orin machine-learning nvidia-jetson object-detection orin-nano orin-nx pytorch yolov5 yolov8

Last synced: 24 Oct 2025

https://github.com/solygambas/mlops-projects

Hands-on MLOps projects to explore and learn the practical aspects of machine learning engineering for production.

docker fastapi google-cloud google-cloud-platform huggingface huggingface-transformers keras kubectl kubeflow kubernetes machine-learning mlops python scikit-learn tensorflow

Last synced: 19 Mar 2025

https://github.com/davisidarta/dbmap

A fast, accurate, and modularized dimensionality reduction approach based on diffusion harmonics and graph layouts. Scales to millions of samples on a personal laptop. Adds the intrinsic structure of high-dimensional big data to your clustering and data visualization workflow.

denoising diffusion-process dimensionality-reduction graph-layout high-dimensional machine-learning nearest-neighbors single-cell umap visualization

Last synced: 06 Mar 2026

https://github.com/vzhong/e3

Dockerized code for E3: Entailment-driven Extracting and Editing for Conversational Machine Reading.

deep-learning machine-learning nlp

Last synced: 12 Apr 2025

https://github.com/davidrosenberg/ttml2021fall

DS-GA 3001: Tools and Techniques for Machine Learning (NYU Fall 2021)

machine-learning

Last synced: 25 Aug 2025

https://github.com/sudheerachary/Manga_Colorization

cGAN-based Manga Colorization Using a Single Training Image.

cgan comics computer-vision image-processing machine-learning manga

Last synced: 26 Sep 2025

https://github.com/daun-io/study-data-science

Practical data science notebooks that I used to study in 2016

data-science jupyter-notebook machine-learning tensorflow

Last synced: 13 May 2025