An open API service indexing awesome lists of open source software.

Machine learning

Machine learning is the practice of teaching a computer to learn. The concept uses pattern recognition, as well as other forms of predictive algorithms, to make judgments on incoming data. This field is closely related to artificial intelligence and computational statistics.

https://github.com/rvandewater/yaib

🧪 Yet Another ICU Benchmark: a holistic framework for the standardization of clinical prediction model experiments. Provide custom datasets, cohorts, prediction tasks, endpoints, preprocessing, and models. Paper: https://arxiv.org/abs/2306.05109

amsterdamumcdb benchmark clinical-data clinical-ml deep-learning ehr eicu-crd framework hirid-dataset icu machine-learning mimic-iii mimic-iv patient-monitoring time-series

Last synced: 06 Apr 2025

https://github.com/praktiskt/featuretoolsr

An R interface to the Python module Featuretools

feature-engineering featuretools machine-learning r-package rstats

Last synced: 11 Mar 2026

https://github.com/aldro61/kover

Learn interpretable computational phenotyping models from k-merized genomic data

biomarker-discovery genomics k-mer machine-learning phenotypes

Last synced: 11 Oct 2025

https://github.com/alanlaboratory/unrealmlagents

The Unreal ML Agents Toolkit is an open-source project that enables Unreal Engine games and simulations to serve as environments for training intelligent agents using deep reinforcement learning. This project is a port of Unity ML-Agents, adapted to work within Unreal Engine.

artificial-intelligence deep-learning deep-reinforcement-learning machine-learning neural-network reinforcement-learning unreal-engine unreal-engine-5 unreal-engine-plugin

Last synced: 11 Sep 2025

https://github.com/nzw0301/lightlda

Fast sampling algorithm based on collapsed Gibbs sampling (CGS)

lda machine-learning nlp python topic-modeling

Last synced: 30 Apr 2025

https://github.com/takashiishida/pconf

[NeurIPS 2018] Code for the paper "Binary Classification from Positive-Confidence Data"

deep-learning machine-learning

Last synced: 11 Apr 2025

https://github.com/the-database/trainner-redux

Deep learning training framework for image super resolution and restoration.

deep-learning image-restoration machine-learning neural-network python pytorch super-resolution upscale

Last synced: 20 Mar 2025

https://github.com/d4l3k/go-bayesopt

A library for doing Bayesian Optimization using Gaussian Processes (blackbox optimizer) in Go/Golang.

bayesianoptimization bayesopt blackbox-optimizer gaussian-processes go hyperparameter-optimization machine-learning optimization

Last synced: 12 Apr 2025

https://github.com/benedekrozemberczki/FEATHER

The reference implementation of FEATHER from the CIKM '20 paper "Characteristic Functions on Graphs: Birds of a Feather, from Statistical Descriptors to Parametric Models".

data-mining deep-learning deep-neural-networks deepwalk graph graph-classification graph-convolution graph-embedding graph-kernel graph2vec machine-learning network-embedding networkx neural-network node-classification node-embedding node2vec pytorch representation-learning tensorflow

Last synced: 17 Apr 2025

https://github.com/tatevkaren/artificial-neural-network-business_case_study

Business Case Study to predict customer churn rate based on Artificial Neural Network (ANN), with TensorFlow and Keras in Python. This is a customer churn analysis that contains training, testing, and evaluation of an ANN model. (Includes: Case Study Paper, Code)

ann ann-model artificial-neural-network artificial-neural-networks bank-customers case-study churn-analysis data-science deep-learning machine-learning prediction-model predictive-analytics python3 tensorflow-tutorials

Last synced: 02 May 2025

https://github.com/bsc-wdc/dislib

The Distributed Computing Library for Python, implemented using the PyCOMPSs programming model for HPC.

big-data distributed-computing hpc machine-learning python

Last synced: 14 Aug 2025

https://github.com/madhurimarawat/semester-notes

A comprehensive, well-structured repository of B.Tech (Hons) CSE notes and learning resources, specializing in Artificial Intelligence and Data Science. Includes semester-wise notes, question papers, curated study guides, and indexed materials designed for efficient learning, revision, and academic reference.

artificial-intelligence btech-notes computer-networks computer-organization-architecture computer-science cse-notes data-science data-visualization database-management-system engineering-mathematics engineering-notes learning-resources machine-learning object-oriented-programming operating-systems probability-and-statistics python-for-data-science semester-notes study-materials theory-of-computation

Last synced: 07 Mar 2026

https://github.com/konst-int-i/healnet

Multimodal fusion for heterogeneous biomedical data. NeurIPS 2024.

computational-pathology machine-learning multimodal representation-learning

Last synced: 10 Apr 2025

https://github.com/thecodrr/vspeech

📒 Complete V bindings for Mozilla's DeepSpeech TensorFlow-based Speech-to-Text library. 📜

deepspeech machine-learning mozilla speech-to-text tensorflow v

Last synced: 06 Mar 2026

https://github.com/rickiepark/intro_ml_with_python_2nd_revised

Code repository for the book "Machine Learning Using Python Libraries" (revised 2nd edition of the Korean translation)

machine-learning numpy pandas python scikit-learn

Last synced: 10 Apr 2025

https://github.com/allenai/aspire

Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.

document-similarity information-retrieval machine-learning natural-language-processing

Last synced: 13 Oct 2025

https://github.com/onesuper/HuggingFace-Datasets-Text-Quality-Analysis

Retrieves Parquet files from Hugging Face, then identifies and quantifies junk data, duplication, contamination, and biased content in a dataset using pandas

dataset huggingface-datasets llm machine-learning nlp streamlit text-processing

Last synced: 22 Jul 2025

https://github.com/lddl/cnns

Convolutional Neural Networks in Go

cnn convolutional-neural-networks machine-learning mlp neural-networks

Last synced: 10 Jul 2025

https://github.com/tshrjn/env-zoo

A curated list of reinforcement learning environments and frameworks.

deep-learning machine-learning reinforcement-learning reinforcement-learning-playground

Last synced: 02 Apr 2025

https://github.com/mccorby/photolabellerserver

Federated Learning: Parameter Server doing aggregation of updates to a model coming from clients participating in a Federated Learning setup. See also the Android application companion at https://github.com/mccorby/PhotoLabeller

cifar-10 deeplearning4j dl4j federated-learning kotlin machine-learning

Last synced: 23 Apr 2025
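
The aggregation step that a federated learning parameter server performs can be sketched as a sample-weighted average of client weights (the federated averaging idea). This is a minimal illustration only: the function name `federated_average` and the flat-list weight format are assumptions made for this example, and nothing here is taken from the PhotoLabellerServer code, which may aggregate differently.

```python
def federated_average(client_updates):
    """Average client model weights, weighted by each client's sample count.

    client_updates: list of (weights, num_samples) pairs, where weights is a
    flat list of floats. This format is hypothetical, chosen for illustration.
    """
    total_samples = sum(num_samples for _, num_samples in client_updates)
    if total_samples <= 0:
        raise ValueError("at least one client must contribute samples")

    size = len(client_updates[0][0])
    averaged = [0.0] * size
    for weights, num_samples in client_updates:
        if len(weights) != size:
            raise ValueError("all clients must send weights of the same length")
        # Each client contributes in proportion to how much data it trained on.
        share = num_samples / total_samples
        for i, w in enumerate(weights):
            averaged[i] += w * share
    return averaged
```

For example, `federated_average([([1.0, 2.0], 1), ([3.0, 4.0], 3)])` gives the second client three times the influence of the first.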

https://github.com/s-ol/gpwfc

OpenCL-accelerated Python implementation of the Wave Function Collapse procgen algorithm

algorithm gamedev machine-learning opencl procedural-generation python wfc

Last synced: 21 Mar 2025

https://github.com/googleforgames/clean-chat

Disruptive Behavior Mitigation Framework for Games

game-development machine-learning machine-learning-games multiplayer python

Last synced: 11 Apr 2025

https://github.com/lexiestleszek/namegen

Self-contained, minimalistic implementation of a language model that generates coherent, normal-sounding names. It uses an input dataset of names and a probability distribution over four-character sequences to generate new names.

language-model machine-learning markov-chain name-generation natural-language-processing nlp

Last synced: 30 Oct 2025
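
The idea behind a character-level Markov chain name generator like the one described above can be sketched as follows. This is a hedged illustration, not the namegen code: the function names, the `^` and `$` padding markers, and the choice of a three-character context predicting a fourth character (one reading of "sequences of four characters") are all assumptions.

```python
import random
from collections import defaultdict


def build_model(names, order=3):
    """Map each `order`-character context to the characters that followed it."""
    model = defaultdict(list)
    for name in names:
        # '^' pads the start and '$' marks the end (hypothetical markers).
        padded = "^" * order + name.lower() + "$"
        for i in range(len(padded) - order):
            context = padded[i:i + order]
            model[context].append(padded[i + order])
    return model


def generate(model, order=3, max_len=12, rng=None):
    """Sample one name by repeatedly drawing the next character."""
    rng = rng or random.Random()
    context = "^" * order
    chars = []
    while len(chars) < max_len:
        # Repeated entries in the list make frequent continuations more likely.
        nxt = rng.choice(model[context])
        if nxt == "$":
            break
        chars.append(nxt)
        context = context[1:] + nxt
    return "".join(chars).capitalize()
```

Calling `generate(build_model(["anna", "annie", "hanna"]))` returns a new string assembled from four-character windows seen in the training names.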

https://github.com/dipanjans/adv_nlp_workshop_odsc_europe22

Extensive tutorials for the Advanced NLP Workshop in Open Data Science Conference Europe 2020. We will leverage deep learning and deep transfer learning to solve popular tasks in NLP including Classification, Information Retrieval, Sentiment Analysis, Search Engines, Clustering, Paraphrase Mining, Summarization, Language Translation, Q&A systems

deep-learning gensim huggingface jupyter-notebook machine-learning natural-language-processing python pytorch tensorflow transfer-learning transformers

Last synced: 24 Aug 2025

https://github.com/jeffthompson/word2vecandtsne

Scripts demo-ing how to train a Word2Vec model and reduce its vector space

gensim language machine-learning python sklearn tsne word2vec words

Last synced: 12 May 2025

https://github.com/bhavik-jikadara/ai-ml-roadmap

Welcome to the ultimate guide for starting your journey in Artificial Intelligence and Machine Learning in 2025! This roadmap provides a step-by-step approach to mastering AI and ML, from fundamentals to advanced topics.

artificial-intelligence computer-vision deep-learning deployment fundamentals-of-programming keras libraries machine-learning mathematics mlops natural-language-processing production-code pytorch reinforcement-learning roadmap scikit-learn tensorflow tools

Last synced: 30 Apr 2025

https://github.com/eric-bradford/sdd-gp-mpc

This repository contains the source code for "Stochastic data-driven model predictive control using Gaussian processes" (SDD-GP-MPC).

casadi chemical-engineering constraints differential-equations gaussian-processes machine-learning model-predictive-control monte-carlo-simulation optimization-algorithms python3 state-space-model stochastic-processes

Last synced: 16 Jun 2025

https://github.com/ndrplz/semiparametric

[TPAMI 2020] Generating Novel Views of Vehicles via Semi-parametric Guidance. A semi-parametric approach for synthesizing novel views of a rigid object from a single monocular image.

computer-vision convolutional-neural-networks deep-learning image-synthesis machine-learning novel-viewpoint-synthesis pascal3d semi-parametric semi-parametric-learning

Last synced: 10 Jul 2025

https://github.com/caraml-dev/mlp

A platform for developing and operating machine learning systems at the various stages of the machine learning life cycle.

machine-learning

Last synced: 04 Feb 2026

https://github.com/lukasmosser/stochastic_seismic_waveform_inversion

Official Implementation of "Stochastic seismic waveform inversion using generative adversarial networks as a geological prior"

bayesian-inference generative-adversarial-network geophysics machine-learning

Last synced: 15 Apr 2025

https://github.com/oracle-samples/pgx-samples

Applications using Parallel Graph AnalytiX (PGX) from Oracle Labs

graph graph-algorithms graph-analytics graph-machine-learning machine-learning

Last synced: 07 Apr 2025

https://github.com/bearloga/maltese

Little R utility package for making time series data more machine learning-friendly

forecasting machine-learning r r-package rstats time-series

Last synced: 21 Sep 2025

https://github.com/fkie-cad/comidds

A comprehensive survey of datasets for research in host-based and/or network-based intrusion detection, with a focus on enterprise networks

cybersecurity datasets events intrusion-detection logs machine-learning netflow

Last synced: 06 Mar 2026

https://github.com/fridiculous/django-estimators

A Django app to persist and retrieve scikit-learn machine learning models

django machine-learning scikit-learn

Last synced: 26 Oct 2025

https://github.com/plainerman/variational-doob

Lagrangian formulation of Doob's h-transform allowing for efficient rare event sampling

machine-learning neural-networks transition-paths variational-method

Last synced: 02 Mar 2026

https://github.com/loaiabdalslam/fc

Face enhancer - Denoising autoencoder built with TensorFlow, Keras, and skimage

auto-encoder deep-learning image-processing machine-learning remini tensorflow

Last synced: 16 Oct 2025

https://github.com/wangz10/class_imbalance

Jupyter Notebook presentation for class imbalance in binary classification

classification imbalanced-data machine-learning tutorial

Last synced: 11 May 2025

https://github.com/pietrobarbiero/logic_explained_networks

Logic Explained Networks is a python repository implementing explainable-by-design deep learning models.

deep-learning explainable-ai logic machine-learning neural-networks xai

Last synced: 10 Oct 2025

https://github.com/tusharsarkar3/tla

A comprehensive tool for linguistic analysis of communities

hacktoberfest machine-learning nlp pytorch sentiment-analysis text-classification

Last synced: 14 Apr 2025

https://github.com/faceplugin-ltd/id-document-liveness-detection

The world's 1st completely free, open-source ID Document Liveness Detection SDK, which can detect fake ID cards, Driver Licenses and Passports. Ideal for developers looking for robust fraud-prevention tools.

anti-spoofing deep-learning document-liveness fraud-detection fraud-prevention id-document-liveness identity-verification liveness-detection liveness-detection-sdk machine-learning

Last synced: 03 Aug 2025

https://github.com/cambricon/cnstream

CNStream is a streaming framework for building Cambricon machine learning pipelines. See http://forum.cambricon.com and https://gitee.com/SolutionSDK/CNStream

c-plus-plus cambricon cambricon-cnstream computer-vision graph-framework inference machine-learning mlu pipeline-framework

Last synced: 26 Dec 2025

https://github.com/kalininalab/datasail

DataSAIL is a tool to split datasets while reducing information leakage.

dataset-split ilp ilp-problem machine-learning optimization scip

Last synced: 08 Apr 2026

https://github.com/SAP-samples/btp-ai-sustainability-bootcamp

This GitHub repository contains the sample code and exercises of btp-ai-sustainability-bootcamp, which showcases how to build intelligence and sustainability into your solutions on SAP Business Technology Platform with SAP AI Core and SAP Analytics Cloud for Planning.

computer-vision condition-monitoring deep-learning defect-detection image-segmentation machine-learning predictive-maintenance sac-planning sample sample-code sap-ai-core sap-ai-launchpad sap-analytics-cloud sound-classification sustainability

Last synced: 07 May 2025

https://github.com/ekinakyurek/gan-70-lines-of-julia

A Knet implementation of MLP GAN for MNIST data.

adversarial-networks gan knet machine-learning mlp-gan

Last synced: 13 Apr 2025

https://github.com/sanity/pairadjacentviolators

A JVM implementation of the Pair Adjacent Violators algorithm for isotonic regression

isotonic-regression java jvm-languages kotlin machine-learning regression

Last synced: 20 Jun 2025
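
For readers unfamiliar with the algorithm named above (often written as pool adjacent violators), the core loop can be sketched in a few lines. This is an illustrative sketch under assumptions, not the JVM library's implementation: the function name `isotonic_regression` and the list-based block representation are made up for this example.

```python
def isotonic_regression(y, weights=None):
    """Fit a non-decreasing sequence to y by weighted least squares."""
    if weights is None:
        weights = [1.0] * len(y)

    # Each block holds [weighted mean, total weight, number of points].
    blocks = []
    for value, weight in zip(y, weights):
        blocks.append([float(value), float(weight), 1])
        # Merge backwards while the last two blocks are out of order.
        while len(blocks) > 1 and blocks[-2][0] > blocks[-1][0]:
            mean2, w2, n2 = blocks.pop()
            mean1, w1, n1 = blocks.pop()
            total = w1 + w2
            merged_mean = (mean1 * w1 + mean2 * w2) / total
            blocks.append([merged_mean, total, n1 + n2])

    # Expand each block back to one fitted value per input point.
    fitted = []
    for mean, _, count in blocks:
        fitted.extend([mean] * count)
    return fitted
```

Each out-of-order pair is replaced by its weighted mean, and merging continues backwards until the fitted values are monotone again.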

https://github.com/favstats/perspective

An R wrapper for Conversation AI's Perspective API

machine-learning perspective-api rstats toxic-comment-classification

Last synced: 15 Jan 2026

https://github.com/lsjsj92/keras_basic

Learning the basics of deep learning with Keras

cnn deep-learning deeplearning example keras lstm machine-learning python

Last synced: 07 Mar 2026

https://github.com/jacksonburns/astartes

Better Data Splits for Machine Learning

ai data-science machine-learning ml python sampling

Last synced: 21 Aug 2025

https://github.com/misaogura/mrnet

PyTorch implementation of the MRNet paper, developed for the MRNet Competition hosted by the Stanford ML Group

convolutional-neural-networks deep-learning deep-neural-networks machine-learning paper-implementations pytorch pytorch-implementation

Last synced: 21 Aug 2025

https://github.com/bigd4/PyNEP

A Python interface for NEP

machine-learning python

Last synced: 04 May 2025

https://github.com/andi611/conditional-seqgan-tensorflow

Conditional Sequence Generative Adversarial Network trained with policy gradient, implemented in TensorFlow

chatbot conditional-gan gan machine-learning nlp nlp-machine-learning seqgan tensorflow

Last synced: 13 Apr 2025

https://github.com/Piyushi-0/ACE

Code for our ICML '19 paper: Neural Network Attributions: A Causal Perspective.

attribution-methods causality deep-learning icml machine-learning neural-network

Last synced: 17 Sep 2025

https://github.com/donny-hikari/viola-jones

A face detection program in Python using the Viola-Jones algorithm.

adaboost boosting computer-vision emsembling face-detection haar-cascade machine-learning viola-jones

Last synced: 12 Apr 2025

https://github.com/cyberlife-coder/velesdb

VelesDB is a local-first AI data engine written in Rust that unifies vectors, full-text and graph in a single file with a familiar SQL-like language. Instead of sending every RAG or semantic search query to a remote cluster, VelesDB runs directly on your server, laptop, browser, mobile or edge device: no cloud dependency, no external services, ..

ai ai-memory all-in-one-databse columnstore-database embeddings graph-database hnsw local-first machine-learning rag rust search-engine vector-database

Last synced: 30 Apr 2026

https://github.com/ctuavastlab/jsongrinder.jl

Machine learning with Mill.jl for JSON documents

flux hierarchical-data json julia machine-learning multi-instance-learning

Last synced: 09 Apr 2025

https://github.com/kennethleungty/end-to-end-automl-insurance

An End-to-End Implementation of AutoML with H2O, MLflow, FastAPI, and Streamlit for Insurance Cross-Sell

automl data-science fastapi h2o h2o-automl machine-learning mlflow mlops python streamlit

Last synced: 12 Jul 2025

https://github.com/m-jovanovic/digit-recognizer

Small neural network framework developed in C#, specialized in digit classification (MNIST dataset)

machine-learning mnist-classification neural-networks

Last synced: 29 Jun 2025

https://github.com/ml-tooling/lazycluster

🎛 Distributed machine learning made simple.

cluster dask distributed-computing hyperopt machine-learning python ssh

Last synced: 30 Dec 2025

https://github.com/roboticsclubiitj/ml-dl-implementation

An implementation of ML and DL algorithms from scratch in Python using nothing but NumPy and Matplotlib.

deep-learning hacktoberfest machine-learning matplotlib numpy nwoc python statistics woc

Last synced: 07 May 2025

https://github.com/anthonymrios/multi-label-zero-shot

Few- and Zero-shot Multi-Label Learning for Structured Label Spaces

biomedical-informatics machine-learning natural-language-processing neural-networks

Last synced: 09 Jul 2025

https://github.com/solygambas/mlops-projects

Hands-on MLOps projects to explore and learn the practical aspects of machine learning engineering for production.

docker fastapi google-cloud google-cloud-platform huggingface huggingface-transformers keras kubectl kubeflow kubernetes machine-learning mlops python scikit-learn tensorflow

Last synced: 19 Mar 2025

https://github.com/nasdin/videorecognition-realtime-autotrainer-alerts

State-of-the-art real-time object detection using the YOLOv3 algorithm, augmented with a process that allows easy training of the classifier as a plug-and-play solution. Provides an alert if an item in an alert list is detected.

alerts automatic convolutional-neural-networks darknet deep-learning google-image-search image-processing image-recognition machine-learning object-detection real-time tensorflow video-recognition web-scraper webcam webscraping yolo yolo2 yolov2 yolov2-model

Last synced: 09 Oct 2025

https://github.com/jonathandinu/spark-ray-data-science

Supporting content (slides and exercises) for the Pearson video series covering best practices for developing scalable applications with Spark and Ray in the context of a data scientist's standard workflow.

artificial-intelligence data-science distributed-computing machine-learning python ray spark

Last synced: 08 May 2025

https://github.com/davisidarta/dbmap

A fast, accurate, and modularized dimensionality reduction approach based on diffusion harmonics and graph layouts. Scales to millions of samples on a personal laptop. Adds the intrinsic structure of high-dimensional big data to your clustering and data visualization workflow.

denoising diffusion-process dimensionality-reduction graph-layout high-dimensional machine-learning nearest-neighbors single-cell umap visualization

Last synced: 06 Mar 2026

https://github.com/soniccodes/lucid-v1

Real-time latent world model inference demo

diffusion-models generative-model machine-learning minecraft

Last synced: 13 Jun 2026

https://github.com/davidrosenberg/ttml2021fall

DS-GA 3001: Tools and Techniques for Machine Learning (NYU Fall 2021)

machine-learning

Last synced: 25 Aug 2025