An open API service indexing awesome lists of open source software.

Machine learning

Machine learning is the practice of teaching a computer to learn. The concept uses pattern recognition, as well as other forms of predictive algorithms, to make judgments on incoming data. This field is closely related to artificial intelligence and computational statistics.

https://github.com/jeffthompson/word2vecandtsne

Scripts demo-ing how to train a Word2Vec model and reduce its vector space

gensim language machine-learning python sklearn tsne word2vec words

Last synced: 12 May 2025

https://github.com/codingbeard/tfkg

Create, train, and save Tensorflow Keras models all in Golang

deep-learning go golang gpu-acceleration keras machine-learning tensorflow tf

Last synced: 11 Sep 2025

https://github.com/kaggledatasets/kaggledatasets

Collection of Kaggle Datasets ready to use for Everyone (Looking for contributors)

data-science datasets deep-learning kaggle keras machine-learning python pytorch scikit-learn tensorflow

Last synced: 20 Jun 2025

https://github.com/dls5-omics/multimolecule

Accelerate Molecular Biology Research with Machine Learning

ai4science machine-learning molecular-biology

Last synced: 28 Feb 2026

https://github.com/allenai/aspire

Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.

document-similarity information-retrieval machine-learning natural-language-processing

Last synced: 13 Oct 2025

https://github.com/rickiepark/intro_ml_with_python_2nd_revised

<파이썬 라이브러리를 활용한 머신러닝 (번역개정2판)>의 코드 저장소

machine-learning numpy pandas python scikit-learn

Last synced: 10 Apr 2025

https://github.com/thecodrr/vspeech

📢 Complete V bindings for Mozilla's DeepSpeech TensorFlow based Speech-to-Text library. 📜

deepspeech machine-learning mozilla speech-to-text tensorflow v

Last synced: 06 Mar 2026

https://github.com/icanerdogan/google-mlkit-android-apps

This repository contains a collection of Android applications developed using Google ML Kit, demonstrating the power and versatility of machine learning features in mobile development. Each project is crafted using Java and Kotlin, showcasing various use cases and practical implementations.

android android-app android-application barcode-scanner face-detection firebase google google-api image-labeling java java-android kotlin kotlin-android machine-learning mlkit mlkit-face-detection mlkit-pose-detection mlkit-text-recognition pose-detection text-recognition

Last synced: 28 Apr 2025

https://github.com/praktiskt/featuretoolsr

An R interface to the Python module Featuretools

feature-engineering featuretools machine-learning r-package rstats

Last synced: 11 Mar 2026

https://github.com/madhurimarawat/semester-notes

A comprehensive, well-structured repository of B.Tech (Hons) CSE notes and learning resources, specializing in Artificial Intelligence and Data Science. Includes semester-wise notes, question papers, curated study guides, and indexed materials designed for efficient learning, revision, and academic reference.

artificial-intelligence btech-notes computer-networks computer-organization-architecture computer-science cse-notes data-science data-visualization database-management-system engineering-mathematics engineering-notes learning-resources machine-learning object-oriented-programming operating-systems probability-and-statistics python-for-data-science semester-notes study-materials theory-of-computation

Last synced: 07 Mar 2026

https://github.com/eric-bradford/sdd-gp-mpc

This repository contains the source code for "Stochastic data-driven model predictive control using Gaussian processes" (SDD-GP-MPC).

casadi chemical-engineering constraints differential-equations gaussian-processes machine-learning model-predictive-control monte-carlo-simulation optimization-algorithms python3 state-space-model stochastic-processes

Last synced: 16 Jun 2025

https://github.com/benedekrozemberczki/FEATHER

The reference implementation of FEATHER from the CIKM '20 paper "Characteristic Functions on Graphs: Birds of a Feather, from Statistical Descriptors to Parametric Models".

data-mining deep-learning deep-neural-networks deepwalk graph graph-classification graph-convolution graph-embedding graph-kernel graph2vec machine-learning network-embedding networkx neural-network node-classification node-embedding node2vec pytorch representation-learning tensorflow

Last synced: 17 Apr 2025

https://github.com/bhavik-jikadara/ai-ml-roadmap

Welcome to the ultimate guide for starting your journey in Artificial Intelligence and Machine Learning in 2025! This roadmap provides a step-by-step approach to mastering AI and ML, from fundamentals to advanced topics.

artificial-intelligence computer-vision deep-learning deployment fundamentals-of-programming keras libraries machine-learning mathematics mlops natural-language-processing production-code pytorch reinforcement-learning roadmap scikit-learn tensorflow tools

Last synced: 30 Apr 2025

https://github.com/benedekrozemberczki/feather

The reference implementation of FEATHER from the CIKM '20 paper "Characteristic Functions on Graphs: Birds of a Feather, from Statistical Descriptors to Parametric Models".

data-mining deep-learning deep-neural-networks deepwalk graph graph-classification graph-convolution graph-embedding graph-kernel graph2vec machine-learning network-embedding networkx neural-network node-classification node-embedding node2vec pytorch representation-learning tensorflow

Last synced: 11 Apr 2025

https://github.com/takashiishida/pconf

[NeurIPS 2018] Code for the paper "Binary Classification from Positive-Confidence Data"

deep-learning machine-learning

Last synced: 11 Apr 2025

https://github.com/d4l3k/go-bayesopt

A library for doing Bayesian Optimization using Gaussian Processes (blackbox optimizer) in Go/Golang.

bayesianoptimization bayesopt blackbox-optimizer gaussian-processes go hyperparameter-optimization machine-learning optimization

Last synced: 12 Apr 2025

https://github.com/metatensor/metatrain

Train, fine-tune, and manipulate machine learning models for atomistic systems

atomistic-simulations machine-learning molecular-dynamics torch

Last synced: 12 Jan 2026

https://github.com/lexiestleszek/namegen

Self-contained, minimalistic implementation of a language model that generates coherent and normal sounding names. It uses an input dataset of names and probability distribution to generate new names based on the sequences of four characters.

language-model machine-learning markov-chain name-generation natural-language-processing nlp

Last synced: 30 Oct 2025

https://github.com/the-database/trainner-redux

Deep learning training framework for image super resolution and restoration.

deep-learning image-restoration machine-learning neural-network python pytorch super-resolution upscale

Last synced: 20 Mar 2025

https://github.com/alanlaboratory/unrealmlagents

The Unreal ML Agents Toolkit is an open-source project that enables Unreal Engine games and simulations to serve as environments for training intelligent agents using deep reinforcement learning. This project is a port of Unity ML-Agents, adapted to work within Unreal Engine.

artificial-intelligence deep-learning deep-reinforcement-learning machine-learning neural-network reinforcement-learning unreal-engine unreal-engine-5 unreal-engine-plugin

Last synced: 11 Sep 2025

https://github.com/mccorby/PhotoLabellerServer

Federated Learning: Parameter Server doing aggregation of updates to a model coming from clients participating in a Federated Learning setup. See also the Android application companion at https://github.com/mccorby/PhotoLabeller

cifar-10 deeplearning4j dl4j federated-learning kotlin machine-learning

Last synced: 10 May 2025

https://github.com/masatoi/cl-online-learning

A collection of machine learning algorithms for online linear classification written in Common Lisp

classifier common-lisp machine-learning perceptron

Last synced: 20 Jan 2026

https://github.com/onesuper/HuggingFace-Datasets-Text-Quality-Analysis

Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in dataset using pandas

dataset huggingface-datasets llm machine-learning nlp streamlit text-processing

Last synced: 22 Jul 2025

https://github.com/mccorby/photolabellerserver

Federated Learning: Parameter Server doing aggregation of updates to a model coming from clients participating in a Federated Learning setup. See also the Android application companion at https://github.com/mccorby/PhotoLabeller

cifar-10 deeplearning4j dl4j federated-learning kotlin machine-learning

Last synced: 23 Apr 2025

https://github.com/ndrplz/semiparametric

[TPAMI 2020] Generating Novel Views of Vehicles via Semi-parametric Guidance. A semi-parametric approach for synthesizing novel views of a rigid object from a single monocular image.

computer-vision convolutional-neural-networks deep-learning image-synthesis machine-learning novel-viewpoint-synthesis pascal3d semi-parametric semi-parametric-learning

Last synced: 10 Jul 2025

https://github.com/SAP-samples/btp-ai-sustainability-bootcamp

This github repository contains the sample code and exercises of btp-ai-sustainability-bootcamp, which showcases how to build Intelligence and Sustainability into Your Solutions on SAP Business Technology Platform with SAP AI Core and SAP Analytics Cloud for Planning.

computer-vision condition-monitoring deep-learning defect-detection image-segmentation machine-learning predictive-maintenance sac-planning sample sample-code sap-ai-core sap-ai-launchpad sap-analytics-cloud sound-classification sustainability

Last synced: 07 May 2025

https://github.com/wangz10/class_imbalance

Jupyter Notebook presentation for class imbalance in binary classification

classification imbalanced-data machine-learning tutorial

Last synced: 11 May 2025

https://github.com/fkie-cad/comidds

A comprehensive survey of datasets for research in host-based and/or network-based intrusion detection, with a focus on enterprise networks

cybersecurity datasets events intrusion-detection logs machine-learning netflow

Last synced: 06 Mar 2026

https://github.com/oracle-samples/pgx-samples

Applications using Parallel Graph AnalytiX (PGX) from Oracle Labs

graph graph-algorithms graph-analytics graph-machine-learning machine-learning

Last synced: 07 Apr 2025

https://github.com/donny-hikari/viola-jones

A face detection program in python using Viola-Jones algorithm.

adaboost boosting computer-vision emsembling face-detection haar-cascade machine-learning viola-jones

Last synced: 12 Apr 2025

https://github.com/misaogura/mrnet

PyTorch implementation of the MRNet paper, developed for the MRNet Competition hosted by the Stanford ML Group

convolutional-neural-networks deep-learning deep-neural-networks machine-learning paper-implementations pytorch pytorch-implementation

Last synced: 21 Aug 2025

https://github.com/andi611/conditional-seqgan-tensorflow

Conditional Sequence Generative Adversarial Network trained with policy gradient, Implementation in Tensorflow

chatbot conditional-gan gan machine-learning nlp nlp-machine-learning seqgan tensorflow

Last synced: 13 Apr 2025

https://github.com/plainerman/variational-doob

Lagrangian formulation of Doob's h-transform allowing for efficient rare event sampling

machine-learning neural-networks transition-paths variational-method

Last synced: 02 Mar 2026

https://github.com/ctuavastlab/jsongrinder.jl

Machine learning with Mill.jl for JSON documents

flux hierarchical-data json julia machine-learning multi-instance-learning

Last synced: 09 Apr 2025

https://github.com/kennethleungty/end-to-end-automl-insurance

An End-to-End Implementation of AutoML with H2O, MLflow, FastAPI, and Streamlit for Insurance Cross-Sell

automl data-science fastapi h2o h2o-automl machine-learning mlflow mlops python streamlit

Last synced: 12 Jul 2025

https://github.com/caraml-dev/mlp

A platform for developing and operating the machine learning systems at the various stages of machine learning life cycle.

machine-learning

Last synced: 04 Feb 2026

https://github.com/praktiskt/featuretoolsR

An R interface to the Python module Featuretools

feature-engineering featuretools machine-learning r-package rstats

Last synced: 13 Jul 2025

https://github.com/jacksonburns/astartes

Better Data Splits for Machine Learning

ai data-science machine-learning ml python sampling

Last synced: 21 Aug 2025

https://github.com/lsjsj92/keras_basic

keras를 이용한 딥러닝 기초 학습

cnn deep-learning deeplearning example keras lstm machine-learning python

Last synced: 07 Mar 2026

https://github.com/kalininalab/datasail

DataSAIL is a tool to split datasets while reducing information leakage.

dataset-split ilp ilp-problem machine-learning optimization scip

Last synced: 08 Apr 2026

https://github.com/loaiabdalslam/fc

Face enhancer‏ - Denoising Auto Encoder by Tensorflow and Keras and skimage

auto-encoder deep-learning image-processing machine-learning remini tensorflow

Last synced: 16 Oct 2025

https://github.com/Piyushi-0/ACE

Code for our ICML '19 paper: Neural Network Attributions: A Causal Perspective.

attribution-methods causality deep-learning icml machine-learning neural-network

Last synced: 17 Sep 2025

https://github.com/bearloga/maltese

Little R utility package for making time series data more machine learning-friendly

forecasting machine-learning r r-package rstats time-series

Last synced: 21 Sep 2025

https://github.com/cambricon/cnstream

CNStream is a streaming framework for building Cambricon machine learning pipelines http://forum.cambricon.com https://gitee.com/SolutionSDK/CNStream

c-plus-plus cambricon cambricon-cnstream computer-vision graph-framework inference machine-learning mlu pipeline-framework

Last synced: 26 Dec 2025

https://github.com/lukasmosser/stochastic_seismic_waveform_inversion

Official Implementation of "Stochastic seismic waveform inversion using generative adversarial networks as a geological prior"

bayesian-inference generative-adversarial-network geophysics machine-learning

Last synced: 15 Apr 2025

https://github.com/tusharsarkar3/tla

A comprehensive tool for linguistic analysis of communities

hacktoberfest machine-learning nlp pytorch sentiment-analysis text-classification

Last synced: 14 Apr 2025

https://github.com/roboticsclubiitj/ml-dl-implementation

An implementation of ML and DL algorithms from scratch in python using nothing but NumPy and Matplotlib.

deep-learning hacktoberfest machine-learning matplotlib numpy nwoc python statistics woc

Last synced: 07 May 2025

https://github.com/sanity/pairadjacentviolators

A JVM implementation of the Pair Adjacent Violators algorithm for isotonic regression

isotonic-regression java jvm-languages kotlin machine-learning regression

Last synced: 20 Jun 2025

https://github.com/LdDl/cnns

Convolutional Neural Networks in Go

cnn convolutional-neural-networks machine-learning mlp neural-networks

Last synced: 08 Apr 2025

https://github.com/bigd4/PyNEP

A python interface of NEP

machine-learning python

Last synced: 04 May 2025

https://github.com/favstats/perspective

An R wrapper for Conversation AI's Perspective API

machine-learning perspective-api rstats toxic-comment-classification

Last synced: 15 Jan 2026

https://github.com/faceplugin-ltd/id-document-liveness-detection

The world's 1st completely free, open-source ID Document Liveness Detection SDK which can detect fake ID cards, Driver Licenses and Passports.Ideal for developers looking for robust, fraud-prevention tools.

anti-spoofing deep-learning document-liveness fraud-detection fraud-prevention id-document-liveness identity-verification liveness-detection liveness-detection-sdk machine-learning

Last synced: 03 Aug 2025

https://github.com/ekinakyurek/gan-70-lines-of-julia

A Knet implementation of MLP GAN for MNIST data.

adversarial-networks gan knet machine-learning mlp-gan

Last synced: 13 Apr 2025

https://github.com/m-jovanovic/digit-recognizer

Small neural network framework developed in C#, specialized in digit classification (MNIST dataset)

machine-learning mnist-classification neural-networks

Last synced: 29 Jun 2025

https://github.com/pietrobarbiero/logic_explained_networks

Logic Explained Networks is a python repository implementing explainable-by-design deep learning models.

deep-learning explainable-ai logic machine-learning neural-networks xai

Last synced: 10 Oct 2025

https://github.com/cyberlife-coder/velesdb

VelesDB is a local‑first AI data engine written in Rust that unifies vectors, full‑text and graph in a single file with a familiar SQL‑like language. Instead of sending every RAG or semantic search query to a remote cluster, VelesDB runs directly on your server, laptop, browser, mobile or edge device — no cloud dependency, no external services, ..

ai ai-memory all-in-one-databse columnstore-database embeddings graph-database hnsw local-first machine-learning rag rust search-engine vector-database

Last synced: 30 Apr 2026

https://github.com/fridiculous/django-estimators

a django app to persist and retrieve scikit learn machine learning models

django machine-learning scikit-learn

Last synced: 26 Oct 2025

https://github.com/ml-tooling/lazycluster

🎛 Distributed machine learning made simple.

cluster dask distributed-computing hyperopt machine-learning python ssh

Last synced: 30 Dec 2025

https://github.com/solygambas/mlops-projects

Hands-on MLOps projects to explore and learn the practical aspects of machine learning engineering for production.

docker fastapi google-cloud google-cloud-platform huggingface huggingface-transformers keras kubectl kubeflow kubernetes machine-learning mlops python scikit-learn tensorflow

Last synced: 19 Mar 2025

https://github.com/nasdin/videorecognition-realtime-autotrainer-alerts

State of the art object detection in real-time using YOLOV3 algorithm. Augmented with a process that allows easy training of the classifier as a plug & play solution . Provides alert if an item in an alert list is detected.

alerts automatic convolutional-neural-networks darknet deep-learning google-image-search image-processing image-recognition machine-learning object-detection real-time tensorflow video-recognition web-scraper webcam webscraping yolo yolo2 yolov2 yolov2-model

Last synced: 09 Oct 2025

https://github.com/jonathandinu/spark-ray-data-science

Supporting content (slides and exercises) for the Pearson video series covering best practices for developing scalable applications with Spark and Ray in the context of a data scientist's standard workflow.

artificial-intelligence data-science distributed-computing machine-learning python ray spark

Last synced: 08 May 2025

https://github.com/soniccodes/lucid-v1

realtime latent world model inference demo

diffusion-models generative-model machine-learning minecraft

Last synced: 13 Jun 2026

https://github.com/vzhong/e3

Dockerized code for E3: Entailment-driven Extracting and Editing for Conversational Machine Reading.

deep-learning machine-learning nlp

Last synced: 12 Apr 2025

https://github.com/upul/machine-learning-algorithms-from-scratch

A collection of commonly used machine learning algorithms implemented in Python/Numpy

implementation ipython-notebook machine-learning pyhton3 tutorial

Last synced: 28 Oct 2025

https://github.com/smith42/astroddpm

A denoising diffusion probabilistic model synthesises galaxies that are qualitatively and physically indistinguishable from the real thing.

astronomy galaxies machine-learning

Last synced: 27 Oct 2025

https://github.com/visweswaran1998/sklearn

Trying to implement Scikit Learn for Python in C++ (Single Headers and No dependencies)

machine-learning

Last synced: 31 Oct 2025

https://github.com/testdrivenio/fastapi-ml

deploying an ML model to Heroku with FastAPI

fastapi fastapi-docker fbprophet heroku-deployment machine-learning

Last synced: 04 Apr 2025

https://github.com/junlulocky/airticketpredicting

Machine Learning modeling for Air Ticket Predicting

air-ticket-price-prediction machine-learning machine-learning-modeling

Last synced: 12 Apr 2025