An open API service indexing awesome lists of open source software.

Machine learning

Machine learning is the practice of teaching a computer to learn. The concept uses pattern recognition, as well as other forms of predictive algorithms, to make judgments on incoming data. This field is closely related to artificial intelligence and computational statistics.

https://github.com/synsense/sinabs

A deep learning library for spiking neural networks which is based on PyTorch, focuses on fast training and supports inference on neuromorphic hardware.

machine-learning pytorch snn spiking-neural-networks

Last synced: 17 Jan 2026

https://github.com/manjillama/facial-recognition-python-django

Face detection and facial recognition, with information about each recognized person fetched from a database.

django machine-learning opencv python2 sklearn

Last synced: 30 Sep 2025

https://github.com/intel/MLSL

Intel(R) Machine Learning Scaling Library is a library providing an efficient implementation of communication patterns used in deep learning.

artificial-intelligence deep-learning distributed intel machine-learning mlsl mpi

Last synced: 26 Mar 2025

https://github.com/okrasolar/pytorch-timeseries

PyTorch implementations of neural networks for timeseries classification

classification deep-learning machine-learning pytorch time-series

Last synced: 26 Apr 2025

https://github.com/microsoft/eureka-ml-insights

A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.

ai artificial-intelligence evaluation-framework llm machine-learning mllm

Last synced: 05 Apr 2025

https://github.com/jrzaurin/ml-pipeline

Using Kafka-Python to illustrate an ML production pipeline

hyperopt hyperparameter-optimization kafka-python lightgbm machine-learning mlflow

Last synced: 14 Apr 2025

https://github.com/senwu/emmental

A deep learning framework for building multimodal multi-task learning systems.

machine-learning multi-task-learning multimodality

Last synced: 10 Apr 2025

https://github.com/aetros/aetros-cli

AETROS CLI + SDK. Command line application to manage/monitor machine learning training in AETROS Trainer

deep-learning machine-learning tensorflow theano

Last synced: 09 May 2025

https://github.com/tidy-finance/website

This repository hosts the source code for the website tidy-finance.org

asset-pricing dplyr finance ggplot2 machine-learning numpy pandas plotnine sqlite tidyverse

Last synced: 16 Jan 2026

https://github.com/DFKI-NLP/TRE

[AKBC 19] Improving Relation Extraction by Pre-trained Language Representations

information-extraction machine-learning multi-task-learning nlp relation-extraction transformer

Last synced: 31 Mar 2025

https://github.com/wassname/viz_torch_optim

Videos of deep learning optimizers moving on 3D problem-landscapes

3d-graphics deep-learning learning-rate machine-learning optimization-algorithms optimizer pytorch vizualisation

Last synced: 11 Oct 2025

https://github.com/henzler/neuraltexture

Learning a Neural 3D Texture Space from 2D Exemplars [CVPR 2020]

computer-graphics computer-vision deep-learning machine-learning texture-synthesis

Last synced: 10 Oct 2025

https://github.com/chrismattmann/tika-similarity

Tika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.

clustering cosine-distance cosine-similarity information-retrieval jaccard-similarity machine-learning metadata-features python similarity-score tika tika-python tika-similarity

Last synced: 02 May 2025

https://github.com/codeplaysoftware/portdnn

portDNN is a library implementing neural network algorithms written using SYCL

cplusplus cpp cpp11 gpgpu machine-learning neural-network opencl sycl

Last synced: 07 Apr 2025

https://github.com/takuti/flurs

:ocean: FluRS: A Python library for streaming recommendation algorithms

data-science factorization-machines machine-learning matrix-factorization python recommender-system

Last synced: 18 Aug 2025

https://github.com/happy-machine/fastql

Spin up a super fast Rust powered GraphQL API to prototype your ML model in one line of Python code.

ai aiart generative-art graphql graphql-server machine-learning python rust

Last synced: 27 Oct 2025

https://github.com/seetaresearch/dragon

A Computation Graph Virtual Machine based ML Framework

deep-learning machine-learning python pytorch tensorflow

Last synced: 07 May 2025

https://github.com/auto-flow/ultraopt

Distributed asynchronous hyperparameter optimization, better than HyperOpt.

automl bayesian-optimization blackbox-optimization hyperopt hyperparameter-optimization machine-learning multi-fidelity optimization python

Last synced: 04 Apr 2026

https://github.com/julialogging/tensorboardlogger.jl

Easy peasy logging to TensorBoard with Julia

julia logging machine-learning tensorboard

Last synced: 12 Apr 2025

https://github.com/hep-lbdl/CaloGAN

Generative Adversarial Networks for High Energy Physics extended to a multi-layer calorimeter simulation

atlas calogan calorimeter cern deep-learning gan generative-adversarial-network hep high-energy-physics machine-learning physics

Last synced: 27 Mar 2025

https://github.com/doughtmw/HoloLens2-Machine-Learning

Using deep learning models for image classification directly on the HoloLens 2.

efficientnet hololens2 machine-learning

Last synced: 29 Apr 2025

https://github.com/ibis-project/ibis-ml

IbisML is a library for building scalable ML pipelines using Ibis.

feature-engineering ibis machine-learning sql

Last synced: 12 Apr 2025

https://github.com/Superzchen/iLearnPlus

iLearnPlus is the first machine-learning platform with both graphical and web-based user interfaces that enables the construction of automated machine-learning pipelines for computational analysis and predictions using nucleic acid and protein sequences.

automated-modelling bioinformatics-tool biomedical-data-analytics deep-learning feature-selection machine-learning prediction python sequence-analysis

Last synced: 21 Jul 2025

https://github.com/HuantWang/FUNDED_NISL

FUNDED is a novel learning framework for building vulnerability detection models.

datacollection graphneuralnetwork machine-learning vulnerability-detection

Last synced: 07 May 2025

https://github.com/furkan-gulsen/sport-with-ai

The human body is detected with the MediaPipe library. The number of exercise repetitions performed is then counted using mathematical methods.

ai artificial-intelligence computer-vision deep-learning image-processing keras machine-learning mediapipe python python3 sport tensorflow

Last synced: 16 Oct 2025

https://github.com/NITRO-AI/NitroFE

NitroFE is a Python feature engineering engine which provides a variety of modules designed to internally save past dependent values for providing continuous calculation.

feature feature-engineering features indicator indicators machine-learning time-series timeseries

Last synced: 19 Jul 2025

https://github.com/happy-machine/FastQL

Spin up a super fast Rust powered GraphQL API to prototype your ML model in one line of Python code.

ai aiart generative-art graphql graphql-server machine-learning python rust

Last synced: 29 Mar 2025

https://github.com/kanyun-inc/ytk-mp4j

Ytk-mp4j is a fast, user-friendly, cross-platform, multi-process, multi-thread collective message passing Java library which includes gather, scatter, allgather, reduce-scatter, broadcast, reduce, and allreduce communications for distributed machine learning.

allreduce broadcast machine-learning messaging-library mpi openmp reduce

Last synced: 06 May 2025

https://github.com/khurramjaved96/incremental-learning

PyTorch implementation of the ACCV18 paper "Revisiting Distillation and Incremental Classifier Learning."

convolutional-neural-networks distillation incremental-learning machine-learning paper-implementations pytorch

Last synced: 08 May 2025

https://github.com/thorben-frank/mlff

Build neural networks for machine learning force fields with JAX

deep-learning force-fields machine-learning molecular-dynamics

Last synced: 04 May 2025

https://github.com/kabirkhan/recon

Recon NER: debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality of your data.

machine-learning model-insights natural-language-processing ner

Last synced: 17 Aug 2025

https://github.com/GustikS/NeuraLogic

Deep relational learning through differentiable logic programming.

deep-learning differentiable-programming logic-programming machine-learning relational-learning

Last synced: 10 Jan 2026

https://github.com/formlio/forml

ForML - A development framework and MLOps platform for the lifecycle management of data science projects

ai data-science machine-learning ml mlops portability python reproducibility

Last synced: 08 May 2025

https://github.com/ramhiser/datamicroarray

A collection of small-sample, high-dimensional microarray data sets to assess machine-learning algorithms and models.

cancer colon-cancer high-dimensional-data machine-learning r

Last synced: 22 Jun 2025

https://github.com/olow304/data-science-machine-learning

The objective of this toolkit is to provide a free collection of data analysis and machine learning material that is specifically suited for doing data science. Its purpose is to get you started in a matter of minutes. You can run this collection either in a Jupyter notebook or in plain Python.

all best-practices cheatsheet cheatsheets data-science data-science-toolkit deep-learning jupyter-notebook machine-learning machine-learning-algorithms machine-learning-tutorials matplotlib mindmap numpy pandas popular-posts python roadmap sklearn toolkit

Last synced: 24 Oct 2025

https://github.com/kohjingyu/search-agents

Code for the paper 🌳 Tree Search for Language Model Agents

agents llms machine-learning

Last synced: 02 Feb 2026

https://github.com/doccano/doccano-transformer

The official tool for transforming doccano format into common dataset formats.

annotation conll dataset doccano machine-learning natural-language-processing

Last synced: 07 May 2025

https://github.com/AlexIoannides/pymc-example-project

Example PyMC3 project for performing Bayesian data analysis using a probabilistic programming approach to machine learning.

bayesian-data-analysis bayesian-inference data-science machine-learning numpy pandas probabilistic-programming pymc3 python scikit-learn

Last synced: 19 Jul 2025

https://github.com/victorqribeiro/budget

A simple budget app that predicts where expenses are being made

app budget budget-manager budgeting javascript k-nearest-neighbours knn machine-learning machine-learning-algorithms

Last synced: 05 May 2025

https://github.com/dmitryryumin/emnlp-2023-papers

EMNLP 2023 Papers: Explore cutting-edge research from EMNLP 2023, the premier conference for advancing empirical methods in natural language processing. Stay updated on the latest in machine learning, deep learning, and natural language processing with code included. :star: support NLP!

bert computational-linguistics emnlp emnlp2023 gpt language-models llms machine-learning machine-translation multilingual-nlp named-entity-recognition natural-language-processing ner nlp nlp-applications sentiment-analysis syntax-and-semantics text-mining transformers word-embeddings

Last synced: 12 Apr 2025

https://github.com/knodle/knodle

A PyTorch-based open-source framework that provides methods for improving weakly annotated data and allows researchers to efficiently develop and compare their own methods.

ai classification denoising-methods knodle machine-learning natural-language-procressing python pytorch relation-extraction snorkel weak-supervision weakly-supervised-learning

Last synced: 14 Mar 2025

https://github.com/qq547276542/labelmarker

A small tool for marking training set labels in machine learning tasks (crowdsourcing).

crowdsourcing django machine-learning marking tool

Last synced: 23 Jul 2025

https://github.com/scisharp/scisharp

SciSharp STACK is focused on building tools for Machine Learning development.

dotnet machine-learning scisharp

Last synced: 12 Jun 2025

https://github.com/charliegerard/whoosh

[Prototype] Control a 3D spaceship with hand movements

creative-coding javascript machine-learning tensorflow tensorflowjs threejs

Last synced: 14 Apr 2025

https://github.com/kaiko-ai/eva

Evaluation framework for oncology foundation models (FMs)

evaluation-framework foundation-models machine-learning oncology

Last synced: 24 Dec 2025

https://github.com/hukenovs/hh_research

Automated search and analysis of job vacancies from the hh.ru (HeadHunter) site using Python. Data classification and computation of statistical parameters.

api data-mining development headhunter http json jupyter-notebook machine-learning matplotlib nltk-python numpy pandas parser python research salary statistics

Last synced: 12 Mar 2026

https://github.com/machine-learning-tokyo/ai-ml-newsletter

AI Digest: Monthly updates on AI and ML topics

deep-learning machine-learning

Last synced: 03 Oct 2025

https://github.com/vroomai/vst

🎹 Generate sounds from words. Directly in your DAW.

audio generative-art machine-learning vst vst3

Last synced: 16 Mar 2025

https://github.com/aershov24/machine-learning-ds-interview-questions

🔴 1704 Machine Learning, Data Science & Python Interview Questions (ANSWERED) To Kill Your Next ML & DS Interview. Get All Answers + PDFs on MLStack.Cafe.

algorithms-and-data-structures data-analysis data-science interview-practice interview-preparation interview-questions machine-learning machine-learning-algorithms machinelearning

Last synced: 17 Aug 2025

https://github.com/mc2-project/secure-xgboost

Secure collaborative training and inference for XGBoost.

collaborative-learning data-science enclave machine-learning privacy security xgboost

Last synced: 17 Jan 2026

https://github.com/clojurenlp/core

Clojure wrapper for the Stanford CoreNLP Java library

clojure machine-learning natural-language-processing parsing

Last synced: 13 Apr 2025

https://github.com/explosion/spacy-lookups-data

📂 Additional lookup tables and data resources for spaCy

lemmatization machine-learning natural-language-processing nlp spacy

Last synced: 15 May 2025

https://github.com/hazyresearch/reef

Automatically labeling training data

machine-learning stanford synthesis weakly-supervised-learning

Last synced: 21 Jul 2025

https://github.com/brannondorsey/keras_weight_animator

Save keras weight matrices as short animated videos during training

deep-learning keras machine-learning neural-network video visualization

Last synced: 03 May 2025

https://github.com/raiyanyahya/prompt

🥝 A command line application to interact with OpenAI's ChatGPT API.

ai chatgpt chatgpt-api cli developer-tools machine-learning python

Last synced: 17 Jan 2026

https://github.com/archsyscall/convnets-tensorflow2

⛵️ Implementation of a variety of popular image classification models using TensorFlow 2. [ResNet, GoogLeNet, VGG, Inception-v3, Inception-v4, MobileNet, MobileNet-v2, ShuffleNet, ShuffleNet-v2, etc.]

deep-learning googlenet inception-v3 inception-v4 machine-learning mobilenet mobilenet-v2 resnet shufflenet shufflenet-v2 tensorflow vgg

Last synced: 05 May 2025

https://github.com/aminehorseman/images-web-crawler

This package is a complete tool for creating a large dataset of images (designed especially, but not only, for machine learning enthusiasts). It can crawl the web, download images, rename, resize, and convert the images, and merge folders.

crawler dataset dataset-creation flickr-api google-images-crawler google-images-downloader image-classification image-dataset image-processing images machine-learning

Last synced: 07 Oct 2025

https://github.com/alexioannides/pymc-example-project

Example PyMC3 project for performing Bayesian data analysis using a probabilistic programming approach to machine learning.

bayesian-data-analysis bayesian-inference data-science machine-learning numpy pandas probabilistic-programming pymc3 python scikit-learn

Last synced: 05 Jul 2025

https://github.com/juliaml/tabletransforms.jl

Transforms and pipelines with tabular data in Julia

data-science machine-learning pipelines statistics table transforms

Last synced: 05 May 2026

https://scisharp.github.io/SciSharp/

SciSharp STACK is focused on building tools for Machine Learning development.

dotnet machine-learning scisharp

Last synced: 02 Apr 2025

https://github.com/luigibonati/mlcolvar

A unified framework for machine learning collective variables for enhanced sampling simulations

collective-variables data-driven enhanced-sampling machine-learning python

Last synced: 04 May 2025

https://github.com/edouardpoitras/nowtrade

Python library for backtesting technical/mechanical strategies in the stock and currency markets

algorithmic-trading-library currency machine-learning neural-network python random-forest stock technical-indicators trading

Last synced: 16 Mar 2025

https://github.com/surrealdb/surrealml

A machine learning library for Python and Rust, for PyTorch, Tensorflow and SKLearn models

artificial-intelligence artificial-intelligence-framework database machine-learning machine-learning-library python-ml rust-ml surreal surrealdb surrealml

Last synced: 02 Jul 2025

https://github.com/dssg/MLforPublicPolicy

Class resources for CAPP 30254 (Machine Learning for Public Policy)

data-science machine-learning public-policy

Last synced: 15 Mar 2025

https://github.com/softwaremill/lemon-dataset

Lemons quality control dataset

dataset lemonade machine-learning segmentation

Last synced: 07 Feb 2026

https://github.com/edouardpoitras/NowTrade

Python library for backtesting technical/mechanical strategies in the stock and currency markets

algorithmic-trading-library currency machine-learning neural-network python random-forest stock technical-indicators trading

Last synced: 04 May 2025

https://github.com/mit-ccc/tweebanknlp

[LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweebank-NER dataset

dependency-parser lemmatization machine-learning named-entity-recognition natural-language-processing ner nlp-toolkit pos-tagging text-annotation tokenization tweet-analysis twitter-nlp

Last synced: 11 May 2025

https://github.com/coqui-ai/coqpit

Simple (but maybe too simple) config management through Python data classes. We use it for machine learning.

config-management dataclasses json machine-learning python python-data serialization typing yaml

Last synced: 05 Apr 2025

https://github.com/project-monai/monai-deploy-app-sdk

MONAI Deploy App SDK offers a framework and associated tools to design, develop and verify AI-driven applications in the healthcare imaging domain.

ai deep-learning deploy dicom healthcare image-processing machine-learning medical-imaging ml ml-infrastructure ml-platform mlops model-deployment model-serving monai pipeline python pytorch workflow

Last synced: 16 May 2025

https://github.com/kb22/Color-Identification-using-Machine-Learning

This project explores colors in various images and then enables the user to query the images based on a given color.

classification color-identification machine-learning

Last synced: 01 Aug 2025

https://github.com/openmined/sympc

An SMPC companion library for Syft

cryptography machine-learning mpc privacy python torch

Last synced: 02 Jul 2025

https://github.com/garethjns/kaggle-eeg

Seizure prediction from EEG data using machine learning. 3rd place solution for Kaggle/Uni Melbourne seizure prediction competition.

eeg kaggle kaggle-competition machine-learning matlab melbourne-university seizure-prediction svm tree-ensemble

Last synced: 12 May 2025