An open API service indexing awesome lists of open source software.

Machine learning

Machine learning is the practice of teaching a computer to learn. The concept uses pattern recognition, as well as other forms of predictive algorithms, to make judgments on incoming data. This field is closely related to artificial intelligence and computational statistics.

https://github.com/spotify/basic-pitch-ts

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection.

audio lightweight machine-learning midi music pitch-detection polyphonic transcription

Last synced: 04 Apr 2025

https://github.com/zae-bayern/elpv-dataset

A dataset of functional and defective solar cells extracted from EL images of solar modules

computer-vision machine-learning photovoltaic solar-cells solar-energy

Last synced: 07 May 2025

https://github.com/dwhitena/gophernet

A simple from-scratch neural net written in Go

artificial-intelligence data-science go golang machine-learning neural-network

Last synced: 13 Sep 2025

https://github.com/huggingface/data-is-better-together

Let's build better datasets, together!

community datasets human-feedback machine-learning

Last synced: 14 Oct 2025

https://github.com/morphl-ai/morphl-community-edition

MorphL Community Edition uses big data and machine learning to predict user behaviors in digital products and services with the end goal of increasing KPIs (click-through rates, conversion rates, etc.) through personalization

artificial-intelligence cassandra conversion-rate-optimization data-driven-design front-end-development hadoop-hdfs kubernetes machine-learning morphl-platform pipeline product-development pyspark user-experience

Last synced: 13 Apr 2025

https://github.com/wizardforcel/data-science-notebook

:book: 每一个伟大的思想和行动都有一个微不足道的开始

data-analysis data-science machine-learning notebook numpy pandas sklearn tensorflow

Last synced: 10 Apr 2025

https://github.com/orktes/go-torch

LibTorch (PyTorch) bindings for Golang

golang golang-library machine-learning pytorch

Last synced: 13 Apr 2025

https://github.com/open-mmlab/mmeval

A unified evaluation library for multiple machine learning libraries

machine-learning metrics python pytorch tensorflow

Last synced: 04 Apr 2025

https://github.com/wujian16/Cornell-MOE

A Python library for the state-of-the-art Bayesian optimization algorithms, with the core implemented in C++.

bayesian bayesian-optimization gaussian-processes hyperparameter-optimization machine-learning optimization

Last synced: 01 May 2025

https://github.com/mlr-org/mlr3book

Online version of Bischl, B., Sonabend, R., Kotthoff, L., & Lang, M. (Eds.). (2024). "Applied Machine Learning Using mlr3 in R". CRC Press.

book bookdown machine-learning mlr3 r

Last synced: 16 May 2025

https://github.com/timschopf/keyphrasevectorizers

Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a document-keyphrase matrix.

keyphrase-extraction keyword-extraction machine-learning natural-language-processing nlp part-of-speech python vectorizer

Last synced: 10 Apr 2025

https://github.com/jeff1evesque/machine-learning

Web-interface + rest API for classification and regression (https://jeff1evesque.github.io/machine-learning.docs)

d3js flask machine-learning mariadb mongodb puppet python reactjs

Last synced: 07 Apr 2025

https://github.com/gcorso/torsional-diffusion

Implementation of Torsional Diffusion for Molecular Conformer Generation (NeurIPS 2022)

chemistry conformer-generator equivariance geometry machine-learning molecules neurips-2022 torsion-angles

Last synced: 07 Apr 2025

https://github.com/lucasxlu/LagouJob

Data Analysis & Mining for lagou.com

data-analysis data-mining lagou machine-learning nlp python3 web-crawler

Last synced: 18 Jul 2025

https://github.com/hollance/coreml-survival-guide

Source code for the book Core ML Survival Guide

coreml deep-learning ios machine-learning macos swift

Last synced: 05 May 2025

https://github.com/johnolafenwa/torchfusion

A modern deep learning framework built to accelerate research and development of AI systems

convolutional-neural-networks deep-learning gan machine-learning neural-network python pytorch visualization

Last synced: 05 Apr 2025

https://github.com/gmihaila/ml_things

This is where I put things I find useful that speed up my work with Machine Learning. Ever looked in your old projects to reuse those cool functions you created before? Well, this repo is designed to be a Python Library of functions I created in my previous project that can be reused. I also share some Notebooks Tutorials and Python Code Snippets.

google-colab machine-learning nlp nlp-machine-learning notebooks python-snippets pytorch snippets transformer

Last synced: 05 Apr 2025

https://github.com/johnolafenwa/TorchFusion

A modern deep learning framework built to accelerate research and development of AI systems

convolutional-neural-networks deep-learning gan machine-learning neural-network python pytorch visualization

Last synced: 19 Jul 2025

https://github.com/hui-po-wang/Real-Time-Facial-Expression-Recognition-with-DeepLearning

A real-time facial expression recognition system with webcam streaming and CNN

deep-learning machine-learning

Last synced: 09 May 2025

https://github.com/kengoa/fantasy-basketball

Scraping statistics, predicting NBA player performance with neural networks and boosting algorithms, and optimising lineups for Draft Kings with genetic algorithm

draftkings fantasy-basketball fantasy-lineup fantasy-sports machine-learning nba-prediction nba-statistics sports-analytics

Last synced: 07 Apr 2025

https://github.com/zhiningliu1998/self-paced-ensemble

[ICDE'20] ⚖️ A general, efficient ensemble framework for imbalanced classification. | 泛用,高效,鲁棒的类别不平衡学习框架

class-imbalance classification ensemble ensemble-learning ensemble-methods ensemble-model imbalance-classification imbalanced-data imbalanced-learn imbalanced-learning machine-learning pypi python3

Last synced: 05 Apr 2025

https://github.com/lil-lab/nlvr

Cornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual input and a sentence describing it, and is annotated with the truth-value of the sentence.

computer-vision corpus machine-learning natural-language-processing

Last synced: 02 May 2025

https://github.com/joelowj/machine-learning-and-reinforcement-learning-in-finance

Machine Learning and Reinforcement Learning in Finance New York University Tandon School of Engineering

coursera finance machine-learning python reinforcement-learning scikit-learn tensorflow tensorflow-examples

Last synced: 08 Mar 2026

https://github.com/fidelity/mabwiser

[IJAIT 2021] MABWiser: Contextual Multi-Armed Bandits Library

contextual-bandits machine-learning multi-armed-bandits non-parametric-bandits parametric-bandits recsys

Last synced: 09 Mar 2026

https://github.com/amzn/xfer

Transfer Learning library for Deep Neural Networks.

machine-learning mxnet neural-network python transfer-learning

Last synced: 05 Apr 2025

https://github.com/prakhar21/50-Days-of-ML

A day to day plan for this challenge (50 Days of Machine Learning) . Covers both theoretical and practical aspects

100daysofcode 100daysofmlcode dataprocessing deep-learning deep-neural-networks machine-learning pandas python siraj-raval tutorial

Last synced: 14 Apr 2025

https://github.com/jupyter-guide/ten-rules-jupyter

Ten Simple Rules for Writing and Sharing Computational Analyses in Jupyter Notebooks

binder jupyter jupyter-lab jupyter-notebook machine-learning reproducible-research

Last synced: 03 Apr 2025

https://github.com/bluefog-lib/bluefog

Distributed and decentralized training framework for PyTorch over graph

asynchronous decentralized deeplearning distributed-computing machine-learning mpi nccl one-sided pytorch

Last synced: 18 Feb 2026

https://github.com/clovaai/generative-evaluation-prdc

Code base for the precision, recall, density, and coverage metrics for generative models. ICML 2020.

deep-learning diversity evaluation evaluation-metrics fidelity generative-adversarial-network generative-model icml icml-2020 icml2020 machine-learning precision recall

Last synced: 09 Apr 2025

https://github.com/robbyzhaox/myocr

A highly extensible and customizable framework for building OCR systems.

ai cv machine-learning ocr

Last synced: 13 Jun 2025

https://github.com/iancovert/sage

For calculating global feature importance using Shapley values.

explainability interpretability machine-learning shapley

Last synced: 26 Mar 2025

https://github.com/analysiscenter/cardio

CardIO is a library for data science research of heart signals

data-science deep-learning deep-neural-networks healthcare machine-learning python

Last synced: 21 Jan 2026

https://github.com/ShifuML/shifu

An end-to-end machine learning and data mining framework on Hadoop

bigdata end-to-end-machine-learning gbdt hadoop machine-learning neural-network pipeline random-forest shifu

Last synced: 17 Jan 2026

https://github.com/strands-labs/ai-functions

Python functions powered by AI agents - with runtime post-conditions for reliable agentic workflows.

agentic agentic-ai ai genai llm machine-learning python strands-agents strands-labs

Last synced: 14 Apr 2026

https://github.com/tirthajyoti/uci-ml-api

Simple API for UCI Machine Learning Dataset Repository (search, download, analyze)

api classification clustering data-science learning machine-learning python regression statistics uci-machine-learning

Last synced: 09 Apr 2025

https://github.com/igashov/DiffLinker

DiffLinker: Equivariant 3D-Conditional Diffusion Model for Molecular Linker Design

diffusion-models drug-design equivariance fragment-based-drug-discovery machine-learning molecular-linker

Last synced: 14 May 2026

https://github.com/twitter-research/image-crop-analysis

Code for reproducing our analysis in the paper titled: Image Cropping on Twitter: Fairness Metrics, their Limitations, and the Importance of Representation, Design, and Agency

bias computer-vision fairness fairness-ml image-processing machine-learning research

Last synced: 27 Mar 2025

https://github.com/shifuml/shifu

An end-to-end machine learning and data mining framework on Hadoop

bigdata end-to-end-machine-learning gbdt hadoop machine-learning neural-network pipeline random-forest shifu

Last synced: 05 Apr 2025

https://github.com/aws-solutions/media-insights-on-aws

A serverless framework to accelerate the development of applications that discover next-generation insights in your video, audio, text, and image resources by utilizing AWS Machine Learning and Media services.

aws machine-learning serverless-framework video-processing

Last synced: 16 May 2025

https://github.com/healthcatalyst/healthcareai-r

R tools for healthcare machine learning

healthcare machine-learning r

Last synced: 04 Oct 2025

https://github.com/BiomedSciAI/histocartography

A standardized Python API with necessary preprocessing, machine learning and explainability tools to facilitate graph-analytics in computational pathology.

deep-learning graph-neural-networks healthcare machine-learning pathology pytorch

Last synced: 06 May 2025

https://github.com/juliadiff/differentiationinterface.jl

An interface to various automatic differentiation backends in Julia.

autodiff automatic-differentiation differentiation julia machine-learning

Last synced: 15 May 2025

https://github.com/duburcqa/jiminy

Jiminy: a fast and portable Python/C++ simulator of poly-articulated robots with OpenAI Gym interface for reinforcement learning

c-plus-plus machine-learning openai-gym python robotics simulator

Last synced: 15 Apr 2025

https://github.com/biomedsciai/histocartography

A standardized Python API with necessary preprocessing, machine learning and explainability tools to facilitate graph-analytics in computational pathology.

deep-learning graph-neural-networks healthcare machine-learning pathology pytorch

Last synced: 02 Jan 2026

https://github.com/IBM/transition-amr-parser

SoTA Abstract Meaning Representation (AMR) parsing with word-node alignments in Pytorch. Includes checkpoints and other tools such as statistical significance Smatch.

abstract-meaning-representation amr amr-graphs amr-parser amr-parsing machine-learning nlp semantic-parsing

Last synced: 28 Apr 2025

https://github.com/tim-salzmann/l4casadi

Use PyTorch Models with CasADi and Acados in Python, C(++) or Matlab

acados casadi cplusplus deep-learning learning-control machine-learning optimization optimization-algorithms python pytorch

Last synced: 05 Apr 2025

https://github.com/yujiabao/distributional-signatures

"Few-shot Text Classification with Distributional Signatures" ICLR 2020

few-shot-learning iclr2020 machine-learning text-classification

Last synced: 01 May 2026

https://github.com/MLWhiz/data_science_blogs

A repository to keep track of all the code that I end up writing for my blog posts.

blogging chatbot data datascience gan graphs machine-learning mcmc python spark streamlit time-series xgboost

Last synced: 05 May 2025

https://github.com/mlwhiz/data_science_blogs

A repository to keep track of all the code that I end up writing for my blog posts.

blogging chatbot data datascience gan graphs machine-learning mcmc python spark streamlit time-series xgboost

Last synced: 06 Apr 2025

https://github.com/SageMindAI/autogen-agi

AutoGen AGI: Advancing AI agents using AutoGen towards AGI capabilities. Explore cutting-edge enhancements in group chat dynamics, decision-making, and complex task proficiency. Join our journey in shaping AI's future!

agi ai autogen machine-learning

Last synced: 15 Oct 2025

https://github.com/fancompute/neuroptica

Flexible simulation package for optical neural networks

machine-learning nanophotonics neural-network optics photonics

Last synced: 21 Jan 2026

https://github.com/alshedivat/keras-gp

Keras + Gaussian Processes: Learning scalable deep and recurrent kernels.

gaussian-processes keras machine-learning neural-networks tensorflow theano

Last synced: 07 Sep 2025

https://github.com/boredbird/woe

Tools for WoE Transformation mostly used in ScoreCard Model for credit rating

credit-scoring iv machine-learning scorecard woe

Last synced: 19 Apr 2025

https://github.com/jaswinder9051998/zoofs

zoofs is a python library for performing feature selection using a variety of nature-inspired wrapper algorithms. The algorithms range from swarm-intelligence to physics-based to Evolutionary. It's easy to use , flexible and powerful tool to reduce your feature size.

evolutionary-algorithms feature-selection genetic-algorithm grey-wolf grey-wolf-optimizer machine-learning machine-learning-algorithms machinelearning optimization optimization-algorithms optimization-methods optimization-tools particle-swarm particle-swarm-optimization python subset-selection supervised-learning

Last synced: 21 Oct 2025

https://github.com/appeler/ethnicolr

Predict Race and Ethnicity Based on the Sequence of Characters in a Name

ethnicity lstm machine-learning names race

Last synced: 06 Mar 2026

https://github.com/Oxen-AI/oxen-archive

Deprecated: We moved this to Oxen-AI/Oxen core

artificial-intelligence data-science database machine-learning version-control

Last synced: 29 Aug 2025

https://github.com/rdk/p2rank

P2Rank: Protein-ligand binding site prediction tool based on machine learning. Stand-alone command line program / Java library for predicting ligand binding pockets from protein structure.

binding-sites bioinformatics drug-discovery groovy java ligand machine-learning mmcif molecular-structures p2rank pdb protein-ligand-docking protein-ligand-interactions protein-structure protein-surface proteins pymol random-forest structural-bioinformatics virtual-screening

Last synced: 12 Apr 2025