An open API service indexing awesome lists of open source software.

Machine learning

Machine learning is the practice of teaching a computer to learn from data. It uses pattern recognition and other predictive algorithms to make judgments about incoming data. The field is closely related to artificial intelligence and computational statistics.

https://github.com/idsia/sacred

Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.

infrastructure machine-learning mongodb python reproducibility reproducible-research reproducible-science

Last synced: 14 May 2025

https://github.com/DistrictDataLabs/yellowbrick

Visual analysis and diagnostic tools to facilitate machine learning model selection.

anaconda estimator machine-learning matplotlib model-selection python scikit-learn visual-analysis visualization visualizer

Last synced: 14 Mar 2025

https://github.com/andri27-ts/Reinforcement-Learning

Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning

a2c artificial-intelligence deep-learning deep-reinforcement-learning deepmind dqn evolution-strategies machine-learning policy-gradients ppo qlearning reinforcement-learning

Last synced: 15 Mar 2025

https://github.com/salesforce/merlion

Merlion: A Machine Learning Framework for Time Series Intelligence

anomaly-detection automl benchmarking ensemble-learning forecasting machine-learning time-series

Last synced: 13 May 2025

https://github.com/IDSIA/sacred

Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.

infrastructure machine-learning mongodb python reproducibility reproducible-research reproducible-science

Last synced: 18 Apr 2025

https://github.com/JWarmenhoven/ISLR-python

An Introduction to Statistical Learning (James, Witten, Hastie, Tibshirani, 2013): Python code

islr islr-python machine-learning predictive-modeling statistical-learning

Last synced: 14 Mar 2025

https://github.com/openvenues/libpostal

A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.

address address-parser c deduping deduplication international machine-learning natural-language-processing nlp record-linkage

Last synced: 12 May 2025

https://github.com/salesforce/Merlion

Merlion: A Machine Learning Framework for Time Series Intelligence

anomaly-detection automl benchmarking ensemble-learning forecasting machine-learning time-series

Last synced: 26 Mar 2025

https://github.com/vicky002/algowiki

Repository which contains links and resources on different topics of Computer Science.

algorithm artificial-intelligence competitive-programming computer-science html knowledge linux machine-learning

Last synced: 09 Apr 2025

https://github.com/vicky002/AlgoWiki

Repository which contains links and resources on different topics of Computer Science.

algorithm artificial-intelligence competitive-programming computer-science html knowledge linux machine-learning

Last synced: 14 Mar 2025

https://github.com/openmlsys/openmlsys-zh

《Machine Learning Systems: Design and Implementation》- Chinese Version

computer-systems machine-learning software-architecture textbook

Last synced: 22 Mar 2025

https://github.com/azure/machinelearningnotebooks

Python notebooks with ML and deep learning examples with Azure Machine Learning Python SDK | Microsoft

azure azure-machine-learning azure-ml azureml data-science deep-learning machine-learning notebook

Last synced: 13 May 2025

https://github.com/hudson-and-thames/mlfinlab

MlFinLab helps portfolio managers and traders who want to leverage the power of machine learning by providing reproducible, interpretable, and easy-to-use tools.

algorithmic-trading finance financial-machine-learning investing machine-learning portfolio-management portfolio-optimization python quantitative-finance research trading

Last synced: 14 May 2025

https://github.com/rasbt/machine-learning-book

Code Repository for Machine Learning with PyTorch and Scikit-Learn

deep-learning machine-learning neural-networks pytorch scikit-learn

Last synced: 13 May 2025

https://github.com/thunlp/PromptPapers

Must-read papers on prompt-based tuning for pre-trained language models.

ai bert machine-learning nlp pre-trained-language-models prompt prompt-based prompt-learning prompt-toolkit

Last synced: 04 Apr 2025

https://github.com/thunlp/promptpapers

Must-read papers on prompt-based tuning for pre-trained language models.

ai bert machine-learning nlp pre-trained-language-models prompt prompt-based prompt-learning prompt-toolkit

Last synced: 26 Mar 2025

https://github.com/rasbt/pattern_classification

A collection of tutorials and examples for solving and understanding machine learning and pattern classification tasks

machine-learning machine-learning-algorithms pattern-classification

Last synced: 14 May 2025

https://github.com/FedML-AI/FedML

FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on any GPU cloud or on-premise cluster. Built on this library, TensorOpera AI (https://TensorOpera.ai) is your generative AI platform at scale.

ai-agent deep-learning distributed-training edge-ai federated-learning inference-engine machine-learning mlops model-deployment model-serving on-device-training

Last synced: 04 Apr 2025

https://github.com/hill-a/stable-baselines

A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

baselines data-science gym machine-learning openai python reinforcement-learning reinforcement-learning-algorithms toolbox

Last synced: 26 Mar 2025

https://github.com/rust-ml/linfa

A Rust machine learning framework.

algorithms machine-learning rust scientific-computing

Last synced: 14 May 2025

https://github.com/nvidia/digits

Deep Learning GPU Training System

caffe deep-learning gpu machine-learning torch

Last synced: 20 Mar 2025

https://github.com/NVIDIA/DIGITS

Deep Learning GPU Training System

caffe deep-learning gpu machine-learning torch

Last synced: 14 Mar 2025

https://github.com/Azure/MachineLearningNotebooks

Python notebooks with ML and deep learning examples with Azure Machine Learning Python SDK | Microsoft

azure azure-machine-learning azure-ml azureml data-science deep-learning machine-learning notebook

Last synced: 26 Mar 2025

https://github.com/py-why/econml

ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its goals is to build a toolkit that combines state-of-the-art machine learning techniques with econometrics in order to bring automation to complex causal inference problems. To date, the ALICE Python SDK (econml) implements orthogonal machine learning algorithms such as the double machine learning work of Chernozhukov et al. This toolkit is designed to measure the causal effect of some treatment variable(s) t on an outcome variable y, controlling for a set of features x.

causal-inference causality econometrics economics machine-learning treatment-effects

Last synced: 14 May 2025

https://github.com/tensorzero/tensorzero

TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.

ai ai-engineering anthropic artificial-intelligence deep-learning genai generative-ai gpt large-language-models llama llm llmops llms machine-learning ml ml-engineering mlops openai python rust

Last synced: 12 May 2025

https://github.com/kermitt2/grobid

Machine learning software for extracting information from scholarly documents

bibliographical-references crf deep-learning fulltext hamburger-to-cow machine-learning metadata pdf rnn scientific-articles transformers

Last synced: 12 May 2025

https://github.com/xviniette/FlappyLearning

A program that learns to play Flappy Bird through machine learning (neuroevolution)

flappybird machine-learning neuroevolution

Last synced: 30 Mar 2025

https://github.com/xviniette/flappylearning

A program that learns to play Flappy Bird through machine learning (neuroevolution)

flappybird machine-learning neuroevolution

Last synced: 13 Apr 2025

https://github.com/MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

book courses machine-learning reinforcement-learning tutorials

Last synced: 28 Nov 2024

https://github.com/py-why/EconML

ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its goals is to build a toolkit that combines state-of-the-art machine learning techniques with econometrics in order to bring automation to complex causal inference problems. To date, the ALICE Python SDK (econml) implements orthogonal machine learning algorithms such as the double machine learning work of Chernozhukov et al. This toolkit is designed to measure the causal effect of some treatment variable(s) t on an outcome variable y, controlling for a set of features x.

causal-inference causality econometrics economics machine-learning treatment-effects

Last synced: 26 Mar 2025

https://github.com/spotify/basic-pitch

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

audio lightweight machine-learning midi music pitch-detection polyphonic python transcription typescript

Last synced: 13 May 2025

https://github.com/arXivTimes/arXivTimes

Repository for researching and sharing machine learning articles

arxivtimes computer-vision machine-learning natural-language-processing reinforcement-learning

Last synced: 08 Apr 2025

https://github.com/arxivtimes/arxivtimes

Repository for researching and sharing machine learning articles

arxivtimes computer-vision machine-learning natural-language-processing reinforcement-learning

Last synced: 23 Mar 2025

https://github.com/yahoo/TensorFlowOnSpark

TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.

cluster featured machine-learning python scala spark tensorflow yahoo

Last synced: 24 Mar 2025

https://github.com/yahoo/tensorflowonspark

TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.

cluster featured machine-learning python scala spark tensorflow yahoo

Last synced: 13 May 2025

https://github.com/fedml-ai/fedml

FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on any GPU cloud or on-premise cluster. Built on this library, TensorOpera AI (https://TensorOpera.ai) is your generative AI platform at scale.

ai-agent deep-learning distributed-training edge-ai federated-learning inference-engine machine-learning mlops model-deployment model-serving on-device-training

Last synced: 08 May 2025

https://github.com/tarrysingh/artificial-intelligence-deep-learning-machine-learning-tutorials

A comprehensive list of Deep Learning / Artificial Intelligence and Machine Learning tutorials - rapidly expanding into areas of AI/Deep Learning / Machine Vision / NLP and industry specific areas such as Climate / Energy, Automotives, Retail, Pharma, Medicine, Healthcare, Policy, Ethics and more.

artificial-intelligence aws capsule-network convolutional-neural-networks deep-learning ipython-notebook kaggle keras lua machine-learning matplotlib neural-network pandas python python-data pytorch scikit-learn tensorflow tensorflow-tutorials torch

Last synced: 13 May 2025

https://github.com/TarrySingh/Artificial-Intelligence-Deep-Learning-Machine-Learning-Tutorials

A comprehensive list of Deep Learning / Artificial Intelligence and Machine Learning tutorials - rapidly expanding into areas of AI/Deep Learning / Machine Vision / NLP and industry specific areas such as Climate / Energy, Automotives, Retail, Pharma, Medicine, Healthcare, Policy, Ethics and more.

artificial-intelligence aws capsule-network convolutional-neural-networks deep-learning ipython-notebook kaggle keras lua machine-learning matplotlib neural-network pandas python python-data pytorch scikit-learn tensorflow tensorflow-tutorials torch

Last synced: 18 Apr 2025

https://github.com/googlecreativelab/teachable-machine-v1

Explore how machine learning works, live in the browser. No coding required.

machine-learning teachable-machine

Last synced: 18 Jan 2025

https://github.com/google-deepmind/dm_control

Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

artificial-intelligence deep-learning machine-learning mujoco neural-networks physics-simulation reinforcement-learning

Last synced: 01 Apr 2025

https://github.com/ravenscroftj/turbopilot

Turbopilot is an open source large-language-model based code completion engine that runs locally on CPU

code-completion cpp language-model machine-learning

Last synced: 17 Jan 2025

https://github.com/xitu/tensorflow-docs

Chinese edition of the latest official TensorFlow documentation

documentation machine-learning tensorflow

Last synced: 15 May 2025

https://github.com/deepchecks/deepchecks

Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling you to thoroughly test your data and models from research to production.

data-drift data-science data-validation deep-learning html-report jupyter-notebook machine-learning ml mlops model-monitoring model-validation pandas-dataframe python pytorch

Last synced: 16 May 2025

https://github.com/lucidrains/stylegan2-pytorch

Simplest working implementation of StyleGAN2, a state-of-the-art generative adversarial network, in PyTorch. Enabling everyone to experience disentanglement.

artificial-intelligence generative-adversarial-network generative-model machine-learning pytorch

Last synced: 12 May 2025

https://github.com/apachecn/hands-on-ml-zh

[Translation] A practical guide to machine learning with Sklearn and TensorFlow [copyright issue; the website has been taken offline]

book deep-learning machine-learning python sklearn tensorflow

Last synced: 19 Jan 2025

https://github.com/justmarkham/scikit-learn-videos

Jupyter notebooks from the scikit-learn video series

data-science jupyter-notebook machine-learning python scikit-learn tutorial

Last synced: 14 May 2025

https://github.com/bytedance/byteps

A high performance and generic framework for distributed DNN training

deep-learning distributed-training keras machine-learning mxnet pytorch tensorflow

Last synced: 14 May 2025

https://github.com/benfred/implicit

Fast Python Collaborative Filtering for Implicit Feedback Datasets

collaborative-filtering machine-learning matrix-factorization recommendation recommendation-system recommender-system

Last synced: 13 May 2025

https://github.com/promptslab/promptify

Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research

chatgpt chatgpt-api chatgpt-python gpt-3 gpt-3-prompts gpt-4 gpt-4-api gpt3-library large-language-models machine-learning nlp openai prompt-engineering prompt-toolkit prompt-tuning prompt-versioning prompting prompts promptversioning transformers

Last synced: 13 May 2025

https://github.com/huggingface/speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT4-o

ai assistant language-model machine-learning python speech speech-synthesis speech-to-text speech-translation

Last synced: 31 Dec 2024

https://github.com/alibaba/alink

Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.

apriori classification clustering data-mining feature-engineering flink flink-machine-learning flink-ml fm graph-algorithms graph-embedding kafka machine-learning recommender recommender-system regression statistics word2vec xgboost

Last synced: 14 May 2025

https://github.com/chiphuyen/python-is-cool

Cool Python features for machine learning that I used to be too afraid to use. Will be updated as I have more time / learn more.

advanced-python data-science machine-learning python-tutorials python3

Last synced: 04 Apr 2025

https://github.com/alibaba/Alink

Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.

apriori classification clustering data-mining feature-engineering flink flink-machine-learning flink-ml fm graph-algorithms graph-embedding kafka machine-learning recommender recommender-system regression statistics word2vec xgboost

Last synced: 14 Mar 2025

https://github.com/priorlabs/tabpfn

⚡ TabPFN: Foundation Model for Tabular Data ⚡

data-science foundation-models machine-learning tabpfn tabular-data

Last synced: 11 May 2025

https://github.com/ploomber/ploomber

The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️

data-engineering data-science jupyter jupyter-notebooks machine-learning mlops notebooks papermill pipelines pycharm vscode workflow

Last synced: 29 Apr 2025

https://github.com/pair-code/lit

The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.

machine-learning natural-language-processing visualization

Last synced: 12 May 2025