Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Machine learning

Machine learning is the practice of teaching a computer to learn. The concept uses pattern recognition, as well as other forms of predictive algorithms, to make judgments on incoming data. This field is closely related to artificial intelligence and computational statistics.

https://github.com/marimo-team/marimo

A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.

artificial-intelligence data-science data-visualization developer-tools machine-learning notebooks pipeline python reactive web-app

Last synced: 31 Jul 2024

https://github.com/rapidsai/cuml

cuML - RAPIDS Machine Learning Library

cuda gpu machine-learning machine-learning-algorithms nvidia

Last synced: 31 Jul 2024

https://github.com/arXivTimes/arXivTimes

repository to research & share the machine learning articles

arxivtimes computer-vision machine-learning natural-language-processing reinforcement-learning

Last synced: 01 Aug 2024

https://github.com/yahoo/tensorflowonspark

TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.

cluster featured machine-learning python scala spark tensorflow yahoo

Last synced: 24 Sep 2024

https://github.com/yahoo/TensorFlowOnSpark

TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.

cluster featured machine-learning python scala spark tensorflow yahoo

Last synced: 31 Jul 2024

https://github.com/googlecreativelab/teachable-machine-v1

Explore how machine learning works, live in the browser. No coding required.

machine-learning teachable-machine

Last synced: 30 Jul 2024

https://github.com/ravenscroftj/turbopilot

Turbopilot is an open source large-language-model based code completion engine that runs locally on CPU

code-completion cpp language-model machine-learning

Last synced: 30 Jul 2024

https://github.com/hudson-and-thames/mlfinlab

MlFinLab helps portfolio managers and traders who want to leverage the power of machine learning by providing reproducible, interpretable, and easy to use tools.

algorithmic-trading finance financial-machine-learning investing machine-learning portfolio-management portfolio-optimization python quantitative-finance research trading

Last synced: 31 Jul 2024

https://github.com/apachecn/hands-on-ml-zh

:book: [译] Sklearn 与 TensorFlow 机器学习实用指南【版权问题,网站已下线!!】

book deep-learning machine-learning python sklearn tensorflow

Last synced: 01 Aug 2024

https://github.com/tarrysingh/artificial-intelligence-deep-learning-machine-learning-tutorials

A comprehensive list of Deep Learning / Artificial Intelligence and Machine Learning tutorials - rapidly expanding into areas of AI/Deep Learning / Machine Vision / NLP and industry specific areas such as Climate / Energy, Automotives, Retail, Pharma, Medicine, Healthcare, Policy, Ethics and more.

artificial-intelligence aws capsule-network convolutional-neural-networks deep-learning ipython-notebook kaggle keras lua machine-learning matplotlib neural-network pandas python python-data pytorch scikit-learn tensorflow tensorflow-tutorials torch

Last synced: 24 Sep 2024

https://github.com/openmlsys/openmlsys-zh

《Machine Learning Systems: Design and Implementation》- Chinese Version

computer-systems machine-learning software-architecture textbook

Last synced: 31 Jul 2024

https://github.com/TarrySingh/Artificial-Intelligence-Deep-Learning-Machine-Learning-Tutorials

A comprehensive list of Deep Learning / Artificial Intelligence and Machine Learning tutorials - rapidly expanding into areas of AI/Deep Learning / Machine Vision / NLP and industry specific areas such as Climate / Energy, Automotives, Retail, Pharma, Medicine, Healthcare, Policy, Ethics and more.

artificial-intelligence aws capsule-network convolutional-neural-networks deep-learning ipython-notebook kaggle keras lua machine-learning matplotlib neural-network pandas python python-data pytorch scikit-learn tensorflow tensorflow-tutorials torch

Last synced: 01 Aug 2024

https://github.com/lucidrains/stylegan2-pytorch

Simplest working implementation of Stylegan2, state of the art generative adversarial network, in Pytorch. Enabling everyone to experience disentanglement

artificial-intelligence generative-adversarial-network generative-model machine-learning pytorch

Last synced: 01 Aug 2024

https://github.com/justmarkham/scikit-learn-videos

Jupyter notebooks from the scikit-learn video series

data-science jupyter-notebook machine-learning python scikit-learn tutorial

Last synced: 24 Sep 2024

https://github.com/bytedance/byteps

A high performance and generic framework for distributed DNN training

deep-learning distributed-training keras machine-learning mxnet pytorch tensorflow

Last synced: 24 Sep 2024

https://github.com/luodian/otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

artificial-inteligence chatgpt deep-learning embodied-ai foundation-models gpt-4 instruction-tuning large-scale-models machine-learning multi-modality visual-language-learning

Last synced: 24 Sep 2024

https://github.com/py-why/EconML

ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its goals is to build a toolkit that combines state-of-the-art machine learning techniques with econometrics in order to bring automation to complex causal inference problems. To date, the ALICE Python SDK (econml) implements orthogonal machine learning algorithms such as the double machine learning work of Chernozhukov et al. This toolkit is designed to measure the causal effect of some treatment variable(s) t on an outcome variable y, controlling for a set of features x.

causal-inference causality econometrics economics machine-learning treatment-effects

Last synced: 31 Jul 2024

https://github.com/microsoft/EconML

ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its goals is to build a toolkit that combines state-of-the-art machine learning techniques with econometrics in order to bring automation to complex causal inference problems. To date, the ALICE Python SDK (econml) implements orthogonal machine learning algorithms such as the double machine learning work of Chernozhukov et al. This toolkit is designed to measure the causal effect of some treatment variable(s) t on an outcome variable y, controlling for a set of features x.

causal-inference causality econometrics economics machine-learning treatment-effects

Last synced: 31 Jul 2024

https://github.com/google-deepmind/dm_control

Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

artificial-intelligence deep-learning machine-learning mujoco neural-networks physics-simulation reinforcement-learning

Last synced: 01 Aug 2024

https://github.com/deepmind/dm_control

Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

artificial-intelligence deep-learning machine-learning mujoco neural-networks physics-simulation reinforcement-learning

Last synced: 01 Aug 2024

https://github.com/rust-ml/linfa

A Rust machine learning framework.

algorithms machine-learning rust scientific-computing

Last synced: 31 Jul 2024

https://github.com/Luodian/Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

artificial-inteligence chatgpt deep-learning embodied-ai foundation-models gpt-4 instruction-tuning large-scale-models machine-learning multi-modality visual-language-learning

Last synced: 29 Jul 2024

https://github.com/alibaba/Alink

Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.

apriori classification clustering data-mining feature-engineering flink flink-machine-learning flink-ml fm graph-algorithms graph-embedding kafka machine-learning recommender recommender-system regression statistics word2vec xgboost

Last synced: 30 Jul 2024

https://github.com/benfred/implicit

Fast Python Collaborative Filtering for Implicit Feedback Datasets

collaborative-filtering machine-learning matrix-factorization recommendation recommendation-system recommender-system

Last synced: 31 Jul 2024

https://github.com/ploomber/ploomber

The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️

data-engineering data-science jupyter jupyter-notebooks machine-learning mlops notebooks papermill pipelines pycharm vscode workflow

Last synced: 24 Sep 2024

https://github.com/chiphuyen/python-is-cool

Cool Python features for machine learning that I used to be too afraid to use. Will be updated as I have more time / learn more.

advanced-python data-science machine-learning python-tutorials python3

Last synced: 30 Jul 2024

https://github.com/tensorflow/hub

A library for transfer learning by reusing parts of TensorFlow models.

embeddings image-classification machine-learning ml python tensorflow transfer-learning

Last synced: 24 Sep 2024

https://github.com/fastai/course-nlp

A Code-First Introduction to NLP course

data-science machine-learning nlp python

Last synced: 31 Jul 2024

https://github.com/PAIR-code/lit

The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.

machine-learning natural-language-processing visualization

Last synced: 01 Aug 2024

https://github.com/pathwaycom/llm-app

Dynamic RAG for enterprise. Ready to run with Docker,⚡in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.

chatbot hugging-face llm llm-local llm-prompting llm-security llmops machine-learning open-ai pathway rag real-time retrieval-augmented-generation vector-database vector-index

Last synced: 31 Jul 2024

https://github.com/pair-code/lit

The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.

machine-learning natural-language-processing visualization

Last synced: 24 Sep 2024

https://pair-code.github.io/lit/

The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.

machine-learning natural-language-processing visualization

Last synced: 31 Jul 2024

https://github.com/deepchecks/deepchecks

Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.

data-drift data-science data-validation deep-learning html-report jupyter-notebook machine-learning ml mlops model-monitoring model-validation pandas-dataframe python pytorch

Last synced: 24 Sep 2024

https://github.com/POSTECH-CVLab/PyTorch-StudioGAN

StudioGAN is a Pytorch library providing implementations of representative Generative Adversarial Networks (GANs) for conditional/unconditional image generation.

biggan clean-fid data-efficient-gan-training deep-learning generative-adversarial-network machine-learning pytorch stylegan2 stylegan2-ada stylegan3

Last synced: 01 Aug 2024

https://github.com/rasbt/machine-learning-book

Code Repository for Machine Learning with PyTorch and Scikit-Learn

deep-learning machine-learning neural-networks pytorch scikit-learn

Last synced: 24 Sep 2024

https://github.com/microsoft/hummingbird

Hummingbird compiles trained ML models into tensor computation for faster inference.

machine-learning neural-networks pytorch scikit-learn tensor-computation

Last synced: 24 Sep 2024

https://github.com/guillaume-chevalier/LSTM-Human-Activity-Recognition

Human Activity Recognition example using TensorFlow on smartphone sensors dataset and an LSTM RNN. Classifying the type of movement amongst six activity categories - Guillaume Chevalier

activity-recognition deep-learning human-activity-recognition lstm machine-learning neural-network recurrent-neural-networks rnn tensorflow

Last synced: 31 Jul 2024

https://github.com/salesforce/Merlion

Merlion: A Machine Learning Framework for Time Series Intelligence

anomaly-detection automl benchmarking ensemble-learning forecasting machine-learning time-series

Last synced: 31 Jul 2024

https://github.com/eto-ai/lance

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, with more integrations coming..

apache-arrow computer-vision data-analysis data-analytics data-centric data-format data-science dataops deep-learning duckdb embeddings llms machine-learning mlops python rust

Last synced: 02 Aug 2024

https://github.com/lancedb/lance

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, with more integrations coming..

apache-arrow computer-vision data-analysis data-analytics data-centric data-format data-science dataops deep-learning duckdb embeddings llms machine-learning mlops python rust

Last synced: 31 Jul 2024

https://github.com/jmschrei/pomegranate

Fast, flexible and easy to use probabilistic modelling in Python.

machine-learning probabilistic-graphical-models python pytorch

Last synced: 30 Jul 2024

https://github.com/higgsfield-ai/higgsfield

Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters

cluster-management deep-learning distributed llama llama2 llm machine-learning mlops pytorch

Last synced: 01 Aug 2024

https://github.com/pashpashpash/vault-ai

OP Vault ChatGPT: Give ChatGPT long-term memory using the OP Stack (OpenAI + Pinecone Vector Database). Upload your own custom knowledge base files (PDF, txt, epub, etc) using a simple React frontend.

ai artificial-intelligence chatgpt generative go golang knowledge-base long-term-memory machine-learning openai openai-api pdf-support pinecone qdrant-vector-database question-answering react reactjs vector-search

Last synced: 31 Jul 2024

https://github.com/scisharp/tensorflow.net

.NET Standard bindings for Google's TensorFlow for developing, training and deploying Machine Learning models in C# and F#.

chatbot csharp deep-learning dotnetcore keras machine-learning scisharp tensorflow

Last synced: 24 Sep 2024

https://github.com/dair-ai/ML-Notebooks

:fire: Machine Learning Notebooks

ai deep-learning machine-learning python pytorch

Last synced: 01 Aug 2024

https://github.com/louisfb01/best_AI_papers_2022

A curated list of the latest breakthroughs in AI (in 2022) by release date with a clear video explanation, link to a more in-depth article, and code.

2022 ai artificial-intelligence computer-science computer-vision deep-learning innovation machine-learning machinelearning neural-network paper papers python sota state-of-art state-of-the-art technology

Last synced: 30 Jul 2024

https://github.com/sbrugman/deep-learning-papers

Papers about deep learning ordered by task, date. Current state-of-the-art papers are labelled.

arxiv deep-learning deep-learning-papers machine-learning neural-networks papers science

Last synced: 31 Jul 2024

https://github.com/spotify/basic-pitch

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

audio lightweight machine-learning midi music pitch-detection polyphonic python transcription typescript

Last synced: 01 Aug 2024

https://github.com/SciSharp/TensorFlow.NET

.NET Standard bindings for Google's TensorFlow for developing, training and deploying Machine Learning models in C# and F#.

chatbot csharp deep-learning dotnetcore keras machine-learning scisharp tensorflow

Last synced: 01 Aug 2024

https://github.com/JoePenna/Dreambooth-Stable-Diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles.

ai artificial-intelligence image-generation img2img latent-diffusion machine-learning model-training stable-diffusion txt2img

Last synced: 01 Aug 2024

https://github.com/graykode/nlp-roadmap

ROADMAP(Mind Map) and KEYWORD for students those who have interest in learning NLP

keyword machine-learning natural-language-processing nlp probability-statistics roadmap textmining

Last synced: 01 Aug 2024

https://github.com/promptslab/promptify

Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research

chatgpt chatgpt-api chatgpt-python gpt-3 gpt-3-prompts gpt-4 gpt-4-api gpt3-library large-language-models machine-learning nlp openai prompt-engineering prompt-toolkit prompt-tuning prompt-versioning prompting prompts promptversioning transformers

Last synced: 02 Aug 2024

https://github.com/jaymody/picogpt

An unnecessarily tiny implementation of GPT-2 in NumPy.

deep-learning gpt gpt-2 large-language-models machine-learning neural-network nlp python

Last synced: 02 Aug 2024

https://github.com/google/deepvariant

DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.

bioinformatics deep-learning deep-neural-network deepvariant dna genome genomics machine-learning ngs science sequencing tensorflow

Last synced: 01 Aug 2024

https://github.com/promptslab/Promptify

Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research

chatgpt chatgpt-api chatgpt-python gpt-3 gpt-3-prompts gpt-4 gpt-4-api gpt3-library large-language-models machine-learning nlp openai prompt-engineering prompt-toolkit prompt-tuning prompt-versioning prompting prompts promptversioning transformers

Last synced: 31 Jul 2024

https://github.com/ethen8181/machine-learning

:earth_americas: machine learning tutorials (mainly in Python3)

data-science deep-learning jupyter-notebook machine-learning python python3

Last synced: 31 Jul 2024

https://github.com/visionml/pytracking

Visual tracking library based on PyTorch.

computer-vision machine-learning tracking visual-tracking

Last synced: 31 Jul 2024

https://github.com/jaymody/picoGPT

An unnecessarily tiny implementation of GPT-2 in NumPy.

deep-learning gpt gpt-2 large-language-models machine-learning neural-network nlp python

Last synced: 31 Jul 2024

https://github.com/aksnzhy/xlearn

High performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI interface.

data-analysis data-science factorization-machines ffm fm machine-learning statistics

Last synced: 30 Jul 2024

https://github.com/DeepGraphLearning/LiteratureDL4Graph

A comprehensive collection of recent papers on graph deep learning

arxiv deep-learning machine-learning papers

Last synced: 30 Jul 2024