awesome-llmops
An awesome & curated list of best LLMOps tools for developers
https://github.com/tensorchord/awesome-llmops
Last synced: 1 day ago
JSON representation
-
Large Scale Deployment
-
Workflow
- Airflow - square) |
- Metaflow - life data science projects with ease! |  |
- Kubeflow Pipelines - square) |
- Argo Workflows - workflows.svg?style=flat-square) |
- Prefect - square) |
- Flyte - native workflow automation platform for complex, mission-critical data and ML processes at scale. |  |
- Hamilton - inc/hamilton.svg?style=flat-square) |
- ZenML - io/zenml.svg?style=flat-square) |
- Ploomber - square) |
- aqueduct - Source Platform for Production Data Science |  |
- LangFlow - and-drop components and a chat interface. |  |
-
ML Platforms
- OpenLLM - tune, serve, deploy, and monitor any LLMs with ease. |  |
- MLflow - square) |
- Kserve - square) |
- Kubeflow - square) |
- Polyaxon - square) |
- ModelFox - square) |
- Seldon-core - core.svg?style=flat-square) |
- Hopsworks - tuning and serving LLMs. Hopsworks includes both a feature store and vector database for RAG. |  |
- Weights & Biases - powered applications, featuring W&B Prompts for LLM execution flow visualization, input and output monitoring, and secure management of prompts and LLM chain configurations. |  |
- MLRun - square) |
- Primehub - square) |
- OpenModelZ - click machine learning deployment (LLM, text-to-image and so on) at scale on any cluster (GCP, AWS, Lambda labs, your home lab, or even a single machine). |  |
- Starwhale - tuning. |  |
- ClearML - Magical CI/CD to streamline your ML workflow. Experiment Manager, MLOps and Data-Management. |  |
- TrueFoundry - tune and serve LLM Models on a company’s own Infrastructure with Data Security and Optimal GPU and Cost Management. Launch your LLM Application at Production scale with best DevSecOps practices. | |
-
Model Management
-
Scheduling
- Kueue - native Job Queueing. |  |
- Volcano - sh/volcano.svg?style=flat-square) |
- Slurm - square) |
- PAI - sourced by Microsoft). |  |
- Yunikorn - weight, universal resource scheduler for container orchestrator systems. |  |
-
-
Search
-
Vector search
- Pinecone - performance vector search applications. Developer-friendly, fully managed, and easily scalable without infrastructure hassles. | |
- Vellum - of-box support for OCR, text chunking, embedding model experimentation, metadata filtering, and production-grade APIs. | |
- pgvector - source vector similarity search for Postgres. |  |
- Milvus - io/milvus.svg?style=flat-square) |
- txtai - powered semantic search applications |  |
- Qdrant - square) |
- Marqo - ai/marqo.svg?style=flat-square) |
- Vald - square) |
- Chroma - core/chroma.svg?style=flat-square) |
- Lancedb - friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps! |  |
- pgvecto.rs - square) |
- Infinity - native database built for LLM applications, providing incredibly fast vector and full-text search |  |
- ParadeDB - square) |
- Vearch - based vector retrieval |  |
- Epsilla - cloud/vectordb.svg?style=flat-square) |
- Awadb - ai/awadb.svg?style=flat-square) |
- VectorDB - no more, no less. |  |
- VectorChord - friendly vector search in Postgres, the successor of `pgvecto.rs`. |  |
- Weaviate - tolerance and scalability of a cloud-native database, all accessible through GraphQL, REST, and various language clients. |  |
- AquilaDB - NN search. |  |
-
-
Model
-
CV Foundation Model
- midjourney
- stable-diffusion - to-image diffusion model |  |
- stable-diffusion v2 - Resolution Image Synthesis with Latent Diffusion Models |  |
- segment-anything (SAM) - anything.svg?style=flat-square) |
- disco-diffusion - diffusion.svg?style=flat-square) |
-
Large Language Model
- Mixtral-8x7B-v0.1 - 8x7B Large Language Model (LLM) is a pretrained generative Sparse Mixture of Experts. | |
- Falcon 40B - 40B-Instruct is a 40B parameters causal decoder-only model built by TII based on Falcon-40B and finetuned on a mixture of Baize. It is made available under the Apache 2.0 license. | |
- Gemma
- FastChat (Vicuna) - T5. |  |
- Alpaca - lab/stanford_alpaca.svg?style=flat-square) |
- BELLE - tune by 34B Chinese Character Corpus, based on LLaMA and Alpaca. |  |
- StableLM - AI/StableLM.svg?style=flat-square) |
- GLM-6B (ChatGLM) - Trained Model, quantization of ChatGLM-130B, can run on consumer-level GPUs. |  |
- dolly - square) |
- Luotuo - Alpaca-LoRA. |  |
- ChatGLM2-6B - 6B is the second-generation version of the open-source bilingual (Chinese-English) chat model [ChatGLM-6B](https://github.com/THUDM/ChatGLM-6B). |  |
- GLM-130B (ChatGLM) - Trained Model (ICLR 2023) |  |
- GPT-NeoX - neox.svg?style=flat-square) |
- Bloom - science Open-access Multilingual Language Model |  |
-
Audio Foundation Model
- bark - based text-to-audio model created by Suno. Bark can generate highly realistic, multilingual speech as well as other audio - including music, background noise and simple sound effects. |  |
- whisper - Scale Weak Supervision |  |
-
-
LLMOps
-
Observability
- TrueFoundry - prem) Infra including deploying, Fine-tuning, tracking Prompts and serving Open Source LLM Models with full Data Security and Optimal GPU Management. Train and Launch your LLM Application at Production scale with best Software Engineering practices. | |
- Portkey - efficient apps. | |
- Fiddler AI - production to production. | |
- Parea AI - controlled enhanced prompt playground. |  |
- Vellum
- Izlo
- Keywords AI
- Literal AI - modal LLM observability and evaluation platform. Create prompt templates, deploy prompts versions, debug LLM runs, create datasets, run evaluations, monitor LLM metrics and collect human feedback. | |
- agenta - AI/agenta.svg?style=flat-square) |
- Dify - source framework aims to enable developers (and even non-developers) to quickly build useful applications based on large language models, ensuring they are visual, operable, and improvable. |  |
- Pezzo 🕹️ - source LLMOps platform built for developers and teams. In just two lines of code, you can seamlessly troubleshoot your AI operations, collaborate and manage your prompts in one place, and instantly deploy changes to any environment. |  |
- Langfuse - square) |
- Evidently - source framework to evaluate, test and monitor ML and LLM-powered systems. |  |
- Haystack - answering and more. |  |
- deeplake - square) |
- Cheshire Cat AI - cat-ai/core.svg?style=flat-square) |
- GPTCache - square) |
- LLMApp - time LLM-enabled data pipelines with few lines of code. |  |
- Arize-Phoenix - ai/phoenix.svg?style=flat-square) |
- LangKit - of-the-box LLM telemetry collection library that extracts features and profiles prompts, responses and metadata about how your LLM is performing over time to find problems at scale. |  |
- Glide - Native LLM Routing Engine. Improve LLM app resilience and speed. |  |
- xTuring - tuning. |  |
- Helicone - source LLM observability platform for logging, monitoring, and debugging AI applications. Simple 1-line integration to get started. |  |
- prompttools - source tools for testing and experimenting with prompts. The core idea is to enable developers to evaluate prompts using familiar interfaces like code and notebooks. In just a few lines of codes, you can test your prompts and parameters across different models (whether you are using OpenAI, Anthropic, or LLaMA models). You can even evaluate the retrieval accuracy of vector databases. |  |
- magentic - powered functionality. |  |
- BudgetML - square) |
- Lunary - and-play integration into LangChain. |  |
- LLMFlows - answering systems, and agents. |  |
- Mirascope - fast, efficient development and ensuring quality in LLM-based applications |  |
- OpenLIT - native GenAI and LLM Application Observability tool and provides OpenTelmetry Auto-instrumentation for monitoring LLMs, VectorDBs and Frameworks. It provides valuable insights into token & cost usage, user interaction, and performance related metrics. |  |
- AI studio - square) |
- Dstack - effective LLM development in any cloud (AWS, GCP, Azure, Lambda, etc). |  |
- PromptMage - source tool to simplify the process of creating and managing LLM workflows and prompts as a self-hosted solution. |  |
- GPUStack - source GPU cluster manager for running and managing LLMs |  |
- PromptFoundry - foundry/python-sdk.svg?style=flat-square) |
- Opik - ml/opik.svg?style=flat-square) |
- gotoHuman - based and agentic workflows. Prompt users to approve actions, select next steps, or review and validate generated results. |
- Laminar - source all-in-one platform for engineering AI products. Traces, Evals, Datasets, Labels. |  |
- PromptLayer 🍰 - layer-library.svg?style=flat-square) |
- TensorZero - source framework for building production-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluations, and experimentation. |  |
- Dataoorts
- PromptDX - ai/promptdx.svg?style=flat-square) |
- systemprompt.io
- MLflow - source framework for the end-to-end machine learning lifecycle, helping developers track experiments, evaluate models/prompts, deploy models, and add observability with tracing. |  |
- Epsilla - in-one platform to create vertical AI agents powered by your private data and knowledge. | |
- PromptSite - works directly with your local filesystem, ideal for data scientists and engineers to easily integrate into existing LLM workflows | |
- AgentMark - Safe Markdown-based Agents |  |
- AI studio - square) |
- LiteLLM 🚅 - square) |
- LlamaIndex - square) |
- langchain - square) |
- Manag.ai - in-one prompt management and observability platform. Craft, track, and perfect your LLM prompts with ease. | |
- Neurolink - provider AI agent framework that unifies 12+ LLM providers (OpenAI, Google, Anthropic, AWS, Azure, Groq, etc.) with workflow orchestration. Production-grade platform for building LLM applications with streaming, tool calling, caching, and enterprise features. Battle-tested at 15M+ requests/month. |  |
- Manag.ai - in-one prompt management and observability platform. Craft, track, and perfect your LLM prompts with ease. | |
- Dataoorts
- Hypersigil - source prompt lifecycle management and gateway with a Web UI. |  |
- Roundtable - configuration unified AI assistant management built on the FastMCP framework. Provides seamless integration with Claude, ChatGPT, and other AI assistants through a single MCP interface with session management, logging, and production-ready operations. |  |
- Weights & Biases (Prompts) - first W&B MLOps platform. Utilize W&B Prompts for visualizing and inspecting LLM execution flow, tracking inputs and outputs, viewing intermediate results, securely managing prompts and LLM chain configurations. | |
- Embedchain - square) |
- Epsilla - in-one platform to create vertical AI agents powered by your private data and knowledge. | |
- gotoHuman - based and agentic workflows. Prompt users to approve actions, select next steps, or review and validate generated results. |
- Keywords AI
- Literal AI - modal LLM observability and evaluation platform. Create prompt templates, deploy prompts versions, debug LLM runs, create datasets, run evaluations, monitor LLM metrics and collect human feedback. | |
- PromptDX - ai/promptdx.svg?style=flat-square) |
- PromptFoundry - foundry/python-sdk.svg?style=flat-square) |
- Prompteams - time APIs. Have GitHub style with repos, branches, and commits (and commit history). | |
- Puzzlet AI - Based LLM Engineering Platform. Achieve more from GenAI: Manage, evaluate, and improve your full-stack LLM application - with version control, type-safety, and local development built-in. | |
- systemprompt.io
- TreeScale - enhanced APIs seamlessly using tools for prompt optimization, semantic querying, version management, statistical evaluation, and performance tracking. As a part of the developer friendly API implementation TreeScale offers Elastic LLM product, which makes a unified API Endpoint for all major LLM providers and open source models. | |
-
-
AutoML
-
Profiling
- TPOT - source software packages. |  |
- auto-sklearn - in replacement for a scikit-learn estimator. |  |
- Goptuna - bata/goptuna.svg?style=flat-square) |
- Hyperopt - square) |
- FLAML - us/research/publication/flaml-a-fast-and-lightweight-automl-library/)). |  |
- Pycaret - source, low-code machine learning library in Python that automates machine learning workflows. |  |
- AutoRAG - Boost your LLM app performance with your own data |  |
- autokeras - team/autokeras.svg?style=flat-square) |
- Optuna - square) |
- Determined - ai/determined.svg?style=flat-square) |
- Model Search - square) |
- Auto-PyTorch - PyTorch.svg?style=flat-square) |
- automl-gs - gs.svg?style=flat-square) |
- AutoGL - square) |
- Torchmeta - Learning library for PyTorch. |  |
- learn2learn - learning Framework for Researchers. |  |
- Keras Tuner - team/keras-tuner.svg?style=flat-square) |
- Dragonfly - square) |
- Archai - square) |
- MOE - square) |
- Hyperband - square) |
- autoai - square) |
- DEvol (DeepEvolution) - square) |
- EvalML - square) |
- FEDOT - itmo/FEDOT.svg?style=flat-square) |
- HpBandSter - square) |
- Hypernets - square) |
- hyperunity - box hyperparameter optimisation. |  |
- Intelli - square) |
- Katib - native project for automated machine learning (AutoML). |  |
- NASGym - of-concept OpenAI Gym environment for Neural Architecture Search (NAS). |  |
- NNI - parameter tuning. |  |
- REMBO - dimensions via random embedding. |  |
- RoBO - square) |
- scikit-optimize(skopt) - based optimization with a `scipy.optimize` interface. |  |
- Spearmint - square) |
- Vegas - noah/vega.svg?style=flat-square) |
- AutoGluon - square) |
- Ludwig - square) |
- HPOlib2 - square) |
-
-
Observability
- PromptHub - Full stack prompt management tool designed to be usable by technical and non-technical team members. Test, version, collaborate, deploy, and monitor, all from one place.
- Prompteams - Prompt management system. Version, test, collaborate, and retrieve prompts through real-time APIs. Have GitHub style with repos, branches, and commits (and commit history).
- Doku - An open-source LLM Observability platform streamlining the monitoring of LLM applications with just two lines of code. It provides valuable insights into token usage and user engagement, tracks API usage for providers like OpenAI, and facilitates easy data export to observability platforms like Grafana and DataDog.
-
Training
-
Visualization
- Fiddler AI
- netron - square) |
- TensorBoard - square) |
- TensorSpace - trained deep learning models from TensorFlow, Keras, TensorFlow.js. |  |
- Zetane Viewer - square) |
- Maniford - agnostic visual debugging tool for machine learning. |  |
- dtreeviz - square) |
- OpenOps - square) |
- Zeno - ml/zeno.svg?style=flat-square) |
- OpenOps - square) |
-
Foundation Model Fine Tuning
- Flyflow - devs/flyflow.svg?style=flat-square) |
- alpaca-lora - tune LLaMA on consumer hardware |  |
- peft - of-the-art Parameter-Efficient Fine-Tuning. |  |
- TRL - square) |
- p-tuning-v2 - tuning on small/medium-sized models and sequence tagging challenges. [(ACL 2022)](https://arxiv.org/abs/2110.07602) |  |
- QLoRA - bit finetuning task performance. |  |
- LMFlow - square) |
- Lora - rank adaptation to quickly fine-tune diffusion models. |  |
- finetuning-scheduler - tuning schedules. |  |
-
Frameworks for Training
- LightGBM - square) |
- TensorFlow - square) |
- Keras - team/keras.svg?style=flat-square) |
- Horovod - square) |
- scikit-learn - learn/scikit-learn.svg?style=flat-square) |
- Apache MXNet - aware Dataflow Dep Scheduler. |  |
- Caffe - square) |
- PyTorch - square) |
- Kedro - source Python framework for creating reproducible, maintainable and modular data science code. |  |
- XGBoost - square) |
- PaddlePaddle - square) |
- ColossalAI - scale model training system with efficient parallelization techniques. |  |
- MindSpore - ai/mindspore.svg?style=flat-square) |
- MegEngine - to-use deep learning framework, with auto-differentiation. |  |
- Oneflow - centered and open-source deep learning framework. |  |
- Accelerate - GPU, TPU, mixed-precision. |  |
- Candle - square`) |
- metric-learn - learn-contrib/metric-learn.svg?style=flat-square) |
- VectorFlow - square) |
- Jax - performance machine learning research. |  |
- DeepSpeed - square) |
- axolotl - tuning of various AI models, offering support for multiple configurations and architectures. |  |
-
IDEs and Workspaces
- Docker - source project created by Docker to enable and accelerate software containerization. |  |
- code server - server.svg?style=flat-square) |
- conda - agnostic, system-level binary package manager and ecosystem. |  |
- Kurtosis - container environments. |  |
- Jupyter Notebooks - based notebook environment for interactive computing. |  |
- envd - square) |
-
Experiment Tracking
- Aim - to-use and performant open-source experiment tracker. |  |
- Kedro-Viz - Viz is an interactive development tool for building data science pipelines with Kedro. Kedro-Viz also allows users to view and compare different runs in the Kedro project. |  |
- Guild AI - square) |
- LabNotebook - square) |
- Sacred - square) |
-
Model Editing
- FastEdit - square) |
-
-
ML Platforms
- TrueFoundry - A PaaS to deploy, Fine-tune and serve LLM Models on a company’s own Infrastructure with Data Security and Optimal GPU and Cost Management. Launch your LLM Application at Production scale with best DevSecOps practices.
-
Awesome Lists
-
Profiling
- Awesome Federated Learning Systems - paper.svg?style=flat-square) |
- kelvins/awesome-mlops - mlops.svg?style=flat-square) |
- Awesome AutoML Papers - automl-papers.svg?style=flat-square) |
- Awesome AutoDL - depth analysis) |  |
- awesome-federated-learning - federated-learning.svg?style=flat-square) |
- visenger/awesome-mlops - An awesome list of references for MLOps |  |
- Awesome Production Machine Learning - production-machine-learning.svg?style=flat-square) |
- Awesome Tensor Compilers - tensor-compilers.svg?style=flat-square) |
- currentslab/awesome-vector-search - vector-search.svg?style=flat-square) |
- pleisto/flappy - Ready LLM Agent SDK for Every Developer |  |
- Awesome-Code-LLM - LLM for research. |  |
- Awesome AutoML - related research, tools, projects and other resources |  |
- Awesome Federated Learning - organized from Arxiv (mostly) |  |
- Awesome Open MLOps - open-mlops.svg?style=flat-square) |
-
-
Serving
-
Frameworks/Servers for Serving
- BentoML - square) |
- TFServing - performance serving system for machine learning models. |  |
- Triton Server (TRTIS) - inference-server/server.svg?style=flat-square) |
- Xinference - source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop. |  |
- Torchserve - square) |
- lanarky - grade LLM applications |  |
- ray-llm - RayLLM |  |
- langchain-serve - ai/langchain-serve.svg?style=flat-square) |
- Mosec - to-use Python interface. |  |
- KubeAI - to-text. |  |
- Kaito - 3) using container images and GPU auto-provisioning. Includes an OpenAI-compatible server for inference and preset configurations for popular runtimes such as vLLM and transformers. |  |
- Open Responses - source platform for building long-running LLM agents with tool use. |  |
- Open Responses - source platform for building long-running LLM agents with tool use. |  |
- Open Responses - source platform for building long-running LLM agents with tool use. |  |
- Jina - ai/jina.svg?style=flat-square) |
-
Large Model Serving
- whisper.cpp - square) |
- text-generation-inference - generation-inference.svg?style=flat-square) |
- Clip-as-a-service - ai/clip-as-service.svg?style=flat-square) |
- text-embeddings-inference - embedding models |  |
- Infinity - embeddings |  |
- vllm - throughput and memory-efficient inference and serving engine for LLMs. |  |
- TensorRT-LLM - LLM.svg?style=flat-square) |
- Flowise - square) |
- tokenizers - of-the-Art Tokenizers optimized for Research and Production |  |
- CTranslate2 - square) |
- Modelz-LLM - llm.svg?style=flat-square) |
- x-stable-diffusion - time inference for Stable Diffusion - 0.88s latency. Covers AITemplate, nvFuser, TensorRT, FlashAttention. |  |
- DeepSpeed-MII - latency and high-throughput inference possible, powered by DeepSpeed. |  |
- prima.cpp - square) |
- Shimmy - free Rust inference server with OpenAI API compatibility and hot model swapping |  |
- FlexGen - oriented scenarios. |  |
- Ollama - square) |
- llama.cpp - square) |
-
-
Data
-
Data Management
- Quilt - organizing data hub for S3. |  |
- Dolt - square) |
- Pachyderm - square) |
- Delta-Lake - io/delta.svg?style=flat-square) |
- ArtiVC - square) |
-
Data Storage
-
Data Tracking
-
Data/Feature enrichment
- Feast - dev/feast.svg?style=flat-square) |
- Upgini - to-use features from public and community shared data sources and enriches your training dataset with only the accuracy improving features |  |
- distilabel - quality outputs, full data ownership, and overall efficiency. |  |
- FastDatasets - quality training datasets for Large Language Models. |  |
-
Feature Engineering
- Featureform - square) |
- FeatureTools - square) |
-
-
Optimizations
-
Profiling
- NCNN - performance neural network inference framework optimized for the mobile platform. |  |
- TNN - square) |
- PocketFlow - square) |
- TensorFlow Model Optimization - optimization.svg?style=flat-square) |
- FeatherCNN - square) |
- Forward - square) |
- LangWatch - square) |
- optimum-tpu - tpu.svg?style=flat-square) |
-
-
Code AI
-
Vector search
- CodeT5 - square) |
- Continue - source autopilot for software development—bring the power of ChatGPT to VS Code |  |
- tabby - hosted AI coding assistant. An opensource / on-prem alternative to GitHub Copilot. |  |
- fauxpilot - source alternative to GitHub Copilot server |  |
- CodeGeeX - square) |
- CodeGen - source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex. |  |
- promptext - square) |
-
-
Performance
-
ML Compiler
- ONNX-MLIR - mlir.svg?style=flat-square) |
- TVM - square) |
- bitsandbytes - bit quantization for PyTorch. |  |
-
Profiling
- scalene - performance, high-precision CPU, GPU, and memory profiler for Python |  |
- octoml-profile - profile is a python library and cloud service designed to provide the simplest experience for assessing and optimizing the performance of PyTorch models on cloud hardware with state-of-the-art ML acceleration technology. |  |
-
-
Security
-
Observability
- Great Expectations - expectations/great_expectations.svg?style=flat-square) |
- Deepchecks - square) |
- Traceloop OpenLLMetry - based observability and monitoring for LLM and agents workflows. | 
- whylogs - square) |
- Giskard - AI/giskard.svg?style=flat-square) |
- Azure OpenAI Logger - openai-logger?style=flat-square) |
- Fiddler AI - production to production. Ship more ML and LLMs into production, and monitor ML and LLM metrics like hallucination, PII, and toxicity. |  |
- Maxim AI
-
Frameworks for LLM security
- Plexiglass - labs/plexiglass?style=flat-square) |
- Plexiglass - labs/plexiglass?style=flat-square) |
-
-
Federated ML
-
Profiling
- FATE - square) |
- Flower - square) |
- FedML - scale cross-silo federated learning, cross-device federated learning on smartphones/IoTs, and research simulation. |  |
- EasyFL - to-use Federated Learning Platform |  |
- Harmonia - source project aiming at developing systems/infrastructures and libraries to ease the adoption of federated learning (abbreviated to FL) for researches and production usage. |  |
-
Programming Languages
Categories
Sub Categories
Observability
77
Profiling
69
Vector search
27
Frameworks for Training
22
Large Model Serving
18
ML Platforms
15
Frameworks/Servers for Serving
15
Large Language Model
14
Workflow
11
Visualization
10
Foundation Model Fine Tuning
9
IDEs and Workspaces
6
Model Management
5
Scheduling
5
Data Management
5
Experiment Tracking
5
CV Foundation Model
5
Data/Feature enrichment
4
ML Compiler
3
Data Storage
3
Feature Engineering
2
Audio Foundation Model
2
Frameworks for LLM security
2
Data Tracking
2
Model Editing
1
Keywords
machine-learning
106
python
62
deep-learning
57
llm
54
mlops
44
data-science
42
pytorch
40
ai
40
llmops
29
tensorflow
26
kubernetes
25
ml
25
automl
24
openai
19
inference
19
large-language-models
18
hyperparameter-optimization
17
llms
16
chatgpt
15
langchain
14
vector-database
14
keras
14
prompt-engineering
14
vector-search
13
rag
13
gpt
13
gpu
13
neural-network
12
neural-architecture-search
12
llama
11
observability
11
docker
10
generative-ai
10
artificial-intelligence
10
data-engineering
9
golang
9
hyperparameter-tuning
9
transformers
9
scikit-learn
9
analytics
9
chatbot
9
fine-tuning
9
developer-tools
9
evaluation
8
automated-machine-learning
8
open-source
8
go
8
nearest-neighbor-search
8
workflow
8
model-serving
8