Projects in Awesome Lists tagged with inference-api
A curated list of projects in awesome lists tagged with inference-api.
https://github.com/roboflow/inference
Turn any computer or edge device into a command center for your computer vision projects.
agents classification computer-vision deployment docker inference inference-api inference-server instance-segmentation jetson machine-learning object-detection onnx python tensorrt vit yolo11 yolov12 yolov5 yolov8
Last synced: 07 Jan 2026
https://github.com/basetenlabs/truss
The simplest way to serve AI/ML models in production
artificial-intelligence easy-to-use falcon inference-api inference-server machine-learning model-serving open-source packaging stable-diffusion whisper wizardlm
Last synced: 08 Feb 2026
https://github.com/bmw-innovationlab/bmw-tensorflow-training-gui
This repository lets you get started with GUI-based training of a state-of-the-art deep learning model with little to no configuration needed! No-code training with TensorFlow has never been so easy.
computer-vision computervision deep-learning deep-neural-networks deeplearning detection-api docker gui inference-api machine-learning neural-network no-code object-detection objectdetection resnet rest-api tensorboard tensorflow tensorflow-gui tensorflow2
Last synced: 02 Jul 2025
https://github.com/BMW-InnovationLab/BMW-TensorFlow-Training-GUI
This repository lets you get started with GUI-based training of a state-of-the-art deep learning model with little to no configuration needed! No-code training with TensorFlow has never been so easy.
computer-vision computervision deep-learning deep-neural-networks deeplearning detection-api docker gui inference-api machine-learning neural-network no-code object-detection objectdetection resnet rest-api tensorboard tensorflow tensorflow-gui tensorflow2
Last synced: 07 Apr 2025
https://github.com/searchsavior/openarc
Lightweight Inference server for OpenVINO
agentic-ai fastapi inference-api inference-engine openvino-toolkit optimum-intel transformers
Last synced: 06 Apr 2025
https://github.com/pszemraj/textsum
CLI & Python API to easily summarize text-based files with transformers
batch-processing inference inference-api pipeline summarization summary text text-to-text-transformer transformer transformers
Last synced: 04 Apr 2025
https://github.com/bmw-innovationlab/bmw-classification-training-gui
This repository allows you to get started with training a State-of-the-art Deep Learning model with little to no configuration needed! You provide your labeled dataset and you can start the training right away. You can even test your model with our built-in Inference REST API. Training classification models with GluonCV has never been so easy.
classification computer-vision deep-learning gluoncv inference-api training
Last synced: 02 Jul 2025
https://github.com/kardbord/hfapigo
Unofficial (Golang) Go bindings for the Hugging Face Inference API
api audio-classification conversational-ai go golang hacktoberfest huggingface inference-api natural-language-processing nlp object-detection question-answering speech-recognition speech-recognition-api summarization text-classification text-generation token-classification translation zero-shot-classification
Last synced: 04 Aug 2025
https://github.com/bmw-innovationlab/bmw-classification-inference-gpu-cpu
This is a repository for an image classification inference API using the GluonCV framework. The inference REST API works on CPU and GPU and is supported on Windows and Linux. Models trained using our GluonCV classification training repository can be deployed in this API. Several models can be loaded and used at the same time.
classification computer-vision deep-learning gluoncv inference inference-api
Last synced: 03 Oct 2025
https://github.com/hupe1980/go-huggingface
🤗 Hugging Face Inference Client written in Go
golang huggingface inference-api
Last synced: 16 Apr 2025
https://github.com/lofcz/LlmTornado
One .NET library to consume OpenAI, Anthropic, Cohere, Google, Azure, Groq, and self-hosed APIs.
aiapi anthropic-ai chatbot cohere command-r-plus gemini gpt4o groq inference-api koboldcpp llama3 llm-inference o1 o3 ollama openai sdk sonnet sonnet3-5
Last synced: 05 Oct 2025
https://github.com/prismadic/magnet
the small distributed language model toolkit; fine-tune state-of-the-art LLMs anywhere, rapidly
apple-silicon claude distributed-computing distributed-systems embeddings fine-tuning finetuning-llms gemini huggingface inference-api langchain llm-training milvus mistral mlx nats nats-messaging nats-streaming sentence-splitting tokenizers
Last synced: 05 Oct 2025
https://github.com/stephanj/llama3javachatcompletionservice
Llama3.java inference engine with an OpenAI Chat Completion REST API
inference-api java llama3 openai-api
Last synced: 05 May 2025
https://github.com/tommylemon/cvauto
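👁 Zero-code, zero-annotation automated testing tool for CV AI 🚀 Removes the need for large amounts of manual box drawing and labeling, so you can quickly run automated tests with zero code on CV (computer vision) AI image recognition algorithms: pedestrian detection, animal and plant classification, face recognition, OCR license plate recognition, rotation correction, dance pose estimation, matting and segmentation, and more. It also supports one-click download of test reports and export of training and test datasets.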
ai ai-testing apijson classification computer-vision cv cv2 detection face-recognition inference-api inference-server ocr pose-estimation rotation segmentation test-automation ultralytics ultralytics-yolo yolo yolo11
Last synced: 15 Sep 2025
https://github.com/decisionfacts/semantic-ai
An open-source framework for Retrieval-Augmented System (RAG) that uses semantic search to retrieve the expected results and generates human-readable conversational responses with the help of an LLM (Large Language Model).
approximate-nearest-neighbor-search deep-neural-networks document-parser docx fastapi inference-api llama2 llm machine-learning ocr openai openai-api pdf rag retrieval-augmented-generation semantic-search vector-database
Last synced: 27 Jul 2025
https://github.com/yas-sim/openvino-ep-enabled-onnxruntime
Describing How to Enable OpenVINO Execution Provider for ONNX Runtime
deep-learning inference inference-api inference-engine inference-library intel onnx onnx-backend onnx-format onnxruntime openvino openvino-toolkit
Last synced: 22 Apr 2025
https://github.com/Prismadic/magnet
the small distributed language model toolkit; fine-tune state-of-the-art LLMs anywhere, rapidly
apple-silicon claude distributed-computing distributed-systems embeddings fine-tuning finetuning-llms gemini huggingface inference-api langchain llm-training milvus mistral mlx nats nats-messaging nats-streaming sentence-splitting tokenizers
Last synced: 10 Aug 2025
https://github.com/inference-gateway/inference-gateway
An open-source, high-performance gateway unifying multiple LLM providers, from local solutions like Ollama to major cloud providers such as OpenAI, Groq, Cohere, Anthropic, Cloudflare and DeepSeek.
agnostic anthropic api cohere deepseek-v3 gateway gateway-api golang inference-api kubernetes llm openai opensource opensource-projects opentelemetry performance proxy proxy-server self-hosted tracing
Last synced: 12 Dec 2025
https://github.com/rageagainstthepixel/com.rest.huggingface
A Non-Official HuggingFace Rest Client for Unity (UPM)
ai generative-ai generative-art generative-design huggingface inference-api unity unity3d upm upm-package
Last synced: 26 Oct 2025
https://github.com/kyryl-opens-ml/ml-in-production-practice
Practice for Machine Learning in Production course
data inference-api infrastructure llm ml mlops monitoring pipelines platform
Last synced: 16 Jan 2026
https://github.com/shivammg/stable-diffusion-on-azureml
REST APIs for StableDiffusion. Inferencing support on AzureML
ai artificial-intelligence azure azure-machine-learning generative-art inference-api latent-diffusion machine-learning rest-api stable-diffusion
Last synced: 07 Aug 2025
https://github.com/intelligencedev/eternal
Eternal is an experimental platform for machine learning models and workflows.
ai go htmx inference-api llamacpp ml stable-diffusion
Last synced: 09 Apr 2025
https://github.com/jparkerweb/bedrock-proxy-endpoint
🔀 Bedrock Proxy Endpoint ⇢ Spin up your own custom OpenAI API server endpoint for easy AWS Bedrock inference (using standard baseUrl, and apiKey params)
api aws-bedrock endpoint inference inference-api llama3 llm mistral mixtral openai-api proxy serverless wrapper
Last synced: 01 May 2025
https://github.com/pandruszkow/whisper-inference-server
A networked inference server for Whisper so you don't have to keep waiting for the audio model to reload for the x-hundredth time.
flask inference-api inference-server python3 whisper-ai
Last synced: 25 Oct 2025
https://github.com/gmkung/cheemera
A Node.js backend that exposes a TypeScript implementation of the deCheem inference engine.
ai expert-advisor expert-system gpt inference-api inference-engine
Last synced: 24 Jul 2025
https://github.com/sunra-ai/sunra-clients
ai image-generation inference-api sdk sunra video-generation
Last synced: 14 Jan 2026
https://github.com/yav-ai/nodejs-stable-diffusion-xl-base-1.0-hugging-face-inference-api
A simple Node.js example that generates an image using Stable Diffusion via the Hugging Face Inference API.
ai generative-ai huggingface image-generation inference-api nodejs nodejsexamples sdxl stable-diffusion stable-diffusion-api stable-diffusion-xl
Last synced: 11 Apr 2025
https://github.com/socaity/socaity
SDK for generative AI.
3d-generation api artificial-intelligence audio-generation bark clip deepseek-r1 flux generative-ai hosting image-captioning image-generation inference-api llama3 llm runpod speech-synthesis stable-diffusion text-to-speech video-generation
Last synced: 15 Apr 2025
https://github.com/xx025/cvmd
A Computer Vision Model Development toolkit. cvmd uses NumPy arrays as both input and output, aiming to provide a unified and concise model inference interface.
deformable-detr detr inference inference-api yolo yolov11 yolov5 yolov8
Last synced: 13 Jan 2026
https://github.com/extrawest/flutter-podcast-to-blog-ai-app
This app is designed to work with PodcastIndex.org. You can listen to podcasts, get a text version, a short summary, an audio file based on the summary, and an AI chat on the subject of the podcast.
ai bloc chatbot flutter huggingface image-generation-ai inference-api nodejs podcastindex speech-to-text summarization text-to-speech
Last synced: 03 Apr 2025
https://github.com/m-yasirghaffar/flux.1-schnell-ai-image-generator
FLUX.1-schnell is a state-of-the-art image generation tool powered by an AI model from Black Forest Labs, available through Hugging Face’s inference API. Generate high-quality, unique images from text prompts with ease. This application is a MERN stack implementation of the model.
flux-schnell inference-api mern-stack text-to-image
Last synced: 07 Jul 2025
https://github.com/yas-sim/openvino_asynchronous_api_performance_demo
This project demonstrates the high performance of the OpenVINO asynchronous inference API.
accelerator asynchronous deep-learning edge-ai embedded face-detection high-performance inference inference-api openvino python
Last synced: 29 Mar 2025
https://github.com/pogzyb/llmabda
Run a llama.cpp server on AWS Lambda for cheap
agentic-ai inference inference-api inference-engine lambda-functions llamacpp
Last synced: 13 Jan 2026