An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with model-inference

A curated list of projects in awesome lists tagged with model-inference .

https://github.com/bentoml/openllm

Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

bentoml fine-tuning llama llama2 llama3-1 llama3-2 llama3-2-vision llm llm-inference llm-ops llm-serving llmops mistral mlops model-inference open-source-llm openllm vicuna

Last synced: 23 Oct 2025

https://github.com/bentoml/OpenLLM

Run any open-source LLMs, such as Llama 3.1, Gemma, as OpenAI compatible API endpoint in the cloud.

bentoml fine-tuning llama llama2 llama3-1 llama3-2 llama3-2-vision llm llm-inference llm-ops llm-serving llmops mistral mlops model-inference open-source-llm openllm vicuna

Last synced: 14 Mar 2025

https://github.com/bentoml/clip-api-service

CLIP as a service - Embed image and sentences, object recognition, visual reasoning, image classification and reverse image search

ai-applications clip cloud-native mlops model-inference model-inference-service model-serving openai-clip

Last synced: 04 May 2025

https://github.com/davidnyarko123/edge-tpu-silva

Streamlining the process for seamless execution of PyCoral in running TensorFlow Lite models on an Edge TPU USB.

coral-tpu edge-computing fps machine-learning model-inference object-detection pycoral raspberry-pi raspberry-pi-4

Last synced: 28 Oct 2025

https://github.com/koldim2001/image_captioning

Генерация описаний к изображениям с помощью различных архитектур нейронных сетей

attention-mechanism captioning-images computer-vision image-captioning lstm model-deployment model-inference nlp soft-attention streamlit website word-embeddings

Last synced: 23 Oct 2025

https://github.com/akrisanov/inference-engineering-journey

A personal journey into model inference engineering — learning, building, and sharing along the way.

inference-engineering jupyter-notebook learning-in-public machine-learning-journey model-inference personal-notes python

Last synced: 17 Apr 2026

https://github.com/sayamalt/cyberbullying-classification-using-fine-tuned-distilbert

Successfully fine-tuned a pretrained DistilBERT transformer model that can classify social media text data into one of 4 cyberbullying labels i.e. ethnicity/race, gender/sexual, religion and not cyberbullying with a remarkable accuracy of 99%.

cyberbullying-detection data-exploration distilbert-model exploratory-data-analysis fine-tune-bert-tensorflow llm model-inference model-training-and-evaluation multiclass-classification natural-language-processing text-classification text-preprocessing text-tokenization

Last synced: 09 Nov 2025

https://github.com/kwame-mintah/gcp-cloud-run-function-model-inference

A cloud run function to invoke a prediction against a machine learning model that has been trained outside of a cloud provider.

cloud-functions fastapi gcp google-cloud-platform model-inference

Last synced: 02 May 2026

https://github.com/sayamalt/natural-scenes-image-classification-using-cnns

Successfully established an image classification model using PyTorch to classify the images of several distinct natural sceneries such as mountains, glaciers, forests, seas, streets and buildings with an accuracy of 86%.

convolutional-neural-networks deep-learning fine-tuning-cnns flask-deployment image-classification image-transformations model-inference model-training-and-evaluation multiclass-classification pytorch resnet50-model torch-dataloader

Last synced: 25 Apr 2026

https://github.com/sayamalt/mental-health-classification-using-fine-tuned-distilbert

Successfully established a multiclass text classification model by fine-tuning pretrained DistilBERT transformer model to classify several distinct types of mental health statuses such as anxiety, stress, personality disorder, etc. with an accuracy of 77%.

data-visualization deep-learning distilbert-fine-tuning distilbert-model model-evaluation model-inference model-training-and-evaluation multiclass-text-classification natural-language-processing text-classification text-preprocessing text-tokenization

Last synced: 08 Oct 2025

https://github.com/sayamalt/luxury-apparel-product-category-classification-using-fine-tuned-distilbert

Successfully developed a multiclass text classification model by fine-tuning pretrained DistilBERT transformer model to classify various distinct types of luxury apparels into their respective categories i.e. pants, accessories, underwear, shoes, etc.

deep-learning distilbert-fine-tuning distilbert-model exploratory-data-analysis fine-tuning-bert model-inference model-training-and-evaluation multiclass-classification natural-language-processing text-classification text-preprocessing text-tokenization

Last synced: 08 Oct 2025

https://github.com/hnthap/cat-or-dog

An End-to-end AI Application classifying images as either a cat or a dog. The project leverages OpenVINO Model Server, a Node.js backend, and a React-based frontend.

axios docker docker-compose express full-stack image-classification image-preprocessing javascript machine-learning model-deployment model-inference nodejs react rest-api self-host sharp typescript vite website

Last synced: 14 Apr 2026

https://github.com/sayamalt/wine-cultivator-classification-using-ann

Successfully established an ANN model which can classify wine cultivators based on several characteristics of distinct wines.

artificial-neural-networks classification deep-learning model-inference model-training-and-evaluation multiclass-classification pytorch

Last synced: 15 May 2026

https://github.com/sayamalt/oral-disease-classification-using-cnn

Successfully developed an image classification model using PyTorch to classify two types of oral diseases, namely caries and gingivitis.

binary-classification convolutional-neural-networks data-loader deep-learning image-classification image-transformations model-inference model-training-and-evaluation pytorch

Last synced: 14 May 2026

https://github.com/sayamalt/global-news-headlines-text-summarization

Successfully established a text summarization model using Seq2Seq modeling with Luong Attention, which can give a short and concise summary of the global news headlines.

attention-mechanism data-exploration-and-preprocessing luong-attention model-architecture-and-implementation model-inference natural-language-processing seq2seq-model text-generation text-summarization text-tokenization

Last synced: 09 Nov 2025