Projects in Awesome Lists tagged with model-inference
A curated list of projects in awesome lists tagged with model-inference.
https://github.com/bentoml/openllm
Run any open-source LLM, such as DeepSeek or Llama, as an OpenAI-compatible API endpoint in the cloud.
bentoml fine-tuning llama llama2 llama3-1 llama3-2 llama3-2-vision llm llm-inference llm-ops llm-serving llmops mistral mlops model-inference open-source-llm openllm vicuna
Last synced: 23 Oct 2025
https://github.com/bentoml/OpenLLM
Run any open-source LLM, such as Llama 3.1 or Gemma, as an OpenAI-compatible API endpoint in the cloud.
bentoml fine-tuning llama llama2 llama3-1 llama3-2 llama3-2-vision llm llm-inference llm-ops llm-serving llmops mistral mlops model-inference open-source-llm openllm vicuna
Last synced: 14 Mar 2025
https://github.com/bentoml/clip-api-service
CLIP as a service - Embed images and sentences, object recognition, visual reasoning, image classification, and reverse image search
ai-applications clip cloud-native mlops model-inference model-inference-service model-serving openai-clip
Last synced: 04 May 2025
https://github.com/davidnyarko123/edge-tpu-silva
Streamlines running TensorFlow Lite models with PyCoral on an Edge TPU USB accelerator.
coral-tpu edge-computing fps machine-learning model-inference object-detection pycoral raspberry-pi raspberry-pi-4
Last synced: 28 Oct 2025
https://github.com/koldim2001/image_captioning
Generating image captions with various neural network architectures
attention-mechanism captioning-images computer-vision image-captioning lstm model-deployment model-inference nlp soft-attention streamlit website word-embeddings
Last synced: 23 Oct 2025
https://github.com/chaitanyac22/udacity-aws-mle-nd-project2-build-a-ml-workflow-for-scones-unlimited-on-amazon-sagemaker
The primary objective of this project was to build and deploy an image classification model for Scones Unlimited, a scone-delivery-focused logistics company, using AWS SageMaker.
aws aws-ec2 aws-iam aws-lambda aws-s3 aws-sagemaker aws-statemachine aws-step-functions deployment endpoint image-classification json lambda-functions model-evaluation model-inference model-testing python python3 sagemaker-deployment sagemaker-studio
Last synced: 09 May 2026
https://github.com/akrisanov/inference-engineering-journey
A personal journey into model inference engineering — learning, building, and sharing along the way.
inference-engineering jupyter-notebook learning-in-public machine-learning-journey model-inference personal-notes python
Last synced: 17 Apr 2026
https://github.com/itancio/churn
machine-learning model-inference python3 streamlit
Last synced: 02 May 2026
https://github.com/sayamalt/english-to-spanish-language-translation-using-seq2seq-and-attention
Built a Seq2Seq model with attention that translates English to Spanish with an accuracy of almost 97%.
attention-is-all-you-need attention-model bert-transformer exploratory-data-analysis fine-tuning-bert hugging-face-transformers language-translation luong-attention model-architecture-and-implementation model-inference model-training-and-evaluation natural-language-processing neural-machine-translation seq2seq-modeling text-generation text-preprocessing text-tokenization
Last synced: 09 Nov 2025
https://github.com/sayamalt/cyberbullying-classification-using-fine-tuned-distilbert
Fine-tuned a pretrained DistilBERT transformer model that classifies social media text into one of four cyberbullying labels (ethnicity/race, gender/sexual, religion, and not cyberbullying) with an accuracy of 99%.
cyberbullying-detection data-exploration distilbert-model exploratory-data-analysis fine-tune-bert-tensorflow llm model-inference model-training-and-evaluation multiclass-classification natural-language-processing text-classification text-preprocessing text-tokenization
Last synced: 09 Nov 2025
https://github.com/kwame-mintah/gcp-cloud-run-function-model-inference
A Cloud Run function that invokes a prediction against a machine learning model trained outside of a cloud provider.
cloud-functions fastapi gcp google-cloud-platform model-inference
Last synced: 02 May 2026
https://github.com/sayamalt/natural-scenes-image-classification-using-cnns
Built an image classification model in PyTorch that classifies images of natural scenes such as mountains, glaciers, forests, seas, streets, and buildings with an accuracy of 86%.
convolutional-neural-networks deep-learning fine-tuning-cnns flask-deployment image-classification image-transformations model-inference model-training-and-evaluation multiclass-classification pytorch resnet50-model torch-dataloader
Last synced: 25 Apr 2026
https://github.com/sayamalt/mental-health-classification-using-fine-tuned-distilbert
Built a multiclass text classification model by fine-tuning a pretrained DistilBERT transformer model to classify mental health statuses such as anxiety, stress, and personality disorder, with an accuracy of 77%.
data-visualization deep-learning distilbert-fine-tuning distilbert-model model-evaluation model-inference model-training-and-evaluation multiclass-text-classification natural-language-processing text-classification text-preprocessing text-tokenization
Last synced: 08 Oct 2025
https://github.com/sayamalt/luxury-apparel-product-category-classification-using-fine-tuned-distilbert
Developed a multiclass text classification model by fine-tuning a pretrained DistilBERT transformer model to classify luxury apparel items into their respective categories, such as pants, accessories, underwear, and shoes.
deep-learning distilbert-fine-tuning distilbert-model exploratory-data-analysis fine-tuning-bert model-inference model-training-and-evaluation multiclass-classification natural-language-processing text-classification text-preprocessing text-tokenization
Last synced: 08 Oct 2025
https://github.com/hnthap/cat-or-dog
An end-to-end AI application that classifies images as either a cat or a dog. The project uses OpenVINO Model Server, a Node.js backend, and a React-based frontend.
axios docker docker-compose express full-stack image-classification image-preprocessing javascript machine-learning model-deployment model-inference nodejs react rest-api self-host sharp typescript vite website
Last synced: 14 Apr 2026
https://github.com/sayamalt/financial-news-sentiment-analysis
Developed a fine-tuned DistilBERT transformer model that predicts the overall sentiment of a piece of financial news with an accuracy of nearly 81.5%.
data-exploration-and-preprocessing distilbert-model fine-tune-bert-tensorflow hugging-face-transformers model-architecture-and-implementation model-inference model-training-and-evaluation multiclass-classification natural-language-processing sentiment-analysis text-preprocessing text-tokenization
Last synced: 17 Oct 2025
https://github.com/sayamalt/wine-cultivator-classification-using-ann
Built an ANN model that classifies wine cultivators based on several characteristics of the wines.
artificial-neural-networks classification deep-learning model-inference model-training-and-evaluation multiclass-classification pytorch
Last synced: 15 May 2026
https://github.com/sayamalt/global-equity-forecasting-using-lstm
Built an LSTM model to forecast global equity based on more than 20 years of historical global equity data.
data-loaders deep-learning feature-scaling forecast-evaluation gradient-clipping lstm-neural-networks model-inference model-training-and-evaluation pytorch recurrent-neural-networks time-series-datasets time-series-forecasting xavier-initialization
Last synced: 02 May 2026
https://github.com/sayamalt/oral-disease-classification-using-cnn
Successfully developed an image classification model using PyTorch to classify two types of oral diseases, namely caries and gingivitis.
binary-classification convolutional-neural-networks data-loader deep-learning image-classification image-transformations model-inference model-training-and-evaluation pytorch
Last synced: 14 May 2026
https://github.com/gauravg-20/udacity-aws-mle-nd-project2-build-a-ml-workflow-for-scones-unlimited-on-amazon-sagemaker
The primary objective of this project was to build and deploy an image classification model for Scones Unlimited, a scone-delivery-focused logistics company, using AWS SageMaker.
aws aws-endpoint aws-lambda aws-s3 aws-sagemaker aws-state-machine json jupyter-notebook model-inference python sagemaker-deployment sagemaker-studio udacity-machine-learning-fundamentals udacity-nanodegree udacity-scholarship-course
Last synced: 12 Apr 2026
https://github.com/sayamalt/global-news-headlines-text-summarization
Built a text summarization model using Seq2Seq modeling with Luong attention that produces concise summaries of global news headlines.
attention-mechanism data-exploration-and-preprocessing luong-attention model-architecture-and-implementation model-inference natural-language-processing seq2seq-model text-generation text-summarization text-tokenization
Last synced: 09 Nov 2025
https://github.com/sayamalt/symptoms-disease-text-classification
Developed a fine-tuned BERT transformer model that classifies symptoms into their corresponding diseases with an accuracy of up to 89%.
bert-fine-tuning data-exploration-and-preprocessing exploratory-data-analysis fine-tune-bert-tensorflow hugging-face-transformers model-architecture-and-implementation model-inference model-training-and-evaluation multiclass-classification natural-language-processing text-classification text-preprocessing text-tokenization
Last synced: 09 Nov 2025