An open API service indexing awesome lists of open source software.

https://github.com/kingabzpro/deploying-llama-3.3-70b

Serve Llama 3.3 70B (with AWQ quantization) using vLLM and deploy it on BentoCloud.
https://github.com/kingabzpro/deploying-llama-3.3-70b

bentocloud bentoml cloud fastapi llama3-3 mlops vllm

Last synced: about 2 months ago
JSON representation

Serve Llama 3.3 70B (with AWQ quantization) using vLLM and deploy it on BentoCloud.

Awesome Lists containing this project