{"id":18798998,"url":"https://github.com/cncf/llm-in-action","last_synced_at":"2025-04-13T17:22:42.020Z","repository":{"id":228574315,"uuid":"768866522","full_name":"cncf/llm-in-action","owner":"cncf","description":"🤖 Discover how to apply your LLM app skills on Kubernetes!","archived":false,"fork":false,"pushed_at":"2024-03-08T21:47:45.000Z","size":59,"stargazers_count":143,"open_issues_count":0,"forks_count":8,"subscribers_count":7,"default_branch":"main","last_synced_at":"2025-03-27T08:11:36.140Z","etag":null,"topics":["cloudnative","inference","llm"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/cncf.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null}},"created_at":"2024-03-07T22:03:04.000Z","updated_at":"2025-02-26T15:04:41.000Z","dependencies_parsed_at":"2024-03-19T13:58:10.596Z","dependency_job_id":null,"html_url":"https://github.com/cncf/llm-in-action","commit_stats":null,"previous_names":["cncf/llm-in-action"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cncf%2Fllm-in-action","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cncf%2Fllm-in-action/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cncf%2Fllm-in-action/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cncf%2Fllm-in-action/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/cncf","download_url":"https://codeload.github.com/cncf/llm-in-action/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248750870,"owners_count":21155797,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["cloudnative","inference","llm"],"created_at":"2024-11-07T22:13:42.299Z","updated_at":"2025-04-13T17:22:41.994Z","avatar_url":"https://github.com/cncf.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# LLMs in Action: A Cloud Native Story\n\n## Prerequisites\n\n- [Docker](https://docs.docker.com/install/)\n  - Docker is a platform for developers and sysadmins to develop, ship, and run applications. Docker enables you to separate your applications from your infrastructure so you can deliver software quickly.\n- [Ollama](https://ollama.com/)\n  - Ollama is a Language Model as a Service (LMaaS) that provides a RESTful API for interacting with large language models. It's a great way to get started with LLMs without having to worry about the infrastructure.\n- [kind](https://kind.sigs.k8s.io/)\n  - kind is \"Kubernetes in Docker,\" used by the Kubernetes project to help test features and run integration tests. It turns out it's a handy way for anyone to spin up a cluster quickly. Big thank you to @bentheelder for developing it 👏🏼 👏🏼\n- [kubectl](https://kubernetes.io/docs/tasks/tools/install-kubectl/)\n  - kubectl is how you interact with k8s using the command line. This allows you to become a k8s whisperer :-)\n\nWith Ollama installed on your machine, you will need to pull the LLaVa model by running\n\n```sh\nollama pull llava\n```\n\nYou can verify that the model is installed by running\n\n```sh\nollama list\n\nNAME            ID           SIZE   MODIFIED\nllava:latest    8dd30f6b0cb1 4.7 GB 17 seconds ago \n```\n\n## Startup\n\nWe have crafted a few scripts to make this demo run as quickly as possible on your machine once you've installed the prerequisites.\n\nThis script will:\n\n- Create a kind cluster\n- Apply the Kubernetes manifests we need for our demo\n- Use port-forwarding to help us access our service in the browser so we can take photos and describe them with LLaVa (Large Language and Vision Assistant)\n\n```sh\n./startup.sh\n```\n\nTo access the service, open your browser and navigate to [http://localhost:8501](http://localhost:8501)\n\n## Shutdown\n\nTo shut down the demo, run the following command, which will:\n\n- Remove the Kubernetes manifests\n- Remove the port-forwarding\n- Delete the kind cluster\n\n```sh\n./shutdown.sh\n```\n\n## Operating System Information\n\nThis demo has been tested on the following operating systems and will work if you have the prerequisites installed.\n\n- macOS\n- Linux\n- Windows\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcncf%2Fllm-in-action","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fcncf%2Fllm-in-action","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcncf%2Fllm-in-action/lists"}