{"id":28170114,"url":"https://github.com/oracle-quickstart/oci-ai-blueprints","last_synced_at":"2026-03-18T02:34:02.083Z","repository":{"id":269104858,"uuid":"883420881","full_name":"oracle-quickstart/oci-ai-blueprints","owner":"oracle-quickstart","description":"Deploy, manage and monitor Gen AI workloads in minutes in your own tenancy and GPU infrastructure resources.","archived":false,"fork":false,"pushed_at":"2026-02-24T04:04:09.000Z","size":63140,"stargazers_count":45,"open_issues_count":17,"forks_count":17,"subscribers_count":6,"default_branch":"main","last_synced_at":"2026-02-24T09:52:03.853Z","etag":null,"topics":["ai","applied-ai","generative-ai","gpu-computing","kubernetes","oracle-cloud"],"latest_commit_sha":null,"homepage":"","language":"HCL","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"upl-1.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/oracle-quickstart.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":"CONTRIBUTING.md","funding":null,"license":"LICENSE.txt","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":"SECURITY.md","support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2024-11-04T23:50:46.000Z","updated_at":"2026-02-24T01:04:42.000Z","dependencies_parsed_at":"2025-01-09T20:25:00.426Z","dependency_job_id":"a3efa4e8-4a8c-4e5b-a707-314f3adf411e","html_url":"https://github.com/oracle-quickstart/oci-ai-blueprints","commit_stats":null,"previous_names":["oracle-quickstart/oci-corrino-oke-ai-ml-toolkit","oracle-quickstart/oci-ai-blueprints"],"tags_count":35,"template":false,"template_full_name":"oracle-quickstart/oci-quickstart-template","purl":"pkg:github/oracle-quickstart/oci-ai-blueprints","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/oracle-quickstart%2Foci-ai-blueprints","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/oracle-quickstart%2Foci-ai-blueprints/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/oracle-quickstart%2Foci-ai-blueprints/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/oracle-quickstart%2Foci-ai-blueprints/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/oracle-quickstart","download_url":"https://codeload.github.com/oracle-quickstart/oci-ai-blueprints/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/oracle-quickstart%2Foci-ai-blueprints/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":30642995,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-03-18T01:41:58.583Z","status":"online","status_checked_at":"2026-03-18T02:00:07.824Z","response_time":104,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["ai","applied-ai","generative-ai","gpu-computing","kubernetes","oracle-cloud"],"created_at":"2025-05-15T17:16:55.186Z","updated_at":"2026-03-18T02:34:02.048Z","avatar_url":"https://github.com/oracle-quickstart.png","language":"HCL","funding_links":[],"categories":[],"sub_categories":[],"readme":"# OCI AI Blueprints\n\n**Deploy, scale, and monitor AI workloads with the OCI AI Blueprints platform, and reduce your GPU onboarding time from weeks to minutes.**\n\nOCI AI Blueprints is a streamlined, no-code solution for deploying and managing Generative AI workloads on Kubernetes Engine (OKE). By providing opinionated hardware recommendations, pre-packaged software stacks, and out-of-the-box observability tooling, OCI AI Blueprints helps you get your AI applications running quickly and efficiently—without wrestling with the complexities of infrastructure decisions, software compatibility, and MLOps best practices.\n\n[![Install OCI AI Blueprints](https://raw.githubusercontent.com/oracle-quickstart/oci-ai-blueprints/refs/heads/main/docs/images/install.svg)](./GETTING_STARTED_README.md)\n\n## Table of Contents\n\n**Getting Started**\n\n- [Install AI Blueprints](./GETTING_STARTED_README.md)\n- [Access AI Blueprints Portal and API](docs/usage_guide.md)\n\n**About OCI AI Blueprints**\n\n- [What is OCI AI Blueprints?](docs/about.md)\n- [Why use OCI AI Blueprints?](docs/about.md)\n- [Features](docs/about.md)\n- [List of Blueprints](#blueprints)\n- [FAQ](docs/about.md)\n- [Support \u0026 Contact](https://github.com/oracle-quickstart/oci-ai-blueprints/blob/vkammari/doc_improvements/docs/about/README.md#frequently-asked-questions-faq)\n\n**API Reference**\n\n- [API Reference Documentation](docs/api_documentation.md)\n\n**Additional Resources**\n\n- [Publish Custom Blueprints](./docs/custom_blueprints)\n- [Installing Updates](docs/installing_new_updates.md)\n- [IAM Policies](docs/iam_policies.md)\n- [Repository Contents](docs/about.md)\n- [Known Issues](docs/known_issues.md)\n\n## Getting Started\n\nInstall OCI AI Blueprints by clicking on the button below:\n\n[![Install OCI AI Blueprints](https://raw.githubusercontent.com/oracle-quickstart/oci-ai-blueprints/refs/heads/main/docs/images/install.svg)](./GETTING_STARTED_README.md)\n\n## Blueprints\n\nBlueprints go beyond basic Terraform templates. Each blueprint:\n\n- Offers validated hardware suggestions (e.g., optimal shapes, CPU/GPU configurations),\n- Includes end-to-end application stacks customized for different GenAI use cases, and\n- Comes with monitoring, logging, and auto-scaling configured out of the box.\n\nAfter you install OCI AI Blueprints to an OKE cluster in your tenancy, you can deploy these pre-built blueprints:\n\n| Blueprint                                                                                     | Description                                                                                                                                     |\n| --------------------------------------------------------------------------------------------- | ----------------------------------------------------------------------------------------------------------------------------------------------- |\n| [**LLM \u0026 VLM Inference with vLLM**](docs/sample_blueprints/model_serving/llm_inference_with_vllm/README.md) | Deploy Llama 2/3/3.1 7B/8B models using NVIDIA GPU shapes and the vLLM inference engine with auto-scaling.                                      |\n| [**Llama Stack**](docs/sample_blueprints/partner_blueprints/llama-stack/README.md)                                       | Complete GenAI runtime with vLLM, ChromaDB, Postgres, and Jaeger for production deployments with unified API for inference, RAG, and telemetry. |\n| [**Fine-Tuning Benchmarking**](docs/sample_blueprints/gpu_benchmarking/lora-benchmarking/README.md)                    | Run MLCommons quantized Llama-2 70B LoRA finetuning on A100 for performance benchmarking.                                                       |\n| [**LoRA Fine-Tuning**](docs/sample_blueprints/model_fine_tuning/lora-fine-tuning/README.md)                             | LoRA fine-tuning of custom or HuggingFace models using any dataset. Includes flexible hyperparameter tuning.                                    |\n| [**GPU Performance Benchmarking**](docs/sample_blueprints/gpu_health_check/gpu-health-check/README.md)                 | Comprehensive evaluation of GPU performance to ensure optimal hardware readiness before initiating any intensive computational workload.        |\n| [**CPU Inference**](docs/sample_blueprints/model_serving/cpu-inference/README.md)                                   | Leverage Ollama to test CPU-based inference with models like Mistral, Gemma, and more.                                                          |\n| [**Multi-node Inference with RDMA and vLLM**](docs/sample_blueprints/model_serving/multi-node-inference/README.md) | Deploy Llama-405B sized LLMs across multiple nodes with RDMA using H100 nodes with vLLM and LeaderWorkerSet.                                    |\n| [**Autoscaling Inference with vLLM**](docs/sample_blueprints/model_serving/auto_scaling/README.md)                 | Serve LLMs with auto-scaling using KEDA, which scales to multiple GPUs and nodes using application metrics like inference latency.              |\n| [**LLM Inference with MIG**](docs/sample_blueprints/model_serving/mig_multi_instance_gpu/README.md)                | Deploy LLMs to a fraction of a GPU with Nvidia’s multi-instance GPUs and serve them with vLLM.                                                  |\n| [**Job Queuing**](docs/sample_blueprints/platform_features/teams/README.md)                                             | Take advantage of job queuing and enforce resource quotas and fair sharing between teams.                                                       |\n\n## Support \u0026 Contact\n\nIf you have any questions, issues, or feedback, contact [vishnu.kammari@oracle.com](mailto:vishnu.kammari@oracle.com) or [grant.neuman@oracle.com](mailto:grant.neuman@oracle.com).\n\n## Contributing\n\nThis project welcomes contributions from the community. Before submitting a pull request, please [review our contribution guide](./CONTRIBUTING.md)\n\n## Security\n\nPlease consult the [security guide](./SECURITY.md) for our responsible security vulnerability disclosure process\n\n## License\n\nCopyright (c) 2024, 2025 Oracle and/or its affiliates.\n\nReleased under the Universal Permissive License v1.0 as shown at\n\u003chttps://oss.oracle.com/licenses/upl/\u003e.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Foracle-quickstart%2Foci-ai-blueprints","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Foracle-quickstart%2Foci-ai-blueprints","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Foracle-quickstart%2Foci-ai-blueprints/lists"}