{"id":21672382,"url":"https://github.com/redis-developer/gcp-redis-llm-stack","last_synced_at":"2025-07-17T06:04:07.128Z","repository":{"id":177624253,"uuid":"660320081","full_name":"redis-developer/gcp-redis-llm-stack","owner":"redis-developer","description":"Reference architecture for LLM-based applications on Google Cloud Platform with Redis Enterprise as a high-performance data layer.","archived":false,"fork":false,"pushed_at":"2024-12-11T16:39:43.000Z","size":3405,"stargazers_count":32,"open_issues_count":1,"forks_count":12,"subscribers_count":6,"default_branch":"main","last_synced_at":"2025-04-12T03:53:14.402Z","etag":null,"topics":["caching","chatbot","gcp","google-cloud","llms","memory","palm-api","redis","redis-enterprise","vector-database","vertex-ai"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/redis-developer.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2023-06-29T18:37:32.000Z","updated_at":"2025-04-07T03:50:38.000Z","dependencies_parsed_at":null,"dependency_job_id":"4b27cd9f-235e-4bd1-9be7-edc7c956f6c4","html_url":"https://github.com/redis-developer/gcp-redis-llm-stack","commit_stats":{"total_commits":14,"total_committers":2,"mean_commits":7.0,"dds":0.4285714285714286,"last_synced_commit":"be8e9bb75a78058a0b0a651da5056a1a9af3b344"},"previous_names":["redisventures/redis-google-llms","redis-developer/gcp-redis-llm-stack","redisventures/gcp-redis-llm-stack"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:gith
ub/redis-developer/gcp-redis-llm-stack","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/redis-developer%2Fgcp-redis-llm-stack","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/redis-developer%2Fgcp-redis-llm-stack/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/redis-developer%2Fgcp-redis-llm-stack/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/redis-developer%2Fgcp-redis-llm-stack/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/redis-developer","download_url":"https://codeload.github.com/redis-developer/gcp-redis-llm-stack/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/redis-developer%2Fgcp-redis-llm-stack/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":265571061,"owners_count":23790015,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["caching","chatbot","gcp","google-cloud","llms","memory","palm-api","redis","redis-enterprise","vector-database","vertex-ai"],"created_at":"2024-11-25T13:29:11.494Z","updated_at":"2025-07-17T06:04:07.121Z","avatar_url":"https://github.com/redis-developer.png","language":"Jupyter Notebook","readme":"# Scalable LLM Architectures with Redis \u0026 GCP Vertex AI\n\n☁️ [Generative AI](https://cloud.google.com/vertex-ai/generative-ai/docs/learn/overview) with Google Vertex AI comes with a specialized [in-console studio 
experience](https://cloud.google.com/vertex-ai/docs/generative-ai/start/quickstarts/quickstart), a [dedicated API for Gemini](https://cloud.google.com/vertex-ai/docs/generative-ai/start/quickstarts/api-quickstart), and an easy-to-use [Python SDK](https://cloud.google.com/vertex-ai/docs/python-sdk/use-vertex-ai-python-sdk) designed for deploying and managing instances of Google's powerful language models.\n\n⚡ Redis Enterprise offers fast and scalable [vector search](https://redis.io/solutions/vector-search/), with an API for index creation, management, blazing-fast search, and hybrid filtering. When coupled with its [versatile data structures](https://redis.io/docs/latest/develop/data-types/), Redis Enterprise shines as the optimal solution for building high-quality Large Language Model (LLM) apps.\n\n\u003eThis repo serves as a foundational architecture for building LLM applications with Redis and GCP services.\n\n## Reference architecture\n\n![](assets/Redis_GCN_GenAI_20240325.png)\n\n1. Primary Data Sources\n2. Data Extraction and Loading\n3. Large Language Models\n    - `text-embedding-gecko@003` for embeddings\n    - `gemini-1.5-flash-001` for LLM generation and chat\n4. 
High-Performance Data Layer (Redis)\n    - Semantic caching to improve LLM performance and reduce associated costs\n    - Vector search for context retrieval from knowledge base\n\n\n## RAG + Semantic Caching demo\n\u003ca href=\"https://github.com/redis-developer/redis-ai-resources/blob/main/python-recipes/semantic-cache/00_semantic_caching_gemini.ipynb\" target=\"_parent\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e\n\n**Open the code tutorial using the Colab notebook to get your hands dirty with Redis and Vertex AI on GCP.**\n\n\n## Additional resources\n- [Streamlit PDF chatbot example app](examples/chat-your-pdf/)\n- [Redis vector search documentation](https://redis.io/docs/latest/develop/interact/search-and-query/query/vector-search/)\n- [Get started with RedisVL](https://redis.io/blog/introducing-the-redis-vector-library-for-enhancing-genai-development/)\n- [Google Vertex AI resources](https://cloud.google.com/vertex-ai)\n- [More Redis AI resources](https://github.com/redis-developer)","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fredis-developer%2Fgcp-redis-llm-stack","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fredis-developer%2Fgcp-redis-llm-stack","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fredis-developer%2Fgcp-redis-llm-stack/lists"}