https://github.com/genmech/rag
Retrieval-Augmented Generation (RAG) is a powerful and popular technique that applies specialized knowledge to large language models (LLMs).
- Host: GitHub
- URL: https://github.com/genmech/rag
- Owner: GenMech
- Created: 2024-06-25T10:08:31.000Z (12 months ago)
- Default Branch: master
- Last Pushed: 2024-06-25T10:12:50.000Z (12 months ago)
- Last Synced: 2025-01-21T12:11:32.799Z (5 months ago)
- Language: Jupyter Notebook
- Size: 4.88 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# Retrieval-Augmented Generation (RAG)
It is a powerful and popular technique that applies specialized knowledge to large language models (LLMs). However, traditional RAG methods tend to produce increasingly long prompts, sometimes exceeding 40k tokens, which can result in high financial and latency costs. Moreover, the decreased information density within such prompts can degrade LLM performance, as in the "lost in the middle" issue.

# Tools Used
- Pinecone
- GroqCloud
- Cohere
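
To make the prompt-length concern concrete, here is a minimal, standard-library-only sketch of the retrieval step in a RAG pipeline. It ranks text chunks against a query and keeps only those that fit a token budget, so the prompt stays short and dense. Everything in it is an assumption for illustration: the names `tokenize`, `cosine`, `retrieve`, `build_prompt`, and `CHUNKS`, the bag-of-words scoring, and the word-count "token" estimate are hypothetical and are not taken from this repository's notebook. In a real pipeline, an embedding model, a vector database, and a hosted LLM (such as the tools listed above) would presumably replace these toy pieces.

```python
import math
import re
from collections import Counter

# Toy corpus (hypothetical); a real pipeline would store embeddings in a vector database.
CHUNKS = [
    "Pinecone is a managed vector database for similarity search.",
    "Cohere provides embedding and reranking models.",
    "GroqCloud serves large language models with low latency.",
    "Bananas are rich in potassium.",
]


def tokenize(text):
    """Lowercase word tokenizer; a crude stand-in for a real model tokenizer."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, chunks, top_k=2, token_budget=50):
    """Return up to top_k best-matching chunks whose total size fits token_budget."""
    q = Counter(tokenize(query))
    ranked = sorted(chunks, key=lambda c: cosine(q, Counter(tokenize(c))), reverse=True)
    selected, used = [], 0
    for chunk in ranked[:top_k]:
        cost = len(tokenize(chunk))  # word count as a rough token estimate
        if used + cost > token_budget:
            continue  # skip chunks that would push the prompt over budget
        selected.append(chunk)
        used += cost
    return selected


def build_prompt(query, context_chunks):
    """Assemble a compact prompt from the retrieved context and the question."""
    context = "\n".join(f"- {c}" for c in context_chunks)
    return f"Answer using only the context below.\n\nContext:\n{context}\n\nQuestion: {query}"


if __name__ == "__main__":
    question = "Which vector database supports similarity search?"
    print(build_prompt(question, retrieve(question, CHUNKS, top_k=1)))
```

Capping both `top_k` and `token_budget` is one simple way to limit cost and latency. It does not by itself address the "lost in the middle" issue, which depends on where relevant passages sit within the prompt.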