An open API service indexing awesome lists of open source software.

https://github.com/nirantk/rag-to-riches


https://github.com/nirantk/rag-to-riches

evals rag search

Last synced: about 2 months ago
JSON representation

Awesome Lists containing this project

README

        

# ๐Ÿš€ RAG to Riches

> A comprehensive course on building production-ready RAG (Retrieval Augmented Generation) systems

[![Course Link](https://img.shields.io/badge/Course-Maven-blue)](https://maven.com/nirantk/rag-to-riches)
[![Website](https://img.shields.io/badge/Website-nirantk.com-green)](https://nirantk.com)

This repository contains all the code and datasets used in the [Search for RAG](https://maven.com/nirantk/rag-to-riches) course. Guest speakers are encouraged to contribute their code, notebooks, and datasets by raising a PR to the respective folders.

## ๐Ÿ“š Course Curriculum

### Module 1: Foundations of RAG
- **01 RAG Evals** ๐Ÿ”
- Understanding RAG metrics and evaluation frameworks
- Setting up evaluation pipelines
- Best practices for RAG testing

- **02 Query Understanding** ๐Ÿ’ญ
- Query analysis techniques
- Query expansion and reformulation
- Handling different query types and intents

- **03 Jerry Liu** ๐Ÿ—๏ธ ![Guest Lecture](https://img.shields.io/badge/Guest-Lecture-purple)
- Hybrid search approaches
- Multi-stage retrieval
- Custom retrievers and rankers

### Module 2: Advanced RAG Techniques
- **04 Ofer** โšก ![Guest Lecture](https://img.shields.io/badge/Guest-Lecture-purple)
- Performance optimization strategies
- Caching and indexing techniques
- Scaling RAG systems

- **05 Automatic Prompting** ๐Ÿค–
- Dynamic prompt generation
- Prompt optimization techniques
- Automated prompt testing

- **06 Working with Complex Docs** ๐Ÿ“„
- Handling structured and unstructured documents
- Document chunking strategies
- Multi-modal document processing

### Module 3: Industry Applications
- **07 Aditya Gushwork** ๐Ÿข ![Guest Lecture](https://img.shields.io/badge/Guest-Lecture-purple)
- Enterprise-grade RAG implementations
- Security and compliance considerations
- Integration patterns

- **08 John Gilhuly** ๐Ÿš€ ![Guest Lecture](https://img.shields.io/badge/Guest-Lecture-purple)
- Deployment strategies
- Monitoring and observability
- Production best practices

### Module 4: Advanced Topics
- **09 Neural IR** ๐Ÿง 
- Neural search architectures
- Dense retrievers
- Cross-encoders and bi-encoders

- **10 Testset Generation** ๐Ÿงช
- Synthetic data generation
- Test set validation
- Quality assurance techniques

- **11 Embedding Models** ๐Ÿ”ค
- Understanding embedding spaces
- Model selection and fine-tuning
- Multi-modal embeddings

### Module 5: Optimization and Tricks
- **12 Vectorsearch Tricks** ๐ŸŽฏ
- Advanced indexing techniques
- Query optimization
- Performance tuning

- **13 Shreya Shankar** ๐Ÿ—๏ธ ![Guest Lecture](https://img.shields.io/badge/Guest-Lecture-purple)
- System architecture patterns
- Scalability considerations
- Error handling and recovery

### Module 6: Specialized Applications
- **14 Atita Arora** ๐ŸŽฏ ![Guest Lecture](https://img.shields.io/badge/Guest-Lecture-purple)
- Industry-specific implementations
- Custom knowledge bases
- Specialized retrieval techniques

- **15 Text Profiling** ๐Ÿ“Š
- Text classification
- Content analysis
- Metadata extraction

- **16 Alberto Romero** ๐Ÿ”ฎ ![Guest Lecture](https://img.shields.io/badge/Guest-Lecture-purple)
- Emerging trends
- Research directions
- Future applications

### Additional Resources
- **Lab01 Finance Bench** ๐Ÿ’ฐ
- Finance-specific RAG implementations
- **Office Hours** ๐ŸŽฅ
- Recordings and materials from office hours

---


Created with โค๏ธ by Nirant Kasliwal