An open API service indexing awesome lists of open source software.

https://github.com/tonykipkemboi/research-paper-to-podcast

Yet Another Deep Dive Clone (YADDC): An automated system that transforms academic research papers into engaging podcast conversations using CrewAI and ElevenLabs.
https://github.com/tonykipkemboi/research-paper-to-podcast

ai aiagents crewai deepdiveclone elevenlabs podcast

Last synced: 6 months ago
JSON representation

Yet Another Deep Dive Clone (YADDC): An automated system that transforms academic research papers into engaging podcast conversations using CrewAI and ElevenLabs.

Awesome Lists containing this project

README

          

# AI Podcast Generator

An automated podcast generation system using [CrewAI](https://crewai.com/) and [ElevenLabs](https://elevenlabs.io/). The system reads research papers and creates engaging, conversational podcasts with natural-sounding voices.

## 🎧 Latest AI Research Paper Podcast

[![Spotify](https://img.shields.io/badge/Listen_on-Spotify-1DB954?style=for-the-badge&logo=spotify&logoColor=white)](https://open.spotify.com/episode/1PexEFSrxDyTiy9mcBt9D3?si=FM1_sDcPQ52ifrnmyOLLkw)

Latest Episode: "Long Context vs. RAG for LLMs"
- 🔍 Deep dive into the performance comparison
- 🧠 Analysis of memory vs retrieval approaches
- 💡 Real-world implications and use cases

Generated using AI agents for research analysis, script writing, and voice synthesis.

## Features
- Research paper analysis and summarization
- Natural conversational script generation
- Enhanced script refinement for engagement
- High-quality voice synthesis using ElevenLabs
- Professional audio mixing and processing

## Setup
1. Clone this repository
2. Install requirements:
```bash
pip install -r requirements.txt
```
3. Create a `.env` file with your API keys:
```
OPENAI_API_KEY=your_key_here
ELEVENLABS_API_KEY=your_key_here
ANTHROPIC_API_KEY=your_key_here
```

## Usage
1. Place your research paper in the `knowledge/` directory or use the one provided in the repository
2. Run the agents script:
```bash
python podcast_generator.py
```
3. Find outputs in the `outputs/` directory:
- Generated scripts
- Audio segments
- Final podcast

## Configuration
- Voice settings can be adjusted in `tools.py`
- Agent behaviors configured in `podcast_generator.py`

## TODO
- [ ] Add Docling to perform RAG on various file types
- [ ] Add multilingual support
- [ ] Add search functionality to reaserch supporting materials for the podcast

## Acknowledgements
- [CrewAI](https://crewai.com/)
- [ElevenLabs](https://elevenlabs.io/)