https://github.com/knowsuchagency/pdf-to-podcast
Convert any PDF into a podcast episode!
https://github.com/knowsuchagency/pdf-to-podcast
ai audio research utilities
Last synced: 7 months ago
JSON representation
Convert any PDF into a podcast episode!
- Host: GitHub
- URL: https://github.com/knowsuchagency/pdf-to-podcast
- Owner: knowsuchagency
- License: apache-2.0
- Created: 2024-06-09T04:28:05.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-03-17T21:29:38.000Z (9 months ago)
- Last Synced: 2025-05-13T20:16:42.020Z (7 months ago)
- Topics: ai, audio, research, utilities
- Language: Python
- Homepage: https://pdf-to-podcast.com
- Size: 4.22 MB
- Stars: 750
- Watchers: 16
- Forks: 341
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# PDF to Podcast
## Overview
This project provides a tool to convert any PDF document into a podcast episode! Using Google's Gemini for dialogue generation and OpenAI's text-to-speech models, this tool processes the content of a PDF, generates a natural dialogue suitable for an audio podcast, and outputs it as an MP3 file.
## Features
- **Convert PDF to Podcast:** Upload a PDF and convert its content into a podcast dialogue.
- **AI-Powered Dialogue:** Uses Google's Gemini LLM to create engaging, natural conversations.
- **High-Quality Audio:** Leverages OpenAI's text-to-speech for lifelike voices.
- **User-friendly Interface:** Simple interface using Gradio for easy interaction.
## Installation
To set up the project, follow these steps:
1. **Clone the repository:**
```bash
git clone https://github.com/knowsuchagency/pdf-to-podcast.git
cd pdf-to-podcast
```
2. **Install dependencies:**
```bash
uv sync
```
## Usage
1. **Set up API Key(s):**
You'll need an api key for OpenAI which you can either pass through the interface or set as the `OPENAI_API_KEY` environment variable.
2. **Run the application:**
```bash
python main.py
```
This will launch a Gradio interface in your web browser.
3. **Upload a PDF:**
Upload the PDF document you want to convert into a podcast.
4. **Enter OpenAI API Key:**
Provide your OpenAI API key in the designated textbox.
5. **Generate Audio:**
Click the button to start the conversion process. The output will be an MP3 file containing the podcast dialogue.
## License
This project is licensed under the Apache 2.0 License. See the [LICENSE](LICENSE) file for more information.