Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/knowsuchagency/pdf-to-podcast
Convert any PDF into a podcast episode!
https://github.com/knowsuchagency/pdf-to-podcast
ai audio research utilities
Last synced: 2 days ago
JSON representation
Convert any PDF into a podcast episode!
- Host: GitHub
- URL: https://github.com/knowsuchagency/pdf-to-podcast
- Owner: knowsuchagency
- License: apache-2.0
- Created: 2024-06-09T04:28:05.000Z (8 months ago)
- Default Branch: main
- Last Pushed: 2024-11-15T18:07:09.000Z (2 months ago)
- Last Synced: 2025-01-18T02:01:55.742Z (9 days ago)
- Topics: ai, audio, research, utilities
- Language: Python
- Homepage: https://pdf-to-podcast.com
- Size: 4.22 MB
- Stars: 654
- Watchers: 13
- Forks: 280
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# PDF to Podcast
## Overview
This project provides a tool to convert any PDF document into a podcast episode! Using Google's Gemini for dialogue generation and OpenAI's text-to-speech models, this tool processes the content of a PDF, generates a natural dialogue suitable for an audio podcast, and outputs it as an MP3 file.
## Features
- **Convert PDF to Podcast:** Upload a PDF and convert its content into a podcast dialogue.
- **AI-Powered Dialogue:** Uses Google's Gemini LLM to create engaging, natural conversations.
- **High-Quality Audio:** Leverages OpenAI's text-to-speech for lifelike voices.
- **User-friendly Interface:** Simple interface using Gradio for easy interaction.## Installation
To set up the project, follow these steps:
1. **Clone the repository:**
```bash
git clone https://github.com/knowsuchagency/pdf-to-podcast.git
cd pdf-to-podcast
```2. **Install dependencies:**
```bash
uv sync
```## Usage
1. **Set up API Key(s):**
You'll need an api key for OpenAI which you can either pass through the interface or set as the `OPENAI_API_KEY` environment variable.
2. **Run the application:**
```bash
python main.py
```
This will launch a Gradio interface in your web browser.3. **Upload a PDF:**
Upload the PDF document you want to convert into a podcast.4. **Enter OpenAI API Key:**
Provide your OpenAI API key in the designated textbox.5. **Generate Audio:**
Click the button to start the conversion process. The output will be an MP3 file containing the podcast dialogue.## License
This project is licensed under the Apache 2.0 License. See the [LICENSE](LICENSE) file for more information.