https://github.com/theleoborges/podcaster

Last synced: 10 months ago
JSON representation

Host: GitHub
URL: https://github.com/theleoborges/podcaster
Owner: theleoborges
Created: 2024-11-05T23:29:49.000Z (about 1 year ago)
Default Branch: main
Last Pushed: 2024-11-05T23:55:52.000Z (about 1 year ago)
Last Synced: 2025-01-22T13:46:45.435Z (12 months ago)
Language: Python
Size: 345 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Podcaster

[NotebookLM](https://notebooklm.google.com/) is amazing. If you haven't tried it out, do it. But what if you would like to generate a podcast from some private or confidential document? This is where this script may come in handy.

The idea is that you generate a podcast script based on some material, such as a PDF file, using an LLM such as LLaMa. You can use LLaMa on your local laptop via [LMStudio](https://lmstudio.ai/) or [Ollama](https://ollama.com/). This script then uses [bark](https://github.com/suno-ai/bark) to convert text to speech.

There is a sample script in this repo that was created from the included PDF file — an HBR article about hybrid work. You can create the podcast script using the following prompts in LMStudio (or other similar tool):

1. <>
2. User: Please extract the text from the attached PDF while preserving context.
3. User: Now turn this into a podcast transcript from the perspective of hosts Jane and Matt, tech and business journalists and AI enthusiasts.

Once you have the script, save it to `script.txt` and generate the audio file:

$ pip install -r requirements.txt
$ python main.py script.txt

This will run **slow** on your laptop but it will work. It took about 10-15min to generate a 2.5min podcast.
The generated audio file will be saved to `podcast.wav`.

Have fun!

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/theleoborges/podcaster

Awesome Lists containing this project

README