https://github.com/liddiard/harmontown-search
Search all transcripts from the Harmontown podcast. Transcription powered by OpenAI's Whisper model. Search powered by Typesense.
https://github.com/liddiard/harmontown-search
fusejs nextjs openai-whisper podcast react search search-engine typesense
Last synced: about 2 months ago
JSON representation
Search all transcripts from the Harmontown podcast. Transcription powered by OpenAI's Whisper model. Search powered by Typesense.
- Host: GitHub
- URL: https://github.com/liddiard/harmontown-search
- Owner: liddiard
- Created: 2023-07-08T00:59:44.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2025-06-15T13:49:06.000Z (4 months ago)
- Last Synced: 2025-06-15T14:57:39.768Z (4 months ago)
- Topics: fusejs, nextjs, openai-whisper, podcast, react, search, search-engine, typesense
- Language: TypeScript
- Homepage: https://harmonsearch.com
- Size: 31 MB
- Stars: 13
- Watchers: 1
- Forks: 2
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# 🎤 [Harmontown podcast search](https://harmonsearch.com/)
Suite of scripts to download, transcribe, and search-index episodes of Harmontown, along with a [web application](https://harmonsearch.com) to search and play the episodes.
Background: [_Harmontown_](https://en.wikipedia.org/wiki/Harmontown) was a 2012–2019 podcast hosted by Dan Harmon, creator of _Community_ and _Rick and Morty_. Through its 360-episode run as a live show in Los Angeles, often featuring celebrity guests, the podcast gained a loyal fanbase and was profiled in an eponymous [documentary](https://www.imdb.com/title/tt3518988/).
See the [About page](https://harmonsearch.com/about/) for more on the motivation behind building this.
## Structure
See READMEs in each directory listed below for more details.
### [`episodes/`](/episodes)
Tools for downloading and renaming audio and video files from the podcast. Required for local transcription and media hosting in development on the frontend web application.
### [`transcripts/`](/transcripts)
Tools for transcribing downloaded episodes and uploading transcripts to a [Typesense](https://typesense.org/) server for indexing and search.
### [`frontend/`](/frontend)
A Next.js + TypeScript web application for searching and listening to and watching the episodes. Intended for [static export](https://nextjs.org/docs/app/building-your-application/deploying/static-exports) and hosting on a cloud object storage service like [Amazon S3](https://aws.amazon.com/s3/).