https://github.com/bugbakery/transcribee
open source audio and video transcription software
https://github.com/bugbakery/transcribee
collaborative speech-to-text transcription
Last synced: 5 months ago
JSON representation
open source audio and video transcription software
- Host: GitHub
- URL: https://github.com/bugbakery/transcribee
- Owner: bugbakery
- License: agpl-3.0
- Created: 2023-03-02T13:23:42.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2025-03-29T14:15:37.000Z (6 months ago)
- Last Synced: 2025-04-19T04:42:48.874Z (6 months ago)
- Topics: collaborative, speech-to-text, transcription
- Language: TypeScript
- Homepage: https://transcribee.net
- Size: 5.25 MB
- Stars: 392
- Watchers: 16
- Forks: 27
- Open Issues: 99
-
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
Awesome Lists containing this project
README
![]()
#
π€ transcribee βοΈ> **
[going to be] an open source audio- and videotranscription software**> **Note**:
>
> Currently, transcribee is heavily work-in-progress and not yet ready for production use. Please come back in a few weeks / months.`transcribee π` aims to make the workflow for media transcription easier, faster and more accessible.
- It can **automatically generate a draft transcript** of your audio
- It allows you to **quickly improve** the automatic draft and fix any errors
- It's **collaborative** β split the work with your friends or colleagues
- It's **open-source**## Develop!
To get started with developing or to try the current state of transcribee, follow the instructions
in the [development setup document](doc/development_setup.md).## How does it work?
> **Note**:
>
> We're heavily working on transcribee. Not all steps described here are already implemented.Creating a transcript with transcribee π is done with the following steps:
1. Import your media file
During import, your audio file is automatically converted to text using state-of-the-art models[^models].
transcribee π also automatically detects different speakers in your file.2. Manually improve the transcript
After the automatic transcript is created, you can edit it to correct any mistakes the automatic transcription made.[^editor]
You can also name the speakers.Since transcribee π is a collaborative software, you can do this step (and all other manual steps) together with others.
All changes are instantly synced with everyone working on the transcript.3. Automatic re-alignment
To make sure that the timestamps of your corrected text are still correct, transcribee π matches this text back up with the audio.
4. Manual re-alignment
Now you can check the automatically generated timestamps and correct them.
5. Export
Once you are happy with the transcript, you can export it.
[^models]: At the moment we use whisper.cpp for transcription, Wav2Vec2 for realignment and speechbrain for speaker identification.
[^editor]: The editor is based on slate with collaboration using the automerge CRDT.## Acknowledgements
- Funded from March 2023 until September 2023 by
