https://github.com/sukso96100/transcribe-tools
https://github.com/sukso96100/transcribe-tools
Last synced: 3 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/sukso96100/transcribe-tools
- Owner: sukso96100
- Created: 2021-06-07T15:37:27.000Z (about 4 years ago)
- Default Branch: main
- Last Pushed: 2021-06-08T16:26:26.000Z (about 4 years ago)
- Last Synced: 2025-01-18T02:44:00.843Z (5 months ago)
- Language: Python
- Size: 592 KB
- Stars: 1
- Watchers: 3
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# transcribe-tools
Set of python scripts and GitHub Actions workflow to automate video transcrib batch job
## Updating videos.json
Extract `*.flac` audio from video using following ffmpeg command
```bash
ffmpeg -y -i input.mp4 -ac 1 output.flac
```Then upload the flace file on Google cloud storage bucket for audio file input.
Add audio and transcribe info in `include` array in `videos.json`- `input`: Name of audio file
- `from`: Audio source language code (e.g. ko, en, zh, zh-CN, zh-TW)
- `to`: Destination languages with comma seperated value. (e.g. `ko,zh` for translating into Korean and Chinese)```json
{
"include": [
{
"input": "audio_file_name.flac",
"from": "ko",
"to": "en"
},
...
]
}
```Commit and push update, Github Action will run subtitle batch job
## Credits
- [GoogleCloudPlatform/community](https://github.com/GoogleCloudPlatform/community)