https://github.com/tuzibr/Real_time_caption_translate
A real-time caption translation tool based on VOSK speech recognition and machine translation, which supports transcribing audio into target language subtitles in real time and displaying the translated content.
- Host: GitHub
- URL: https://github.com/tuzibr/Real_time_caption_translate
- Owner: tuzibr
- License: mit
- Created: 2025-02-26T17:36:44.000Z (7 months ago)
- Default Branch: master
- Last Pushed: 2025-03-08T05:45:26.000Z (7 months ago)
- Last Synced: 2025-03-08T06:25:56.504Z (7 months ago)
- Topics: captions, microphone, real-time, speaker, speech-recognition, subtitles, transcribe, translate
- Language: Python
- Homepage:
- Size: 38.7 MB
- Stars: 4
- Watchers: 1
- Forks: 0
- Open Issues: 2
Metadata Files:
- Readme: README.md
- License: LICENSE
README
[English](README_en.md) | 简体中文
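# Real-Time Caption Translation

A real-time caption translation tool built on VOSK speech recognition and machine translation. It transcribes audio in real time, translates it into the target language, and displays the result as subtitles.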
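## ✨ Features
- 🎙️ Real-time audio transcription and translation
- 🌐 Multiple translation engines (Google/DeepL/Ollama)
- 📊 Translation history shown in the main window
- 🖥️ Real-time subtitle overlay
- 🎚️ Speaker and microphone input
- 🔧 Multi-language configuration file support
- 📚 Check the model list on the [VOSK website](https://alphacephei.com/vosk/models) to confirm that the language you need can be recognized (or train your own model)
- ⚙️ A setting to translate only once the text is complete, which reduces resource usage and gives more accurate results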
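
The "translate only once the text is complete" setting can be pictured as a filter over the recognizer's output. The sketch below is an assumption about how such a filter might look, not the tool's actual code: `text_to_translate` and `final_only` are hypothetical names. It relies only on the fact that VOSK returns results as JSON strings, with in-progress hypotheses under a `partial` key and finished utterances under a `text` key.

```python
import json


def text_to_translate(result_json, final_only=True):
    """Return the text that should be sent to the translator, or None.

    result_json is a JSON string as produced by a VOSK recognizer:
    {"partial": "..."} while speech is in progress,
    {"text": "..."} once an utterance is finished.
    final_only is a hypothetical flag standing in for the setting.
    """
    result = json.loads(result_json)

    # A finished utterance is always worth translating if it is non-empty.
    final_text = result.get("text", "").strip()
    if final_text:
        return final_text

    # With the setting enabled, partial hypotheses are skipped entirely.
    if final_only:
        return None

    partial = result.get("partial", "").strip()
    return partial or None
```

With the flag enabled, each utterance is translated once instead of on every partial update, which is where the saving in translation requests would come from.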
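## 📥 Installation
### Method 1: Use the Release build
1. Go to the [Releases page](https://github.com/tuzibr/Real_time_caption_translate/releases/tag/v1.0.0) and download the latest release archive
2. Extract it to any directory
3. Double-click `Realtimecaptiontranslate.exe` to run it
4. The release bundles a small English recognition model, so English speech can be recognized and translated out of the box. To recognize other languages, download the corresponding model from the [VOSK website](https://alphacephei.com/vosk/models), place it in any folder, and set the model path in the settings

### Method 2: Run from source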
```bash
# Clone the repository
git clone https://github.com/tuzibr/Real_time_caption_translate.git
cd Real_time_caption_translate

# Install dependencies
pip install -r requirements.txt

# Start the program
python Run.py
```

## ⚙️ Configuration
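### Translation engine configuration
| Engine | Required parameters | Notes |
|----------|------------------|--------------------------------------------------------|
| Google | None | Supports 100+ languages, free to use |
| DeepL | API key | Register to obtain a [DeepL API key](https://www.deepl.com) |
| Ollama | Local service address | Ollama must be installed and running first |
| DeepSeek | API key | Requires a [DeepSeek API key](https://platform.deepseek.com) |
| OpenAI | Platform URL, model name, API key | Works with model platforms that support the OpenAI SDK; consult the platform's documentation for its URL, model name, and API key |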
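
The required parameters in the table can be expressed as a small lookup that flags incomplete settings before an engine is used. This is only an illustrative sketch: the key names (`api_key`, `base_url`, `model`) and the `missing_params` helper are assumptions made for the example and are not taken from the project's configuration files.

```python
# Required settings per engine, following the table above.
# The key names are hypothetical; the table only names them in prose.
REQUIRED_PARAMS = {
    "Google": [],
    "DeepL": ["api_key"],
    "Ollama": ["base_url"],
    "DeepSeek": ["api_key"],
    "OpenAI": ["base_url", "model", "api_key"],
}


def missing_params(engine, settings):
    """Return the required parameters that are absent or empty for an engine."""
    if engine not in REQUIRED_PARAMS:
        raise ValueError(f"unknown engine: {engine}")
    return [name for name in REQUIRED_PARAMS[engine] if not settings.get(name)]
```

For example, `missing_params("OpenAI", {"base_url": "https://example.invalid/v1", "api_key": "k"})` reports that `model` still needs to be filled in, while Google needs no settings at all.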