{"id":13600434,"url":"https://github.com/wxbool/video-srt","last_synced_at":"2025-09-13T00:34:06.848Z","repository":{"id":46529176,"uuid":"222620959","full_name":"wxbool/video-srt","owner":"wxbool","description":"An open-source command-line tool that recognizes speech in videos and automatically generates SRT subtitle files.","archived":false,"fork":false,"pushed_at":"2022-03-19T05:10:35.000Z","size":18936,"stargazers_count":394,"open_issues_count":8,"forks_count":67,"subscribers_count":10,"default_branch":"master","last_synced_at":"2025-03-30T07:11:15.340Z","etag":null,"topics":["ffmpeg","go","golang","srt","video"],"latest_commit_sha":null,"homepage":"","language":"Go","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/wxbool.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2019-11-19T06:06:26.000Z","updated_at":"2025-03-22T13:59:30.000Z","dependencies_parsed_at":"2022-07-19T23:02:53.998Z","dependency_job_id":null,"html_url":"https://github.com/wxbool/video-srt","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/wxbool%2Fvideo-srt","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/wxbool%2Fvideo-srt/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/wxbool%2Fvideo-srt/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/wxbool%2Fvideo-srt/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/wxbool","download_url":"https://codeload.github.com/wxbool/video-srt/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247451667,"o
wners_count":20940944,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["ffmpeg","go","golang","srt","video"],"created_at":"2024-08-01T18:00:38.981Z","updated_at":"2025-04-06T08:15:34.682Z","avatar_url":"https://github.com/wxbool.png","language":"Go","funding_links":[],"categories":["Go"],"sub_categories":[],"readme":"## video-srt\n\nAn open-source command-line tool that recognizes speech in videos and automatically generates SRT subtitle files.\n\nThis project uses the Alibaba Cloud [OSS Object Storage](https://www.aliyun.com/product/oss?spm=5176.12825654.eofdhaal5.13.e9392c4aGfj5vj\u0026aly_as=K11FcpO8) and [Recording File Recognition](https://ai.aliyun.com/nls/filetrans?spm=5176.12061031.1228726.1.47fe3cb43I34mn) service APIs.\n\nWindows GUI version: [https://github.com/wxbool/video-srt-windows](https://github.com/wxbool/video-srt-windows)\n\n## Download and Installation\n```shell\ngo get -u github.com/wxbool/video-srt\n```\n\n## Usage\n###### This project depends on [ffmpeg](http://ffmpeg.org/). Download and install it first, then set up the environment variable.\n\n* Configure the service APIs (config.ini)\n```ini\n#Subtitle settings\n[srt]\n#Intelligent segmentation: true (enabled) false (disabled)\nintelligent_block=true\n\n#Alibaba Cloud OSS object storage settings\n#Documentation: https://help.aliyun.com/document_detail/31827.html?spm=a2c4g.11186623.6.582.4e7858a85Dr5pA\n[aliyunOss]\n# Public access domain (endpoint) of the OSS service\nendpoint=your.Endpoint\n# Bucket name\nbucketName=your.BucketName\n# Bucket domain address\nbucketDomain=your.BucketDomain\naccessKeyId=your.AccessKeyId\naccessKeySecret=your.AccessKeySecret\n\n#Alibaba Cloud speech recognition settings\n#Documentation:\n[aliyunClound]\n# Appkey of the project created in the console; uniquely identifies the project\nappKey=your.AppKey\naccessKeyId=your.AccessKeyId\naccessKeySecret=your.AccessKeySecret\n```\n\n* Generate the subtitle file (CLI)\n\n```shell\ngo run main.go video.mp4\n```\n\n* Generate the subtitle file (executable | 
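[video-srt.exe](https://github.com/wxbool/video-srt/blob/master/video-srt.exe))\n```shell\nvideo-srt video.mp4\n```\n\n\n## FAQ\n* Which languages are supported?\n    * Speech-to-text for the subtitles is performed by the API of Alibaba Cloud's `Recording File Recognition` service, which supports Mandarin Chinese, Chinese dialects, English, and other languages\n* How do I get started with this tool?\n    * Register an Alibaba Cloud account\n    * Complete quick real-name verification for the account\n    * Enable the `Resource Access Management` service, create a role, and grant it access to `OSS Object Storage` and `Intelligent Speech Interaction`\n    * Enable the `OSS Object Storage` service and create a bucket (set its read/write permission to public-read)\n    * Enable the `Intelligent Speech Interaction` service and create a project (choose the recognition language and preferences according to your use case)\n    * Fill in the settings in the `config.ini` file\n    * Run the tool from the command line (see `Usage`)","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fwxbool%2Fvideo-srt","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fwxbool%2Fvideo-srt","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fwxbool%2Fvideo-srt/lists"}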