https://github.com/SocialSisterYi/bcut-asr

使用必剪API的语音字幕识别
https://github.com/SocialSisterYi/bcut-asr

Last synced: 5 months ago
JSON representation

使用必剪API的语音字幕识别

Host: GitHub
URL: https://github.com/SocialSisterYi/bcut-asr
Owner: SocialSisterYi
License: mit
Created: 2022-10-11T00:54:57.000Z (over 2 years ago)
Default Branch: main
Last Pushed: 2024-04-28T14:57:30.000Z (about 1 year ago)
Last Synced: 2024-10-29T19:59:30.682Z (6 months ago)
Language: Python
Size: 48.8 KB
Stars: 300
Watchers: 3
Forks: 41
Open Issues: 3
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

StarryDivineSky - SocialSisterYi/bcut-asr

README

        
Bcut-ASR


使用必剪 API 进行云端语音字幕识别，支持 CLI 和 module 调用

## ✨Feature

- 可直接上传`flac`, `aac`, `m4a`, `mp3`, `wav`音频格式

- 自动调用 ffmpeg, 实现视频伴音和其他音频格式转码

- 支持`srt`, `json`, `lrc`, `txt`格式字幕输出

- 字幕支持断句和首位时间标记

- 可使用 stdout 输出字幕文本

## 🚀Install

首先确保 ffmpeg 已安装，且 PATH 中可以访问，若未安装可以使用如下命令（已安装请无视）：

Linux：

```bash

sudo apt install ffmpeg

```

Windows：

```powershell

winget install ffmpeg

```

本项目暂时未发布 pypi，应使用本地安装，Python 版本应 >= 3.10，需要安装 poetry 

```bash

git clone https://github.com/SocialSisterYi/bcut-asr

cd bcut-asr

poetry lock

poetry build -f wheel

pip install dist/bcut_asr-0.0.3-py3-none-any.whl # Example

```

## 📃Usage

### CLI Interface

```bash

bcut_asr video.mp4

```

或

```bash

bcut_asr video.mp4 subtitle.srt

```

或

```bash

bcut_asr video.mp4 -f srt - > subtitle.srt

```

长音频指定任务状态轮询间隔(秒)，避免接口频繁调用

```bash

bcut_asr video.mp4 -f srt -i 30 - > subtitle.srt

```

```

bcut_asr -h

usage: bcut-asr [-h] [-f [{srt,json,lrc,txt}]] [-i [1.0]] input [output]

必剪语音识别

positional arguments:

  input                 输入媒体文件

  output                输出字幕文件, 可stdout

options:

  -h, --help            show this help message and exit

  -f [{srt,json,lrc,txt}], --format [{srt,json,lrc,txt}]

                        输出字幕格式

  -i [1.0], --interval [1.0]

                        任务状态轮询间隔(秒)

支持输入音频格式: flac, aac, m4a, mp3, wav 支持自动调用ffmpeg提取视频伴音

```

### Module

```python

from bcut_asr import BcutASR

from bcut_asr.orm import ResultStateEnum

asr = BcutASR('voice.mp3')

asr.upload() # 上传文件

asr.create_task() # 创建任务

# 轮询检查结果

while True:

    result = asr.result()

    # 判断识别成功

    if result.state == ResultStateEnum.COMPLETE:

        break

# 解析字幕内容

subtitle = result.parse()

# 判断是否存在字幕

if subtitle.has_data():

    # 输出srt格式

    print(subtitle.to_srt())

```

输入视频

```python

from bcut_asr import run_everywhere

from argparse import Namespace

f = open("file.mp4", "rb")

argg = Namespace(format="srt", interval=30.0, input=f, output=None)

run_everywhere(argg)

```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/SocialSisterYi/bcut-asr

Awesome Lists containing this project

README

Bcut-ASR