{"id":18273602,"url":"https://github.com/rapidai/rapidasr","last_synced_at":"2025-05-16T17:07:51.900Z","repository":{"id":45519478,"uuid":"438525528","full_name":"RapidAI/RapidASR","owner":"RapidAI","description":"📣 商用级开源语音自动识别程序库，开箱即用，全平台支持，中英文混合识别。A Cross-platform implementation of ASR inference. It's based on ONNXRuntime and  FunASR. We provide a set of easier APIs to call ASR models.","archived":false,"fork":false,"pushed_at":"2024-05-15T04:52:36.000Z","size":37580,"stargazers_count":550,"open_issues_count":14,"forks_count":64,"subscribers_count":18,"default_branch":"main","last_synced_at":"2025-05-14T19:14:44.447Z","etag":null,"topics":["asr","paddlespeech","paraformer","wenet"],"latest_commit_sha":null,"homepage":"","language":"C++","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/RapidAI.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2021-12-15T06:49:38.000Z","updated_at":"2025-05-10T12:08:33.000Z","dependencies_parsed_at":"2025-01-15T07:08:38.698Z","dependency_job_id":"eb0a06b7-9d18-43f7-9c35-f695ecefa7fe","html_url":"https://github.com/RapidAI/RapidASR","commit_stats":null,"previous_names":[],"tags_count":3,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/RapidAI%2FRapidASR","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/RapidAI%2FRapidASR/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/RapidAI%2FRapidASR/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/RapidAI%2FRapidASR/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/RapidAI","download_url":"https://codeload.github.com/RapidAI/RapidASR/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":254209851,"owners_count":22032897,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["asr","paddlespeech","paraformer","wenet"],"created_at":"2024-11-05T12:07:02.064Z","updated_at":"2025-05-16T17:07:51.880Z","avatar_url":"https://github.com/RapidAI.png","language":"C++","funding_links":[],"categories":[],"sub_categories":[],"readme":"## Rapid ASR\n\u003cp align=\"left\"\u003e\n    \u003ca href=\"https://huggingface.co/spaces/SWHL/RapidASRDemo\" target=\"_blank\"\u003e\u003cimg src=\"https://img.shields.io/badge/%F0%9F%A4%97-Hugging Face Demo-blue\"\u003e\u003c/a\u003e\n    \u003ca href=\"https://www.modelscope.cn/studios/liekkas/RapidASRDemo/summary\" target=\"_blank\"\u003e\u003cimg src=\"https://img.shields.io/badge/ModelScope-Demo-blue\"\u003e\u003c/a\u003e\n    \u003ca href=\"\"\u003e\u003cimg src=\"https://img.shields.io/badge/OS-Linux%2C%20Win%2C%20Mac-pink.svg\"\u003e\u003c/a\u003e\n    \u003ca href=\"\"\u003e\u003cimg src=\"https://img.shields.io/badge/Python-\u003e=3.6,\u003c3.13-aff.svg\"\u003e\u003c/a\u003e\n    \u003ca href=\"\"\u003e\u003cimg src=\"https://img.shields.io/badge/C++-aff.svg\"\u003e\u003c/a\u003e\n\u003c/p\u003e\n\n- 🎉 推出知识星球[RapidAI私享群](https://t.zsxq.com/0duLBZczw)，这里的提问会优先得到回答和支持，也会享受到RapidAI组织后续持续优质的服务。欢迎大家的加入。\n- Paraformer模型出自阿里达摩院[Paraformer语音识别-中文-通用-16k-离线-large-pytorch](https://www.modelscope.cn/models/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch/summary)。\n- 本仓库仅对模型做了转换，只采用ONNXRuntime推理引擎。该项目核心代码已经并入[FunASR](https://github.com/alibaba-damo-academy/FunASR)。\n- 项目仍会持续更新，欢迎关注。\n- QQ群号：645751008\n\n#### 📖文档导航\n- 语音识别：\n    - rapid_paraformer:\n        - [rapid_paraformer-Python](./python/README.md)\n        - [rapid_C++/C](./cpp_onnx/readme.md)\n    - [rapid_wenet](https://github.com/RapidAI/RapidASR/tree/rapid_wenet)\n        - [Python](https://github.com/RapidAI/RapidASR/tree/rapid_wenet/python)\n        - [C++](https://github.com/RapidAI/RapidASR/tree/rapid_wenet/cpp)\n    - [rapid_paddlespeech-Python](https://github.com/RapidAI/RapidASR/tree/rapid_paddlespeech)\n- 标点符号\n    - [RapidPunc](https://github.com/RapidAI/RapidPunc)\n\n#### 📆TODO以及任务认领\n- 参见这里：[link](https://github.com/RapidAI/RapidASR/issues/15)\n\n#### 🎨整体框架\n```mermaid\nflowchart LR\n\nA([wav]) --RapidVad--\u003e B([各个小段的音频]) --RapidASR--\u003e C([识别的文本内容]) --RapidPunc--\u003e D([最终识别内容])\n```\n\n#### 📣更新日志\n\u003cdetails\u003e\n\u003csummary\u003e详情\u003c/summary\u003e\n- 2023-08-21 v2.0.4 update:\n  - 添加whl包支持\n  - 更新文档\n- 2023-02-25\n   - 添加C++版本推理，使用onnxruntime引擎，预/后处理代码来自： [FastASR](https://github.com/chenkui164/FastASR)\n- 2023-02-14 v2.0.3 update:\n  - 修复librosa读取wav文件错误\n  - 修复fbank与torch下fbank提取结果不一致bug\n- 2023-02-11 v2.0.2 update:\n  - 模型和推理代码解耦（`rapid_paraformer`和`resources`）\n  - 支持批量推理（通过`resources/config.yaml`中`batch_size`指定）\n  - 增加多种输入方式（`Union[str, np.ndarray, List[str]]`）\n- 2023-02-10 v2.0.1 update:\n  - 添加对输入音频为噪音或者静音的文件推理结果捕捉。\n\n\u003c/details\u003e\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Frapidai%2Frapidasr","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Frapidai%2Frapidasr","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Frapidai%2Frapidasr/lists"}