https://github.com/lifeiteng/tts-textanalyzer
TTS Text Analyzer
https://github.com/lifeiteng/tts-textanalyzer
analyzer bert text tts
Last synced: 3 months ago
JSON representation
TTS Text Analyzer
- Host: GitHub
- URL: https://github.com/lifeiteng/tts-textanalyzer
- Owner: lifeiteng
- License: apache-2.0
- Created: 2023-07-20T07:11:32.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2023-07-20T07:31:41.000Z (almost 2 years ago)
- Last Synced: 2025-01-15T01:17:56.490Z (5 months ago)
- Topics: analyzer, bert, text, tts
- Homepage:
- Size: 5.86 KB
- Stars: 32
- Watchers: 6
- Forks: 3
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- Funding: .github/FUNDING.yml
- License: LICENSE
Awesome Lists containing this project
README
# TTS-TextAnalyzer
受 [Introducing Unified Neural Text Analyzer: an innovation for Neural Text-to-Speech pronunciation accuracy improvement](https://techcommunity.microsoft.com/t5/azure-ai-services-blog/unified-neural-text-analyzer-an-innovation-to-improve-neural-tts/ba-p/2102187) 启发,可在 BERT 模型基础上构建多个任务的 heads 来统一语音合成文本分析的任务,包括:分词,词性预测、文本归一化、多音词消歧等。这个项目用来收集适用于各任务的数据集信息。
Inspired by [Introducing Unified Neural Text Analyzer: an innovation for Neural Text-to-Speech pronunciation accuracy improvement](https://techcommunity.microsoft.com/t5/azure-ai-services-blog/unified-neural-text-analyzer-an-innovation-to-improve-neural-tts/ba-p/2102187), Different tasks of speech synthesis text analysis can be built on the BERT model, including: Word Segmentation, Part-of-Speech Tagging, Text Normalization, Polyphone Disambiguation and etc. This project is used to collect dataset information suitable for each task.
# Pretrained BERT
* [bert-base-chinese](https://huggingface.co/bert-base-chinese)
* [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased)
* [xlm-roberta-base](https://huggingface.co/xlm-roberta-base)# Word Segmentation
| datasets | code |
| ---- | ------ |
| TODO | |# Part-of-Speech Tagging
| datasets | code |
| ---- | ------ |
| TODO | |# Text Normalization
| datasets / rules | code |
| ---- | ------ |
| rules | [WeTextProcessing](https://github.com/wenet-e2e/WeTextProcessing) |
| Text normalization covering grammars | [TextNormalizationCoveringGrammars](https://github.com/google-research-datasets/TextNormalizationCoveringGrammars) |
| TODO | |# Polyphone Disambiguation
| datasets | code |
| ---- | ------ |
| g2PL | [https://github.com/whzikaros/g2pL](https://github.com/whzikaros/g2pL) |
| CPP (g2pM) | [https://github.com/kakaobrain/g2pm](https://github.com/kakaobrain/g2pm) |
| TODO | |