https://github.com/lifeiteng/tts-textanalyzer

TTS Text Analyzer
https://github.com/lifeiteng/tts-textanalyzer

analyzer bert text tts

Last synced: 3 months ago
JSON representation

TTS Text Analyzer

Host: GitHub
URL: https://github.com/lifeiteng/tts-textanalyzer
Owner: lifeiteng
License: apache-2.0
Created: 2023-07-20T07:11:32.000Z (almost 2 years ago)
Default Branch: main
Last Pushed: 2023-07-20T07:31:41.000Z (almost 2 years ago)
Last Synced: 2025-01-15T01:17:56.490Z (5 months ago)
Topics: analyzer, bert, text, tts
Homepage:
Size: 5.86 KB
Stars: 32
Watchers: 6
Forks: 3
Open Issues: 0
Metadata Files:
- Readme: README.md
- Funding: .github/FUNDING.yml
- License: LICENSE

Awesome Lists containing this project

README

        # TTS-TextAnalyzer

受 [Introducing Unified Neural Text Analyzer: an innovation for Neural Text-to-Speech pronunciation accuracy improvement](https://techcommunity.microsoft.com/t5/azure-ai-services-blog/unified-neural-text-analyzer-an-innovation-to-improve-neural-tts/ba-p/2102187) 启发，可在 BERT 模型基础上构建多个任务的 heads 来统一语音合成文本分析的任务，包括：分词，词性预测、文本归一化、多音词消歧等。这个项目用来收集适用于各任务的数据集信息。

Inspired by [Introducing Unified Neural Text Analyzer: an innovation for Neural Text-to-Speech pronunciation accuracy improvement](https://techcommunity.microsoft.com/t5/azure-ai-services-blog/unified-neural-text-analyzer-an-innovation-to-improve-neural-tts/ba-p/2102187), Different tasks of speech synthesis text analysis can be built on the BERT model, including: Word Segmentation, Part-of-Speech Tagging, Text Normalization, Polyphone Disambiguation and etc. This project is used to collect dataset information suitable for each task.

# Pretrained BERT

* [bert-base-chinese](https://huggingface.co/bert-base-chinese)

* [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased)

* [xlm-roberta-base](https://huggingface.co/xlm-roberta-base)

# Word Segmentation

| datasets | code |

| ----  | ------ |

| TODO | |

# Part-of-Speech Tagging

| datasets | code |

| ----  | ------ |

| TODO | |

# Text Normalization

| datasets / rules | code |

| ----  | ------ |

| rules | [WeTextProcessing](https://github.com/wenet-e2e/WeTextProcessing) |

| Text normalization covering grammars | [TextNormalizationCoveringGrammars](https://github.com/google-research-datasets/TextNormalizationCoveringGrammars) |

| TODO | |

# Polyphone Disambiguation

| datasets | code |

| ----  | ------ |

| g2PL | [https://github.com/whzikaros/g2pL](https://github.com/whzikaros/g2pL) |

| CPP (g2pM) | [https://github.com/kakaobrain/g2pm](https://github.com/kakaobrain/g2pm) |

| TODO | |

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/lifeiteng/tts-textanalyzer

Awesome Lists containing this project

README