https://github.com/nguyenvulebinh/spoken-norm

Transformation spoken text to written text
https://github.com/nguyenvulebinh/spoken-norm

Last synced: about 2 months ago
JSON representation

Transformation spoken text to written text

Host: GitHub
URL: https://github.com/nguyenvulebinh/spoken-norm
Owner: nguyenvulebinh
Created: 2021-12-27T19:47:32.000Z (over 3 years ago)
Default Branch: taggen
Last Pushed: 2024-05-14T18:59:34.000Z (about 1 year ago)
Last Synced: 2024-11-04T16:46:00.238Z (7 months ago)
Language: Python
Size: 1.4 MB
Stars: 28
Watchers: 2
Forks: 4
Open Issues: 1
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

        # Transformation spoken text to written text

This model is used for formatting raw asr text output from spoken text to written text (Eg. date, number, id, ...). It also supports formatting "out of vocab" by using external vocabulary. 

Some of examples:

```text

input  : tám giờ chín phút ngày mười tám tháng năm năm hai nghìn không trăm hai mươi hai

output : 8h9 18/5/2022

input  : mã số quy đê tê tê đê hai tám chéo hai không không ba

output : mã số qdttd28/2003

input  : thể tích tám mét khối trọng lượng năm mươi ki lô gam

output : thể tích 8 m3 trọng lượng 50 kg

input    : ngày hai tám tháng tư cô vít bùng phát ở sờ cốt lờn chiếm tám mươi phần trăm là biến chủng đen ta và bê ta

ex_vocab : ['scotland', 'covid', 'delta', 'beta']

output   : 28/4 covid bùng phát ở scotland chiếm 80 % là biến chủng delta và beta

```

## Model architecture

![Model architecture](./model_spoken_norm.svg)

# Infer model

- Play around at [Huggingface Space](https://huggingface.co/spaces/nguyenvulebinh/spoken-norm-taggen)

## Contact 

[email protected]

[![Follow](https://img.shields.io/twitter/follow/nguyenvulebinh?style=social)](https://twitter.com/intent/follow?screen_name=nguyenvulebinh)

```

@INPROCEEDINGS{10094599,

  author={Nguyen, Thai-Binh and Nhat, Le Duc Minh and Nguyen, Quang Minh and Do, Quoc Truong and Luong, Chi Mai and Waibel, Alexander},

  booktitle={ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)}, 

  title={AdapITN: A Fast, Reliable, and Dynamic Adaptive Inverse Text Normalization}, 

  year={2023},

  volume={},

  number={},

  pages={1-5},

  keywords={Adaptation models;Runtime;Transforms;Signal processing;Natural language processing;Semiotics;Reliability;ASR;inverse text normalization;semiotic pharse;phonetization phrase},

  doi={10.1109/ICASSP49357.2023.10094599}}

```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/nguyenvulebinh/spoken-norm

Awesome Lists containing this project

README