Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/bigcash/awesome-punctuator

A curated list of awesome punctuator
https://github.com/bigcash/awesome-punctuator

List: awesome-punctuator

awesome awesome-list punctuation punctuation-marks punctuation-restoration punctuator

Last synced: 5 days ago
JSON representation

A curated list of awesome punctuator

Lists

README

        

# Awesome Punctuator List

awesome list of **punctuator** ( a.k.a., **Punctuation**)

**The list is in no particular order!!!**

Pull requests are welcome!

## List

| Code | LANG | model | Summary | Paper |
| ------------------------------------------------------------ | ----- | ----- | ------------------------------------------------------------ | ------------------------------------------------------------ |
| [*punctuator*](https://github.com/ottokart/punctuator) | | Y | An LSTM RNN for restoring missing punctuation in unsegmented text. | |
| [*punctuator*2](https://github.com/ottokart/punctuator2) | en | Y | A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text | |
| [keras-*punctuator*](https://github.com/vackosar/keras-punctuator) | en | Y | Experimental project to punctuate text using a embedding layer, single convolutional layer and output softmax layer written in Keras. | |
| [X-*Punctuator*](https://github.com/kaituoxu/X-Punctuator) | en | | A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text without punctuation. | |
| [*punctuator*](https://github.com/FerdinandZhong/punctuator) | en/cn | Y | A small seq2seq punctuator tool based on DistilBERT | |
| [punctuation-prediction](https://github.com/cadia-lvl/punctuation-prediction) | | Y | Support tools for punctuation and boundary detection for ASR output. | |
| [punkProse](https://github.com/alpoktem/punkProse) | en | | Punctuation generation for speech transcripts using lexical and prosodic features | [link](https://link.springer.com/chapter/10.1007/978-3-319-68456-7_11) |
| [sequence-labeler](https://github.com/marekrei/sequence-labeler) | | | Neural network sequence labeling model | |
| [punctuator](https://github.com/jiali-ms/punctuator) | jp | | JP puncuator | |
| [tf2-punctuator2](https://github.com/batvoice-org/tf2-punctuator2) | | | Tensorflow 2.0 implementation of automatic punctuation with RNN + attention | |
| [PPASR](https://github.com/yeyupiaoling/PPASR) | cn | y | 基于PaddlePaddle2实现端到端中文语音识别,从入门到实战,超简单的入门案例,超实用的企业项目。 | |
| [-](https://github.com/camel2000/-) | cn | | 用多层BLSTM模型同时进行中文分词和标点符号预测 | |
| [BertPunc](https://github.com/nkrnrnk/BertPunc) | en | | SOTA punctation restoration (for e.g. automatic speech recognition) deep learning model based on BERT pre-trained model | [link](https://www.isca-speech.org/archive/Interspeech_2016/pdfs/1517.PDF) |
| [PaddleSpeech](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/iwslt2012/punc0) | cn | y | Punctuation Restoration | |
| [ffa](https://github.com/Young1993/ffa) | en | | A Context-aware Feature Fusion Framework for Punctuation Restoration | |
| [indic-punct](https://github.com/Open-Speech-EkStep/indic-punct) | multi | y | We currently only handle numbers as a part of our ITN pipeline, and have developed and open-sourced WFST (weighted finite state transducer) based ITN support for 11 Indic languages -Hindi, Gujarati, Telugu, Marathi, Punjabi, Tamil, Bengali, Malayalam, Odia, Assamese, Kannada, using NVIDIA’s NEMO toolkit. | |
| [UniPunc](https://github.com/Yaoming95/UniPunc) | en | n | The case study and multilingfual performance of ICASSP submission | [link](https://ieeexplore.ieee.org/document/9747131) |