Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with punctuation

A curated list of projects in awesome lists tagged with punctuation.

https://github.com/modelscope/funasr

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

audio-visual-speech-recognition conformer dfsmn paraformer pretrained-model punctuation pytorch rnnt speaker-diarization speech-recognition speechgpt speechllm vad voice-activity-detection whisper

Last synced: 17 Dec 2024

https://github.com/modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

audio-visual-speech-recognition conformer dfsmn paraformer pretrained-model punctuation pytorch rnnt speaker-diarization speech-recognition speechgpt speechllm vad voice-activity-detection whisper

Last synced: 29 Oct 2024

https://github.com/ottokart/punctuator2

A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text

attention demo punctuation recurrent-neural-networks theano

Last synced: 14 Nov 2024

https://github.com/26hzhang/neural_sequence_labeling

A TensorFlow implementation of a Neural Sequence Labeling model, which can tackle sequence labeling tasks such as POS Tagging, Chunking, NER, Punctuation Restoration, etc.

chunking lstm-networks named-entity-recognition pos-tagger punctuation python3 sentence-boundary-detection sequence-labeling tensorflow

Last synced: 19 Dec 2024

https://github.com/yeyupiaoling/punctuationmodel

A Chinese punctuation model that can add punctuation marks to text.

asr ernie punctuation

Last synced: 31 Oct 2024

https://github.com/motazsaad/process-arabic-text

Pre-process Arabic text (remove diacritics, punctuation, and repeated characters)

arabic-nlp punctuation remove-diacritics

Last synced: 14 Nov 2024

https://github.com/LanguageMachines/ucto

Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps, such as changing case, that you can use to make your text suitable for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules for several languages and can be easily extended to suit other languages. It has been incorporated for tokenizing Dutch text in Frog, our Dutch morpho-syntactic processor. http://ilk.uvt.nl/ucto

computational-linguistics folia language natural-language-processing nlp punctuation tokeniser

Last synced: 30 Oct 2024

https://github.com/languagemachines/ucto

Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps, such as changing case, that you can use to make your text suitable for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules for several languages and can be easily extended to suit other languages. It has been incorporated for tokenizing Dutch text in Frog, our Dutch morpho-syntactic processor. http://ilk.uvt.nl/ucto

computational-linguistics folia language natural-language-processing nlp punctuation tokeniser

Last synced: 16 Dec 2024

https://github.com/kaituoxu/X-Punctuator

A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text without punctuation.

lstm punctuation pytorch

Last synced: 14 Nov 2024

https://github.com/ferdinandzhong/punctuator

A small seq2seq punctuator tool based on DistilBERT

bert bert-ner chinese-nlp deep-learning nlp punctuation pytorch seq2seq

Last synced: 16 Dec 2024

https://github.com/FerdinandZhong/punctuator

A small seq2seq punctuator tool based on DistilBERT

bert bert-ner chinese-nlp deep-learning nlp punctuation pytorch seq2seq

Last synced: 14 Nov 2024

https://github.com/regexhq/punctuation-regex

Regular expression for matching punctuation characters.

punctuation punctuation-character regex regular-expression

Last synced: 20 Nov 2024

https://github.com/rewired-gh/tep

A blazingly fast tool for converting to English punctuation

cli command-line command-line-tool converter punctuation rust text

Last synced: 02 Nov 2024

https://github.com/vishwagauravin/string-tools-pro

🤏 Tiny & versatile 🔥 Node.js library for in-depth text analysis, manipulation and data extraction.

character-counter cookie parse-email parse-url punctuation text-analysis text-manipulation vowels whitespace word-count

Last synced: 06 Nov 2024

https://github.com/populated/punctuation

A full guide on understanding punctuation, etc.

english grammar punctuation spelling typing writing

Last synced: 15 Nov 2024

https://github.com/cgnieder/fnpct

Manage interaction between footnotes and punctuation

footnotes latex latex-package punctuation punctuation-marks

Last synced: 11 Nov 2024

https://github.com/Fusyong/zhpunc

A ConTeXt LMTX module to support Chinese punctuation

chinese cjk context luatex punctuation tex typesetting

Last synced: 23 Oct 2024

https://github.com/gitfaf/node-punctuation-stats

A small library for getting stats on punctuation in files. - Node Module

node node-module node-punctuation-stats punctuation stats

Last synced: 06 Nov 2024

https://github.com/guevara-chan/unicide

⋮Forced evolution for unicellular entities⋮

browser-application coffeescript grammer-checker html5 punctuation

Last synced: 19 Nov 2024

https://github.com/snowdreams1006/gitbook-plugin-punctuation-converter

A GitBook plugin that uses regular expressions to globally convert English punctuation marks to Chinese punctuation marks.

gitbook-plugin punctuation punctuation-converter

Last synced: 23 Nov 2024

https://github.com/kouisamine/punctuation-remover

Remove Punctuation is a tool that helps you strip all punctuation marks and symbols from a text document or input string.

js online php punctuation punctuation-remover remove-punctuation remove-punctuations-from-a-string script source-code string text text-tools tools

Last synced: 12 Nov 2024

https://github.com/rbardini/dashes

A quick reference guide to the use of dashes

dash em-dash en-dash m-dash m-rule mutton n-dash n-rule nut punctuation

Last synced: 08 Nov 2024

https://github.com/harmanveer-2546/tweets-cleaning-with-python

Twitter is one of the most used data sources for data analysis. The reason is that it’s open and free to collect unless you subscribe to the paid version. Besides, it’s pretty simple to collect data from it.

filtering function-calling nlptk numpy os pandas punctuation python re removing-links tokenization tweet-cleaning twitter-data-analysis twitter-sentiment-analysis

Last synced: 12 Nov 2024

https://github.com/rotten-lkz/nopun

Try to read writings in classical Chinese without punctuation! (Learn to find sentence breaks as the ancients did by reading classical Chinese without punctuation marks!)

classical-chinese punctuation

Last synced: 17 Dec 2024

https://github.com/sholladay/denizen

Username validation and processing utilities

assert character punctuation test username validate

Last synced: 11 Dec 2024

https://github.com/whistlingzephyr/espanso-package-quotes

Type different types of quotes from many languages using espanso.

espanso espanso-package punctuation quotations unicode

Last synced: 09 Dec 2024

https://github.com/turekbot/autodash

Want to type an Em Dash—now you can. Just type "--".

autocomplete dash em grammar punctuation punctuation-correction

Last synced: 05 Nov 2024