Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/mewo2/syllpos

Wordlists by part of speech and syllable count
https://github.com/mewo2/syllpos

Last synced: about 2 months ago
JSON representation

Wordlists by part of speech and syllable count

Awesome Lists containing this project

README

        

# Wordlists by part of speech and syllable count

This is a collection of wordlists, taken from the [Brown University Standard
Corpus of Present-Day American English][brown]. Filenames have the form
*postag*-*syllablecount*.txt, where *postag* is the part of speech tag, and
syllable count is the number of syllables in the word.

The part-of-speech tags form part of the corpus, and are described further
[here][postags].

The syllable counts are taken from the pronunciations in the [CMU Pronouncing
Dictionary][cmudict]. Words not included in the CMU dictionary are ignored. In
cases where there is more than one pronunciation listed, the first is used.

[cmudict]: http://www.speech.cs.cmu.edu/cgi-bin/cmudict
[brown]: http://www.helsinki.fi/varieng/CoRD/corpora/BROWN/
[postags]: https://en.wikipedia.org/wiki/Brown_Corpus#Part-of-speech_tags_used