Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/mewo2/syllpos
Wordlists by part of speech and syllable count
https://github.com/mewo2/syllpos
Last synced: 3 days ago
JSON representation
Wordlists by part of speech and syllable count
- Host: GitHub
- URL: https://github.com/mewo2/syllpos
- Owner: mewo2
- License: other
- Created: 2015-06-28T22:58:45.000Z (over 9 years ago)
- Default Branch: master
- Last Pushed: 2015-10-24T17:19:59.000Z (about 9 years ago)
- Last Synced: 2024-08-02T05:12:20.252Z (3 months ago)
- Language: Python
- Size: 273 KB
- Stars: 22
- Watchers: 3
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE.md
Awesome Lists containing this project
README
# Wordlists by part of speech and syllable count
This is a collection of wordlists, taken from the [Brown University Standard
Corpus of Present-Day American English][brown]. Filenames have the form
*postag*-*syllablecount*.txt, where *postag* is the part of speech tag, and
syllable count is the number of syllables in the word.The part-of-speech tags form part of the corpus, and are described further
[here][postags].The syllable counts are taken from the pronunciations in the [CMU Pronouncing
Dictionary][cmudict]. Words not included in the CMU dictionary are ignored. In
cases where there is more than one pronunciation listed, the first is used.[cmudict]: http://www.speech.cs.cmu.edu/cgi-bin/cmudict
[brown]: http://www.helsinki.fi/varieng/CoRD/corpora/BROWN/
[postags]: https://en.wikipedia.org/wiki/Brown_Corpus#Part-of-speech_tags_used