https://github.com/chimeracoder/textcorpora
A Go helper package that provides an interface for various corpora used in text analysis
- Host: GitHub
- URL: https://github.com/chimeracoder/textcorpora
- Owner: ChimeraCoder
- License: lgpl-3.0
- Created: 2014-05-24T19:08:43.000Z (over 11 years ago)
- Default Branch: master
- Last Pushed: 2015-04-01T01:23:31.000Z (almost 11 years ago)
- Last Synced: 2025-03-23T07:14:25.093Z (11 months ago)
- Language: Go
- Size: 152 KB
- Stars: 3
- Watchers: 2
- Forks: 1
- Open Issues: 0
Metadata Files:
- Readme: README
- License: LICENSE
README
textcorpora
==============
[GoDoc](https://godoc.org/github.com/ChimeraCoder/textcorpora)
TextCorpora is a helper package that provides an interface for various [corpora](https://en.wikipedia.org/wiki/Text_corpus). It was originally written for use in the [ReadingLevel](https://github.com/ChimeraCoder/readinglevel) library. It is provided as a separate package for convenience, both to facilitate the use of corpora in other applications and libraries and to let users of the ReadingLevel library plug in an alternative corpus if desired.
### Storage
Each corpus is stored in a directory provided by [appdirs](https://github.com/Wessie/appdirs). For example, on Linux, the current version of the CMU corpus is downloaded and saved to `~/.local/share/cmudict/.1/cmudict.0.7a.corpus`.
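
To illustrate how a path like the one above comes together on Linux, here is a minimal sketch that uses only the Go standard library. It assumes the usual XDG convention (`$XDG_DATA_HOME` if set, otherwise `~/.local/share`) and does not call appdirs or textcorpora; the helpers `dataHome` and `corpusPath` are hypothetical names introduced for this example, and the path components are copied from the example path above.

```go
package main

import (
	"fmt"
	"os"
	"path/filepath"
)

// dataHome returns the base directory for per-user data on Linux.
// Assumption: XDG convention, i.e. $XDG_DATA_HOME when it is set,
// otherwise $HOME/.local/share.
func dataHome(home, xdg string) string {
	if xdg != "" {
		return xdg
	}
	return filepath.Join(home, ".local", "share")
}

// corpusPath is a hypothetical helper (not part of textcorpora or appdirs)
// that joins the data directory, an application name, a version directory,
// and a file name into a single path.
func corpusPath(home, xdg, app, version, file string) string {
	return filepath.Join(dataHome(home, xdg), app, version, file)
}

func main() {
	home, err := os.UserHomeDir()
	if err != nil {
		fmt.Fprintln(os.Stderr, "cannot determine home directory:", err)
		os.Exit(1)
	}
	// Components taken verbatim from the example path in this README.
	fmt.Println(corpusPath(home, os.Getenv("XDG_DATA_HOME"),
		"cmudict", ".1", "cmudict.0.7a.corpus"))
}
```

On other operating systems the base directory differs, which is the reason to delegate that choice to a library such as appdirs instead of hard-coding it.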