Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/ggerganov/ggwords

Generate language n-gram statistics
https://github.com/ggerganov/ggwords

language ngrams statistics

Last synced: 19 days ago
JSON representation

Generate language n-gram statistics

Awesome Lists containing this project

README

        

# ggwords

Generate n-gram statistics by processing the contents of English books/texts.

## Usage

```bash
git clone https://github.com/ggerganov/ggwords
cd ggwords
mkdir build
cd build
cmake ..
make -j4

./bin/analyze /path/to/metadata/books.txt /path/to/books/text
```

## Sample data

The data in [./data](./data) was generated using https://github.com/pgcorpus/gutenberg