https://github.com/jgontrum/corpushashtagstatistics
Generates statistics for the hashtag usage in text corpus.
https://github.com/jgontrum/corpushashtagstatistics
analysis corpus hashtags linguistics nlp statistics twitter
Last synced: over 1 year ago
JSON representation
Generates statistics for the hashtag usage in text corpus.
- Host: GitHub
- URL: https://github.com/jgontrum/corpushashtagstatistics
- Owner: jgontrum
- License: mit
- Created: 2017-03-06T21:16:52.000Z (over 9 years ago)
- Default Branch: master
- Last Pushed: 2017-03-06T21:22:06.000Z (over 9 years ago)
- Last Synced: 2025-01-27T08:27:38.239Z (over 1 year ago)
- Topics: analysis, corpus, hashtags, linguistics, nlp, statistics, twitter
- Language: Python
- Size: 4.88 KB
- Stars: 1
- Watchers: 3
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# CorpusHashtagStatistics
Generates statistics for the hashtag usage in text corpus.
## Preparations
- Install Python 3.5
- Install virtualenv
## Usage
```
usage: main.py [-h] --input INPUT --output OUTPUT [--filter FILTER] --format
{text,pipe,csv}
optional arguments:
-h, --help show this help message and exit
--input INPUT Folder containing corpus files.
--output OUTPUT Folder to store the generated statistics in.
--filter FILTER Filenames must contain this.
--format {text,pipe,csv}
Format of the used corpus.
```