https://github.com/nikitaeverywhere/edu-texts-analyzer
Analyzes texts for compliance with the second Zipf's and Heaps law.
https://github.com/nikitaeverywhere/edu-texts-analyzer
analysis education heaps law statistical-analysis statistics text zipf
Last synced: about 1 year ago
JSON representation
Analyzes texts for compliance with the second Zipf's and Heaps law.
- Host: GitHub
- URL: https://github.com/nikitaeverywhere/edu-texts-analyzer
- Owner: nikitaeverywhere
- License: mit
- Created: 2017-09-18T17:29:44.000Z (almost 9 years ago)
- Default Branch: master
- Last Pushed: 2017-09-19T08:20:14.000Z (almost 9 years ago)
- Last Synced: 2025-01-28T14:48:39.649Z (over 1 year ago)
- Topics: analysis, education, heaps, law, statistical-analysis, statistics, text, zipf
- Language: Python
- Homepage:
- Size: 4.33 MB
- Stars: 1
- Watchers: 2
- Forks: 2
- Open Issues: 0
-
Metadata Files:
- Readme: readme.md
- License: license
Awesome Lists containing this project
README
# Texts Statistical Analysis
Analyzes texts for compliance with the Zipf's second law and Heaps' law.
Prerequisites
-------------
Having Python 3 installed, clone the project and install its dependencies:
```bash
git clone git@github.com:ZitRos/edu-texts-analyzer.git
cd edu-texts-analyzer
pip3 install -r requirements.txt
```
Texts for analysis are taken from `texts` directory.
Every file in this directory and its subdirectories will be treated as a text file.
There are already some articles, but you may place your own.
Zipf's Law
----------
Having Python 3 installed, install dependencies and run the program:
```bash
py index.py
```
It will generate `Zipf.xlsx` file with word ranks/frequencies data.

Heaps' Law
-----------
Output will go to `Heaps.xlsx` file.
