https://github.com/oleggator/doc-moderator
Highlight words in txt/docx document using Levenshtein distance
https://github.com/oleggator/doc-moderator
Last synced: 10 months ago
JSON representation
Highlight words in txt/docx document using Levenshtein distance
- Host: GitHub
- URL: https://github.com/oleggator/doc-moderator
- Owner: oleggator
- License: mit
- Created: 2018-12-12T21:23:17.000Z (about 7 years ago)
- Default Branch: master
- Last Pushed: 2018-12-30T15:13:53.000Z (about 7 years ago)
- Last Synced: 2025-01-22T08:44:53.656Z (12 months ago)
- Language: Python
- Homepage:
- Size: 11.7 KB
- Stars: 0
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Document moderator
Searches for words in the selected document with selected Levenshtein distance to words in the selected dictionary
## Dependencies
- [Python 3.7](https://www.python.org/downloads/)
- [pipenv](https://pipenv.readthedocs.io/en/latest/)
## Installation
```
pipenv install
```
## Usage
```
usage: main.py [-h] -d TARGET_WORDS -a ARTICLE -o OUTPUT -l DISTANCE
find words in document
optional arguments:
-h, --help show this help message and exit
-d TARGET_WORDS dictionary (txt)
-a ARTICLE article (docx)
-o OUTPUT article with highlighted words
-l DISTANCE editor distance
```
## Authors
- Kirill Syomin
- Oleg Utkin