https://github.com/zouharvi/lansher
Language Distinctor (winter semester 2017/2018 at MFF)
language-detection python-3
- Host: GitHub
- URL: https://github.com/zouharvi/lansher
- Owner: zouharvi
- Created: 2017-10-13T18:18:08.000Z (over 8 years ago)
- Default Branch: master
- Last Pushed: 2018-12-20T16:33:32.000Z (over 7 years ago)
- Last Synced: 2025-01-08T07:11:46.254Z (over 1 year ago)
- Topics: language-detection, python-3
- Language: Python
- Homepage:
- Size: 1.03 MB
- Stars: 1
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# LANSHER
LANSHER (Language Distinguisher) is a small Python 3 program that facilitates language detection.
## Usage
1. Feed data into the program as JSON that maps each language code to a sample text in that language:
```
{
"en": "sample text in english",
"cs": "ukázkový text v češtině",
...
}
```
2. Ask the program to categorize the input text.
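
The two steps above can be sketched in a few lines of Python. This is only an illustration of one common approach (character n-gram profiles compared by cosine similarity), not necessarily how LANSHER works internally. The function names `ngram_profile`, `cosine`, `train` and `classify` are hypothetical and are not part of the program's interface. The default of `n=4` is chosen to match the 4-tuple default mentioned in the changelog.

```python
import json
from collections import Counter
from math import sqrt


def ngram_profile(text, n=4):
    """Count overlapping character n-grams in the lowercased text."""
    text = text.lower()
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))


def cosine(a, b):
    """Cosine similarity between two sparse n-gram count vectors."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm = sqrt(sum(c * c for c in a.values())) * sqrt(sum(c * c for c in b.values()))
    return dot / norm if norm else 0.0


def train(samples, n=4):
    """Build one n-gram profile per language code (hypothetical helper)."""
    return {lang: ngram_profile(text, n) for lang, text in samples.items()}


def classify(profiles, text, n=4):
    """Return the language code whose profile is most similar to the text."""
    query = ngram_profile(text, n)
    return max(profiles, key=lambda lang: cosine(query, profiles[lang]))


# Step 1: sample data in the JSON format described above (made-up sentences).
samples = json.loads("""
{
  "en": "this is a sample text in english about the weather and the news",
  "cs": "toto je ukázkový text v češtině o počasí a zprávách"
}
""")
profiles = train(samples)

# Step 2: categorize an input text.
print(classify(profiles, "the weather in english"))  # prints "en"
```

With samples this short the result is fragile. A real detector would need far more text per language, and might combine word-level and character-level features.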
## More Info
The program was written by Vilém Zouhar for winter semester 2017/2018 at MFF.
## Changelog
#### 0.1.2
* 4-tuples by default
#### 0.1.1
* added support for n-tuples
* easier word/char/tuples balancing
#### 0.1.0
* first release