https://github.com/xxyzz/proficiency
Language files for WordDumb
https://github.com/xxyzz/proficiency
dbnary dictionary kaikki language-learning python wiktionary
Last synced: 20 days ago
JSON representation
Language files for WordDumb
- Host: GitHub
- URL: https://github.com/xxyzz/proficiency
- Owner: xxyzz
- License: gpl-3.0
- Created: 2022-08-21T07:53:49.000Z (almost 3 years ago)
- Default Branch: master
- Last Pushed: 2025-03-05T08:26:39.000Z (3 months ago)
- Last Synced: 2025-05-12T14:28:18.841Z (20 days ago)
- Topics: dbnary, dictionary, kaikki, language-learning, python, wiktionary
- Language: Python
- Homepage:
- Size: 1.64 MB
- Stars: 6
- Watchers: 2
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- Funding: .github/FUNDING.yml
- License: LICENSE
Awesome Lists containing this project
README
# Proficiency
Language files for [WordDumb](https://github.com/xxyzz/WordDumb).
## Data source
Wiktionary data come from kaikki.org and [Dbnary](https://kaiko.getalp.org/about-dbnary). Word difficulty data sources are listed in each language subfolders.
## Dependencies
- Python
- [Wget](https://www.gnu.org/software/wget)
- [lemminflect](https://github.com/bjascob/LemmInflect): inflect English words
- [Open Chinese Convert](https://github.com/BYVoid/OpenCC): convert Chinese characters
- [wordfreq](https://github.com/rspeer/wordfreq): get word frequency data
- [wiktextract-lemmatization](https://github.com/Vuizur/wiktextract-lemmatization): remove [stress](https://en.wikipedia.org/wiki/Stress_(linguistics))
- [Perl](https://www.perl.org): Remove invalid text in Dbnary files
- lbzip2 or bzip2
- [oxigraph](https://github.com/oxigraph/oxigraph)
- pigz or gzip
## Create files
```
$ python -m venv .venv
$ source .venv/bin/activate.fish
$ python -m pip install .
$ proficiency en
```Change the [venv](https://docs.python.org/3/library/venv.html) invoke command according to your shell.
## License
This work is licensed under GPL version 3 or later.