https://github.com/simenholmestad/common_norwegian_words
A repo for creating lists of common norwegian words
https://github.com/simenholmestad/common_norwegian_words
Last synced: 3 months ago
JSON representation
A repo for creating lists of common norwegian words
- Host: GitHub
- URL: https://github.com/simenholmestad/common_norwegian_words
- Owner: SimenHolmestad
- Created: 2021-12-26T18:56:05.000Z (over 3 years ago)
- Default Branch: master
- Last Pushed: 2021-12-26T18:56:53.000Z (over 3 years ago)
- Last Synced: 2025-01-25T21:09:34.218Z (5 months ago)
- Language: Python
- Size: 4.05 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: readme.md
Awesome Lists containing this project
README
# Common Norwegian words
This repo was made to create a list of common Norwegian words to use for the [MonkeyType](https://monkeytype.com/) website.
The best Norwegian frequency list I could find online is this: . More specifically this file: . The problem is that this frequency list contains words like "finland" (country), "hvad" (old, outdated language) and "2" (number).
To remove such words, this repo uses the Norwegian Scrabble Associasions dictionary () to check whether the words in the frequency list should be included in the final list or not.
An additional list of words which did not feel right to include in the MonkeyType word list is also added and used for additional filtering.
## Usage
Download the repo and run the follwing command to create the MonkeyType files in the correct format:``` sh
python3 create_monkeytype_files.py
```A list of .json-files should appear.