Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/tfiamietsh/popular-japanese-words
Analyzing and studying commonly used Japanese words
https://github.com/tfiamietsh/popular-japanese-words
analysis ipynb japanese japanese-dictionary java javafx
Last synced: about 1 month ago
JSON representation
Analyzing and studying commonly used Japanese words
- Host: GitHub
- URL: https://github.com/tfiamietsh/popular-japanese-words
- Owner: tfiamietsh
- License: apache-2.0
- Created: 2023-06-13T19:15:21.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2023-08-19T13:33:12.000Z (over 1 year ago)
- Last Synced: 2024-11-13T08:41:13.863Z (3 months ago)
- Topics: analysis, ipynb, japanese, japanese-dictionary, java, javafx
- Language: Jupyter Notebook
- Homepage:
- Size: 5.77 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Popular Japanese words
There are two projects in the repository that uses the [popular Japanese words dataset](https://www.kaggle.com/datasets/dinislamgaraev/popular-japanese-words).
## Analysis
The `main` branch contains a Jupyter notebook with an explorative analysis of the dataset. The analysis looks at how the words are distributed across different contexts and allows to ascertain some interesting facts about the Japanese language.
## Offline dictionary
The `dev` branch contains source code in Java for a dictionary app based on the words from the dataset. Key features of the dictionary are listed below.
Check the Releases page to download the latest version.P.S. Don't let the title of the app scare you :smile:.
**Features:**
- Up to 30k commonly used Japanese words are included
- An example sentence and an English translation are provided for each word
- Integrated Romaji to Hiragana IME
- Hiragana-Kanji prefix tree allows for fast and accurate word retrieval**How does search with IME works?**
Let's say we want to look up the word 免許証 (katakana: めんきょしょう, romaji: menkyoshou), so we start typing romaji
![]()