https://github.com/najibadan/jouyou_kanji_analysis
Cleaning, exploring analyzing and visualizing the 常用漢字(Jouyou Kanji)
https://github.com/najibadan/jouyou_kanji_analysis
Last synced: 7 months ago
JSON representation
Cleaning, exploring analyzing and visualizing the 常用漢字(Jouyou Kanji)
- Host: GitHub
- URL: https://github.com/najibadan/jouyou_kanji_analysis
- Owner: NajibAdan
- Created: 2023-02-12T18:06:00.000Z (almost 3 years ago)
- Default Branch: master
- Last Pushed: 2023-02-19T15:46:40.000Z (almost 3 years ago)
- Last Synced: 2025-02-16T19:17:43.822Z (9 months ago)
- Language: Jupyter Notebook
- Size: 3.92 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Project: Investigating the Jouyou Kanji (常用漢字)
## Introduction
The Jouyou kanji (literally regular use Chinese Characters) are a list of kanji characters and readings annouced officially by the Japanese Ministry of Education. Currently it's a list of 2,136 characters but it's not an exhaustive list of all the kanjis in regular use. It is intended as a literacy baseline for those who have completed compulsory education, as well as a list of permitted characters and readings for use in official government documents.
The dataset I used to investigate is from [kaggle](https://www.kaggle.com/datasets/anthaus/japanese-jy-kanji).
## Structure Explanation
- `exploration.ipynb` contains my wrangling and exploration
- `explanatory.ipynb` is where I present my findings and insights
## Insights
- 50% of the jouyou kanji are taught in secondary grade
- 鬱 has the most number of strokes (29)
- Kanjis that were added in 2010 and 1981 were for the secondary grade
- Kanji with an old (kyuujitai) form have a minimum stroke number of 3 and a maximum stroke number of 20. Majority are in the 7-15 range with 11 having the most kanji.
- The water radical (水) is the most common radical with 120 kanji. The radical 隶 had the least number of kanji.
- Only 121 kanji are similiar to it's radical (i.e where the kanji and radical are exactly the same)
## Recommendation
Maybe more insights could be found if we access to kanjis for the Kanji Kentei levels Pre-Level One and Level One.