{"id":24177659,"url":"https://github.com/vitto4/minnanods","last_synced_at":"2026-06-09T00:31:39.723Z","repository":{"id":265535706,"uuid":"895699636","full_name":"vitto4/MinnaNoDS","owner":"vitto4","description":"Lightly-opinionated compendium of vocabulary from Minna no Nihongo Shokyū I \u0026 II books.","archived":false,"fork":false,"pushed_at":"2025-02-05T17:32:55.000Z","size":562,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-03-02T17:12:26.397Z","etag":null,"topics":["japanese-language","japanese-study","language-learning","minna-no-nihongo","vocabulary-lists","yaml"],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/vitto4.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-11-28T17:54:32.000Z","updated_at":"2025-02-05T17:32:59.000Z","dependencies_parsed_at":"2025-01-27T08:33:06.866Z","dependency_job_id":null,"html_url":"https://github.com/vitto4/MinnaNoDS","commit_stats":null,"previous_names":["vitto4/minnanodb"],"tags_count":1,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vitto4%2FMinnaNoDS","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vitto4%2FMinnaNoDS/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vitto4%2FMinnaNoDS/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vitto4%2FMinnaNoDS/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/vitto4","download_url":"https://codeload.github.com/vitto4/MinnaNoDS/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":241541450,"owners_count":19979121,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["japanese-language","japanese-study","language-learning","minna-no-nihongo","vocabulary-lists","yaml"],"created_at":"2025-01-13T04:15:25.251Z","updated_at":"2026-06-09T00:31:39.716Z","avatar_url":"https://github.com/vitto4.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"# MinnaNoDS\n\u003cp align=\"center\"\u003e\n  \u003ca href=\"https://yaml.org/\"\u003e\n    \u003cimg alt=\"YouTube\" src=\"https://img.shields.io/badge/YAML-CB171E?logo=yaml\u0026logoColor=fff\u0026style=flat-square\"\n  /\u003e\u003c/a\u003e\n  \u003ca href=\"https://en.wikipedia.org/wiki/Japanese_language\"\u003e\n    \u003cimg alt=\"Python\" src=\"https://img.shields.io/badge/lang-%20%E6%97%A5%E6%9C%AC%E8%AA%9E-forestgreen?style=flat-square\"\n  /\u003e\u003c/a\u003e\n  \u003ca href=\"https://github.com/vitto4/MinnaNoDS/releases\"\u003e\n    \u003cimg alt=\"GitHub Release\" src=\"https://img.shields.io/github/v/release/vitto4/MinnaNoDS?style=flat-square\"\n  /\u003e\u003c/a\u003e\n  \n\n\u003c/p\u003e\n\n\u003cp align=\"center\"\u003eAll of the vocabulary from your \u003ccode\u003eMinna no Nihongo Shokyū Ⅰ \u0026 Ⅱ\u003c/code\u003e textbooks formulated into a tiny little YAML file.\u003c/p\u003e\n\n\u003cbr\u003e\n\n\n```yaml\n  - id: [2, 10]\n    edition: [1, 2]\n    kanji: \"新聞\"\n    kana: \"しんぶん\"\n    romaji: \"shinbun\"\n    meaning: {\n      en: \"newspaper\",\n      fr: \"journal\",\n    }\n```\n\u003cp align=\"center\"\u003e\u003csup\u003e\u003cins\u003e\u003ci\u003eFigure\u003c/i\u003e\u003c/ins\u003e – A word straight from the dataset. More information on the format \u003ca href=\"https://github.com/vitto4/MinnaNoDS/blob/ae20b1408e2752642618312728ff817fe2479dd4/minna-no-ds.yaml#L93-L107\"\u003ethis way\u003c/a\u003e.\u003c/sup\u003e\u003c/p\u003e\n\n\n\n## 🧭 Table of contents\n1. [Overview](#-overview)\n1. [Usage](#️-usage)\n1. [Bibliography](#-bibliography)\n1. [Conventions](#-conventions)\n1. [Notes](#-notes)\n1. [Warning](#-warning)\n\n## ☁ Overview\n\nThis project aims to serve as a comprehensive vocabulary list for the [`Minna no Nihongo Shokyū`](https://www.3anet.co.jp/np/en/list.html?af=1\u0026g=7-12-33\u0026s=1 \"みんなの日本語 初級 Ⅰ \u0026 Ⅱ\") series, packaged into a single YAML file – which we'll refer to as *the dataset*. \u003cbr\u003e\nTo be more specific, it intends to be as close as possible to the source material, in an effort to (hopefully) provide a foundation anyone can use or expand on.\n\n\nThe dataset currently targets two languages for `meaning` :\n```yaml\nlanguages:\n  en: \"English\"\n  fr: \"Français\"\n```\n\u003cp align=\"center\"\u003e\u003csup\u003e Further information \u003ca href=\"https://github.com/vitto4/MinnaNoDS/blob/ae20b1408e2752642618312728ff817fe2479dd4/minna-no-ds.yaml#L26-L31\"\u003ehere\u003c/a\u003e.\u003c/sup\u003e\u003c/p\u003e\n\n\n**Rōmaji** are provided solely for convenience, and do not correspond to those of the [*rōmaji edition*](https://www.3anet.co.jp/np/en/books/2303/ \"みんなの日本語 初級 ローマ字版\") of the books. \u003cbr\u003e\nThese were generated using a mix of [`pykakasi`](https://pypi.org/project/pykakasi/) and readings supplied by [Google Translate](https://translate.google.com/). As a result, they more or less follow standards set by the [*Modified Hepburn*](https://en.wikipedia.org/wiki/Hepburn_romanization#Variants) system (yes, mācrōns inclūdēd !).\n\n## ⚙️ Usage\n\nHere is a basic example in [`python`](https://www.python.org/).\n\n```py\nimport yaml\n\n# Load the dataset\nwith open(\"minna-no-ds.yaml\", \"r\", encoding=\"utf-8\") as f:\n  ds = yaml.load(f, Loader=yaml.FullLoader)\n\n\n# Extract the keys for all available lessons\nlessons: list = [lesson[\"key\"] for lesson in ds[\"lessons\"]] # ['lesson-01', 'lesson-02', ...]\n\n\n# Go through each lesson and print out its contents\nfor key in lessons:\n  print(f\"Contents of {key}\") # Outputs : Contents of lesson-01\n  print(ds[key]) # Outputs : [{'id': [1, 1], 'edition': [1, 2], 'kanji': None, 'kana': 'わたし', 'romaji': 'watashi', 'meaning': {'en': 'I', 'fr': 'je, moi'}}, ...]\n```\n\n## 📚 Bibliography\n\nAs you may know, `Minna no Nihongo Shokyū` comes in two books of twenty-five lessons each ; both available in two editions (the second of which is an updated version of the original).\n\nPresented bellow is a table showing the books used in the making of the dataset.\n\n| 📗📘📙 | First Edition | Second Edition |\n|:-----:|:-------------:|:--------------:|\n| **Book 1**\u003cbr\u003e*English Version*  | みんなの日本語初級Ⅰ 翻訳・文法解説英語版\u003cbr\u003e[`ISBN : 9784883191079`](https://web.archive.org/web/20040820203739/http://www.3anet.co.jp/english/text_e_m_trans.html) | みんなの日本語初級Ⅰ 第２版 翻訳・文法解説 英語版\u003cbr\u003e[`ISBN : 9784883196043`](https://www.3anet.co.jp/np/en/books/2302/) |\n| **Book 2**\u003cbr\u003e*English Version* | みんなの日本語初級Ⅱ 翻訳・文法解説英語版\u003cbr\u003e[`ISBN : 9784883191086`](https://web.archive.org/web/20040820203739/http://www.3anet.co.jp/english/text_e_m_trans.html) | みんなの日本語初級Ⅱ 第２版 翻訳・文法解説 英語版\u003cbr\u003e[`ISBN : 9784883196647`](https://www.3anet.co.jp/np/en/books/2402/) |\n| **Book 1**\u003cbr\u003e*French Version*  | みんなの日本語初級Ⅰ 翻訳・文法解説フランス語版\u003cbr\u003e[`ISBN : 9784883191338`](https://web.archive.org/web/20040820203739/http://www.3anet.co.jp/english/text_e_m_trans.html)|  みんなの日本語初級Ⅰ 第２版 翻訳・文法解説 フランス語版\u003cbr\u003e[`ISBN : 9784883196456`](https://www.3anet.co.jp/np/en/books/2312/) |\n| **Book 2**\u003cbr\u003e*French Version* | みんなの日本語初級Ⅱ 翻訳・文法解説フランス語版\u003cbr\u003e[`ISBN : 9784883191383`](https://web.archive.org/web/20040820203739/http://www.3anet.co.jp/english/text_e_m_trans.html) | みんなの日本語初級Ⅱ 第２版 翻訳・文法解説 フランス語版\u003cbr\u003e[`ISBN : 9784883197057`](https://www.3anet.co.jp/np/en/books/2412/) |\n\n\n## 🚦 Conventions\n\nWhat I call a *convention* is any rule I set while creating the dataset that is not directly derived from the source material.\n\n\u003cp align=\"center\"\u003e\n  See \n  \u003ca href=\"https://github.com/vitto4/MinnaNoDS/blob/main/CONVENTIONS.md\"\u003e\n    \u003ccode\u003eCONVENTIONS.md\u003c/code\u003e\n  \u003c/a\u003e\n\u003c/p\u003e\n\nThis file also includes general information about the structure of the dataset.\n\n## 🔖 Notes\n\n- When starting out with this project, I used [Paul Denisowski's vocabulary lists](http://www.denisowski.org/Japanese/Japanese.html) to generate a blank template for me to fill in. Serious time-saver right there !\n- As strings in `romaji` do not need to be spellchecked, you may use the following config with [`CSpell`](https://cspell.org/).\n  ```json\n    \"cSpell.ignoreRegExpList\": [\n        \"/romaji:\\\\s*\\\"[^\\\"]*\\\"/gi\"\n    ]\n  ```\n- This project should have shipped with the set of scripts I used to lint and validate the dataset. \u003cbr\u003e\n  It didn't, but who knows, I may get to it when (if) I stop being obsessed with that one space dwarves simulator ¯\\\\\\_(ツ)_/¯\n- Adding or removing a word will alter the `id` of all subsequent words in the same lesson. \u003cbr\u003e\n  Therefore, any time this has to be done, the version number will have to be bumped to the next major release as this could be considered breaking change for anyone using `id` as a [primary key](## \"Which it intends to be, when it is in fact more of something I believed to be called a `natural key` and may thus be unstable.\").\n- « This must have taken quite the amount of time to make » well you don't say ! (笑) \u003cbr\u003e\n  Though I think I'm happy with how it turned out c:\n- ~~I haven't yet managed to get my hands on a French version of the first edition of book 1, so words found exclusively in `Book 1, edition 1` have no French `meaning` for now~~　—　fixed as of [v1.1.0](https://github.com/vitto4/MinnaNoDS/releases/tag/v1.1.0).\n\n\n\n## 🚧 Warning\n\n```yaml\n#   * The selection of words and their respective translations are the sole property of 3A Corporation.\n#     This dataset and subsequent projects that depend on it shall only be used *in conjunction with* – and not *as a substitute for* – the books ; so as to not cause any financial harm to the IP owners.\n#   * As per previous remarks, no commercial use of this file shall be admissible.\n```\n\u003cp align=\"center\"\u003e\u003csup\u003e More \u003ca href=\"https://github.com/vitto4/MinnaNoDS/blob/ae20b1408e2752642618312728ff817fe2479dd4/minna-no-ds.yaml#L12-L14\"\u003ehere\u003c/a\u003e.\u003c/sup\u003e\u003c/p\u003e\n\nThe lack of license is deliberate, as I am uncertain about the appropriate licensing options for this project.\nContent isn't mine, only the dataset structure and the actual work of filling it in. \u003cbr\u003e\nIf you know a suitable option, feel free to open an issue !\n\nHopefully that doesn't stop anyone from using the dataset though.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fvitto4%2Fminnanods","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fvitto4%2Fminnanods","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fvitto4%2Fminnanods/lists"}