{"id":13776521,"url":"https://github.com/suminb/hanja","last_synced_at":"2025-04-06T03:07:46.752Z","repository":{"id":8140644,"uuid":"9559426","full_name":"suminb/hanja","owner":"suminb","description":"한글, 한자 라이브러리","archived":false,"fork":false,"pushed_at":"2024-04-24T16:41:03.000Z","size":710,"stargazers_count":122,"open_issues_count":9,"forks_count":14,"subscribers_count":6,"default_branch":"develop","last_synced_at":"2024-05-03T01:16:45.378Z","etag":null,"topics":["hangul","hanja","nlp","python"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/suminb.png","metadata":{"files":{"readme":"README.rst","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2013-04-20T04:49:12.000Z","updated_at":"2024-07-31T12:20:09.856Z","dependencies_parsed_at":"2024-05-31T08:41:44.744Z","dependency_job_id":"88e7752c-5028-4235-9c77-4392f1c0addd","html_url":"https://github.com/suminb/hanja","commit_stats":{"total_commits":77,"total_committers":8,"mean_commits":9.625,"dds":0.2597402597402597,"last_synced_commit":"2138463a5b4b9251df2f310144d32591f0ddbca3"},"previous_names":[],"tags_count":17,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/suminb%2Fhanja","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/suminb%2Fhanja/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/suminb%2Fhanja/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/suminb%2Fhanja/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/suminb","download_url":"https://codeload.github.com/suminb/hanja/tar.gz/refs/heads/develop","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247427006,"owners_count":20937201,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["hangul","hanja","nlp","python"],"created_at":"2024-08-03T18:00:28.071Z","updated_at":"2025-04-06T03:07:46.729Z","avatar_url":"https://github.com/suminb.png","language":"Python","funding_links":[],"categories":["Programming Languages"],"sub_categories":["Python"],"readme":"hanja: 한자-한글 변환 라이브러리\n================================\n\n`한자-한글 변환기`__\\ 에서 사용되는 모듈입니다.\n\n__ http://hanja.suminb.com\n\n\nImprove Hanja Library\n---------------------\n\n사용 하시다가 빠진 한자 또는 틀린 독음을 발견하시면 `이 링크\n\u003chttps://docs.google.com/forms/d/e/1FAIpQLScAtw6ylAhy1t0hMn5K25ZbN1vSNPlRdUtebS9PVtKeLQRfvw/viewform\u003e`_\\\n를 통해 제보해주세요. 확인 후 반영하도록 하겠습니다. GitHub을 통해 직접 PR을\n보내주셔도 좋습니다.\n\n\nInstallation\n------------\n\n.. code-block:: console\n\n   pip install hanja\n\n\nUsage\n------\n\n필요한 모듈 import 하기\n```````````````````````\n\n.. code-block:: python\n\n   \u003e\u003e\u003e import hanja\n   \u003e\u003e\u003e from hanja import hangul\n\n한글 초성, 중성, 종성 분리\n``````````````````````````\n.. code-block:: python\n\n   \u003e\u003e\u003e hangul.separate('가')\n   (0, 0, 0)\n   \u003e\u003e\u003e hangul.separate('까')\n   (1, 0, 0)\n\n튜플(tuple)의 마지막 원소가 0이면 종성이 없는 글자라고 판단할 수 있다.\n\n.. code-block:: python\n\n   \u003e\u003e\u003e hangul.separate('한')\n   (18, 0, 4)\n\n'ㅎ'은 19번째 자음, 'ㅏ'는 첫번째 모음, 'ㄴ'은 다섯번째 자음이라는 것을 알 수 있다.\n\n\n초성, 중성, 종성을 조합하여 한 글자를 만듦\n``````````````````````````````````````````\n\n.. code-block:: python\n\n   \u003e\u003e\u003e hangul.build(0, 0, 0)\n   '가'\n\n\n주어진 글자가 한글인지의 여부를 판별\n````````````````````````````````````\n\n.. code-block:: python\n\n   \u003e\u003e\u003e hangul.is_hangul('가')\n   True\n   \u003e\u003e\u003e hangul.is_hangul('a')\n   False\n\n\n한글로 된 부분과 한자로 된 부분을 분리\n``````````````````````````````````````\n\n리스트가 아닌 제네레이터(generator)를 반환한다.\n\n.. code-block:: python\n\n   \u003e\u003e\u003e '|'.join(hanja.split_hanja('大韓民國은 民主共和國이다.'))\n   大韓民國|은 |民主共和國|이다.\n\n   \u003e\u003e\u003e [x for x in hanja.split_hanja('大韓民國은 民主共和國이다.')]\n   ['大韓民國', '은 ', '民主共和國', '이다.']\n\n주어진 글자가 한자인지의 여부를 판별\n````````````````````````````````````\n\n.. code-block:: python\n\n   \u003e\u003e\u003e hanja.is_hanja('韓')\n   True\n\n   \u003e\u003e\u003e hanja.is_hanja('한')\n   False\n\n문장 변환\n`````````\n\n치환 모드 변환:\n\n.. code-block:: python\n\n   \u003e\u003e\u003e hanja.translate('大韓民國은 民主共和國이다.', 'substitution')\n   '대한민국은 민주공화국이다.'\n\n혼용 모드 변환 (text):\n\n.. code-block:: python\n\n   \u003e\u003e\u003e hanja.translate('大韓民國은 民主共和國이다.', 'combination-text')\n   '大韓民國(대한민국)은 民主共和國(민주공화국)이다.'\n\n혼용 모드 변환 version 2 (text):\n\n.. code-block:: python\n\n   \u003e\u003e\u003e hanja.translate('大韓民國은 民主共和國이다.', 'combination-text-reversed')\n   '대한민국(大韓民國)은 민주공화국(民主共和國)이다.'\n\n혼용 모드 변환 (HTML):\n\n.. code-block:: python\n\n   \u003e\u003e\u003e hanja.translate(u'大韓民國은 民主共和國이다.', 'combination-html')\n   '\u003cspan class=\"hanja\"\u003e大韓民國\u003c/span\u003e\u003cspan class=\"hangul\"\u003e(대한민국)\u003c/span\u003e은 \u003cspan class=\"hanja\"\u003e民主共和國\u003c/span\u003e\u003cspan class=\"hangul\"\u003e(민주공화국)\u003c/span\u003e이다.'\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsuminb%2Fhanja","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fsuminb%2Fhanja","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsuminb%2Fhanja/lists"}