{"id":20751026,"url":"https://github.com/vaaaaanquish/docker-uth-bert","last_synced_at":"2025-06-21T10:08:03.303Z","repository":{"id":39728424,"uuid":"260040273","full_name":"vaaaaanquish/docker-UTH-BERT","owner":"vaaaaanquish","description":"docker for UTH-BERT: https://ai-health.m.u-tokyo.ac.jp/uth-bert","archived":false,"fork":false,"pushed_at":"2023-03-24T23:30:56.000Z","size":29,"stargazers_count":14,"open_issues_count":1,"forks_count":0,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-06-21T10:07:46.085Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"https://github.com/vaaaaanquish/gokart-UTH-BERT","language":"Dockerfile","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/vaaaaanquish.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2020-04-29T20:35:53.000Z","updated_at":"2021-09-15T00:30:27.000Z","dependencies_parsed_at":"2024-11-17T23:16:41.353Z","dependency_job_id":null,"html_url":"https://github.com/vaaaaanquish/docker-UTH-BERT","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/vaaaaanquish/docker-UTH-BERT","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vaaaaanquish%2Fdocker-UTH-BERT","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vaaaaanquish%2Fdocker-UTH-BERT/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vaaaaanquish%2Fdocker-UTH-BERT/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vaaaaanquish%2Fdocker-UTH-BERT/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/vaaaaanquish","download_url":"https://codeload.github.com/vaaaaanquish/docker-UTH-BERT/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vaaaaanquish%2Fdocker-UTH-BERT/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":261103451,"owners_count":23109932,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-17T08:29:44.557Z","updated_at":"2025-06-21T10:07:58.264Z","avatar_url":"https://github.com/vaaaaanquish.png","language":"Dockerfile","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Docker for UTH-BERT\n\n東京大学大学院 医学系研究科 医療AI開発講座より公開されているBERT学習済みモデルであるUTH-BERTを動かすためのDockerfileとスクリプト群です。  \n  \nUTH-BERT: https://ai-health.m.u-tokyo.ac.jp/uth-bert\n\n\n# Usage\n\n以下のコマンドでdockerコンテナを作成し動かします。mecab、neologd、J-MeDic(万病辞書)とgoogle-researchのbertモデルrepogitoryが入り、活用できる状態のコンテナが生成されbashが走ります。\n\n```\ndocker build -t bert .\ndocker run -it bert bash\n```\n\n# 特徴抽出\n\nテキストファイルを特徴量に変換するには以下のコマンドを実行します。\n入力テキストをtokenizeしたファイル、featureのファイルが出力されます。\n\n```\n./extract_feature.sh\n```\n\noutput.jsonlファイルがinput.txtに対する特徴量です。  \ninputはサンプルとして[Wikipediaのインフルエンザのページ](https://ja.wikipedia.org/wiki/%E3%82%A4%E3%83%B3%E3%83%95%E3%83%AB%E3%82%A8%E3%83%B3%E3%82%B6\n)の一部設置しています。\n```\nインフルエンザ\n\nインフルエンザ（イタリア語: influenza、ラテン語: influentia）とはインフルエンザウイルス急性感染症。上気道炎症状・呼吸器疾患などを呈する。流行性感冒（りゅうこうせいかんぼう）略して流感（りゅうかん）とも呼ばれる。日本語ではインフル、英語ではfluと略されることも多い。\n...\n```\n\n上記サンプルに対するoutput.jsonlは以下のような形式で1行1jsonの形で出力されます。\n```\n{\"linex_index\": 0, \"features\": [ {\"token\": \"インフルエンザ\", \"layers\": {\"index\": -1, \"values\": [0.1, 0.2, ...]}}, ... ]}\n...\n```\n\n# THX \u0026 LICENCE\n\n順不同\n\n - [UTH-BERT](https://ai-health.m.u-tokyo.ac.jp/uth-bert): [Creative Commons 4.0 International License](https://creativecommons.org/licenses/by-nc-sa/4.0/deed.en) (CC BY-NC-SA 4.0)\n - [万病辞書](http://sociocom.jp/~data/2018-manbyo/index.html): [Creative Commons 4.0 International License](https://creativecommons.org/licenses/by-nc-sa/4.0/deed.en) (CC BY 4.0)\n - [mecab-ipadic-NEologd](https://github.com/neologd/mecab-ipadic-neologd): [Apache License, Version 2.0](https://github.com/neologd/mecab-ipadic-neologd/blob/master/COPYING)\n - [google-research/bert](https://github.com/google-research/bert): [Apache License, Version 2.0](https://github.com/google-research/bert/blob/master/LICENSE)\n - [MeCab](https://taku910.github.io/mecab/): [BSD,LGPL,GPL](https://github.com/taku910/mecab/blob/3a07c4eefaffb4e7a0690a7f4e5e0263d3ddb8a3/mecab/COPYING)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fvaaaaanquish%2Fdocker-uth-bert","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fvaaaaanquish%2Fdocker-uth-bert","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fvaaaaanquish%2Fdocker-uth-bert/lists"}