{"id":13535171,"url":"https://github.com/zhpmatrix/bert-sequence-tagging","last_synced_at":"2025-04-12T23:09:43.994Z","repository":{"id":117937422,"uuid":"156714199","full_name":"zhpmatrix/bert-sequence-tagging","owner":"zhpmatrix","description":"BERT-based Chinese sequence tagging","archived":false,"fork":false,"pushed_at":"2018-11-08T14:04:09.000Z","size":9,"stargazers_count":141,"open_issues_count":3,"forks_count":37,"subscribers_count":13,"default_branch":"master","last_synced_at":"2025-03-26T17:11:20.035Z","etag":null,"topics":["bert","chinese","sequence-tagging"],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/zhpmatrix.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null}},"created_at":"2018-11-08T13:52:11.000Z","updated_at":"2025-01-17T09:44:30.000Z","dependencies_parsed_at":"2024-01-14T02:37:02.407Z","dependency_job_id":"370e2443-ca31-4b6a-bc5f-8bdb3425dd80","html_url":"https://github.com/zhpmatrix/bert-sequence-tagging","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zhpmatrix%2Fbert-sequence-tagging","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zhpmatrix%2Fbert-sequence-tagging/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zhpmatrix%2Fbert-sequence-tagging/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zhpmatrix%2Fbert-sequence-tagging/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/zhpmatrix","download_url":"https://codeload.github.com/zhpmatrix/bert-sequence-tagging/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248643004,"owners_count":21138355,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["bert","chinese","sequence-tagging"],"created_at":"2024-08-01T08:00:50.728Z","updated_at":"2025-04-12T23:09:43.972Z","avatar_url":"https://github.com/zhpmatrix.png","language":"Python","readme":"# bert-sequence-tagging\nBERT-based Chinese sequence tagging\n\nJacob, the first author of BERT, has said he does not plan to release sequence tagging code, although you can find discussions he took part in under the issues.\n\nThis repo cannot be run as is, but if you really need this implementation, you should be able to understand most of it at a glance.\n\nSequence tagging has many applications, such as NER and POS tagging. It may also be worth trying in the recent Tianchi Ruijin competition and in Kaggle's newest text classification competition, although for that Kaggle competition the code Jacob released should work directly without many changes.\n\nI wrote a sequence tagging module on top of the google-bert source code and ran a preliminary test on the IJCNLP CGED data. The results are shown in the figure below.\n\n![Experimental results](http://wx1.sinaimg.cn/mw690/aba7d18bly1fx0zcmf50qj20fr0th79n.jpg)\n\nThe top panel shows the results produced by the C++ code that a Harbin Institute of Technology group released in 2017; the middle panel shows the results I reproduced with TensorFlow; the bottom panel shows a result obtained by fine-tuning BERT.\n\nJudging from these results, the implementation appears to be correct for now.\n\n\n","funding_links":[],"categories":["BERT  NER  task:","Tasks"],"sub_categories":["Named-Entity Recognition (NER)"],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fzhpmatrix%2Fbert-sequence-tagging","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fzhpmatrix%2Fbert-sequence-tagging","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fzhpmatrix%2Fbert-sequence-tagging/lists"}