{"id":19993167,"url":"https://github.com/jimichan/mynlp","last_synced_at":"2025-05-04T12:30:55.706Z","repository":{"id":39381145,"uuid":"113726044","full_name":"mayabot/mynlp","owner":"mayabot","description":"一个生产级、高性能、模块化、可扩展的中文NLP工具包。（中文分词、平均感知机、fastText、拼音、新词发现、分词纠错、BM25、人名识别、命名实体、自定义词典）","archived":false,"fork":false,"pushed_at":"2023-12-21T16:14:59.000Z","size":6957,"stargazers_count":671,"open_issues_count":18,"forks_count":90,"subscribers_count":30,"default_branch":"master","last_synced_at":"2024-07-27T08:44:12.124Z","etag":null,"topics":["fasttext","nlp","pinyin","segment","starspace"],"latest_commit_sha":null,"homepage":"https://mynlp.mayabot.com/","language":"Java","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/mayabot.png","metadata":{"files":{"readme":"README.adoc","changelog":"CHANGES.md","contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null}},"created_at":"2017-12-10T05:35:01.000Z","updated_at":"2024-07-21T18:00:47.000Z","dependencies_parsed_at":"2023-12-21T19:00:37.745Z","dependency_job_id":"488b0187-470d-4527-b09b-c21def342c7d","html_url":"https://github.com/mayabot/mynlp","commit_stats":{"total_commits":551,"total_committers":8,"mean_commits":68.875,"dds":0.03266787658802173,"last_synced_commit":"b980da3a6f9cdcb83e0800f6cab50656df94a22a"},"previous_names":[],"tags_count":17,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mayabot%2Fmynlp","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mayabot%2Fmynlp/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mayabot%2Fmynlp/releases","manifests_url":"https://repos.ecosys
te.ms/api/v1/hosts/GitHub/repositories/mayabot%2Fmynlp/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/mayabot","download_url":"https://codeload.github.com/mayabot/mynlp/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":224393297,"owners_count":17303705,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["fasttext","nlp","pinyin","segment","starspace"],"created_at":"2024-11-13T04:52:29.653Z","updated_at":"2025-05-04T12:30:49.650Z","avatar_url":"https://github.com/mayabot.png","language":"Java","funding_links":[],"categories":["Java","人工智能"],"sub_categories":[],"readme":"= Mynlp: a high-performance, extensible Chinese NLP toolkit\n:version: 4.0.0\n:icons: font\n\nimage:https://img.shields.io/github/license/mayabot/mynlp.svg[]\nimage:https://maven-badges.herokuapp.com/maven-central/com.mayabot.mynlp/mynlp/badge.svg[link=https://maven-badges.herokuapp.com/maven-central/com.mayabot.mynlp/mynlp]\nimage:https://img.shields.io/github/release/mayabot/mynlp/all.svg[link=https://github.com/mayabot/mynlp/releases/latest]\nimage:https://img.shields.io/github/repo-size/mayabot/mynlp[link=https://github.com/mayabot/mynlp]\nimage:https://img.shields.io/github/issues-raw/mayabot/mynlp.svg[link=https://github.com/mayabot/mynlp/issues]\n\nimage::https://cdn.mayabot.com/mynlp/mynlp-banner.png[,500,align=center,link=https://mynlp.mayabot.com]\n\n[NOTE]\nSee the full online documentation at link:https://mynlp.mayabot.com/[mynlp.mayabot.com]\n\n== QQ group (2): 747892793\n\n== 
Installation\n\nThis section describes how to install mynlp and get started with its basic features.\n\nmynlp is published to Maven Central, so you only need to add the mynlp.jar dependency to your Maven or Gradle build.\n\n.Gradle\n[subs=\"attributes+\"]\n----\ncompile 'com.mayabot.mynlp:mynlp:{version}'\n----\n\n.Maven\n[source,xml,subs=\"attributes+\"]\n----\n\u003cdependency\u003e\n  \u003cgroupId\u003ecom.mayabot.mynlp\u003c/groupId\u003e\n  \u003cartifactId\u003emynlp\u003c/artifactId\u003e\n  \u003cversion\u003e{version}\u003c/version\u003e\n\u003c/dependency\u003e\n----\n\nBecause the resource files are large, mynlp.jar does not depend on the resource files (dictionaries and model files) by default.\n\nThe easy option is to depend on mynlp-all, which pulls in the default resource dictionaries and covers most needs.\n\n.Depending on mynlp-all\n[subs=\"attributes+\"]\n----\ncompile 'com.mayabot.mynlp:mynlp-all:{version}'\n----\n\n=== Dictionary and model resources\n\n.Dictionary \u0026 model resource list\n[cols=\"6,^1,^1,4\"]\n|===\n|Gradle coordinates | Included in mynlp-all |File size |Description\n\n|com.mayabot.mynlp.resource:mynlp-resource-coredict:1.0.0\n|Y\n|18.2M\n|Core dictionary (200,000+ words, 5,000,000+ bigrams)\n\n|com.mayabot.mynlp.resource:mynlp-resource-pos:1.0.0\n|Y\n|17.5M\n|Part-of-speech tagging model (perceptron model)\n\n|com.mayabot.mynlp.resource:mynlp-resource-ner:1.0.0\n|Y\n|13.4M\n|Named entity recognition (person name recognition and other NER)\n\n|com.mayabot.mynlp.resource:mynlp-resource-pinyin:1.1.0\n|Y\n|272K\n|Pinyin dictionary and pinyin segmentation model\n\n|com.mayabot.mynlp.resource:mynlp-resource-transform:1.0.0\n|Y\n|478K\n|Traditional/Simplified Chinese dictionary\n\n|com.mayabot.mynlp.resource:mynlp-resource-cws:1.0.0\n|N\n|62.4M\n|Perceptron word segmentation model\n\n|com.mayabot.mynlp.resource:mynlp-resource-custom:1.0.0\n|N\n|2.19M\n|Custom extension lexicon\n\n|===\n\nAdd resource packages as your project requires.\n\n[source]\n.A Gradle dependency example\n----\ncompile 'com.mayabot.mynlp:mynlp:3.2.0'\n\n// Core dictionary\nimplementation 'com.mayabot.mynlp.resource:mynlp-resource-coredict:1.0.0'\n\n// Part-of-speech tagging\nimplementation 'com.mayabot.mynlp.resource:mynlp-resource-pos:1.0.0'\n\n// Named entities\nimplementation 'com.mayabot.mynlp.resource:mynlp-resource-ner:1.0.0'\n\n// Pinyin\nimplementation 'com.mayabot.mynlp.resource:mynlp-resource-pinyin:1.1.0'\n\n// Traditional/Simplified Chinese conversion\nimplementation 'com.mayabot.mynlp.resource:mynlp-resource-transform:1.0.0'\n\n// Perceptron word segmentation model\n//   implementation 'com.mayabot.mynlp.resource:mynlp-resource-cws:1.0.0'\n\n// Custom extension lexicon\n//   implementation 
'com.mayabot.mynlp.resource:mynlp-resource-custom:1.0.0'\n----\n\n\n== Full online documentation\n\nlink:https://mynlp.mayabot.com/[mynlp.mayabot.com]\n\n== Thanks to the following excellent open-source projects\n\n- HanLP\n- ansj_seg\n\nThe mynlp implementation draws on their algorithm implementations and some of their code.","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjimichan%2Fmynlp","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fjimichan%2Fmynlp","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjimichan%2Fmynlp/lists"}