{"id":18600570,"url":"https://github.com/houbb/nlp-keyword","last_synced_at":"2025-05-05T23:54:17.792Z","repository":{"id":260441543,"uuid":"881310126","full_name":"houbb/nlp-keyword","owner":"houbb","description":"nlp-keyword based on segment.(基于分词的 NLP keyword 关键词) 支持关键词提取，自动摘要生成，文本相似度计算。","archived":false,"fork":false,"pushed_at":"2024-11-01T13:11:23.000Z","size":2173,"stargazers_count":6,"open_issues_count":0,"forks_count":3,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-05-05T23:54:11.914Z","etag":null,"topics":["ai","auto-summary","chat","chatgpt","gpt","keyword","nlp","segment","similarity"],"latest_commit_sha":null,"homepage":"","language":"Java","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"other","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/houbb.png","metadata":{"files":{"readme":"README.md","changelog":"CHANGELOG.md","contributing":null,"funding":null,"license":"LICENSE.txt","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-10-31T10:14:42.000Z","updated_at":"2025-01-26T02:19:35.000Z","dependencies_parsed_at":"2024-10-31T11:19:54.510Z","dependency_job_id":"989a9e55-8045-4b7f-a5c1-82981bfc8ed9","html_url":"https://github.com/houbb/nlp-keyword","commit_stats":null,"previous_names":["houbb/nlp-keyword"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/houbb%2Fnlp-keyword","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/houbb%2Fnlp-keyword/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/houbb%2Fnlp-keyword/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/houbb%2Fnlp-keyword/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/houbb","download_url":"https://codeload.github.com/houbb/nlp-keyword/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":252596396,"owners_count":21773844,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["ai","auto-summary","chat","chatgpt","gpt","keyword","nlp","segment","similarity"],"created_at":"2024-11-07T02:04:28.671Z","updated_at":"2025-05-05T23:54:17.774Z","avatar_url":"https://github.com/houbb.png","language":"Java","funding_links":[],"categories":[],"sub_categories":[],"readme":"# nlp-keyword\n\n[nlp-keyword](https://github.com/houbb/nlp-keyword) 高性能的 java 分词关键词提取实现，基于分词 [segment](https://github.com/houbb/segment)。\n\n愿景：成为 java 最好用的关键词工具。\n\n[![Build Status](https://travis-ci.com/houbb/nlp-keyword.svg?branch=master)](https://travis-ci.com/houbb/nlp-keyword)\n[![Maven Central](https://maven-badges.herokuapp.com/maven-central/com.github.houbb/keyword/badge.svg)](http://mvnrepository.com/artifact/com.github.houbb/nlp-keyword)\n[![](https://img.shields.io/badge/license-Apache2-FF0080.svg)](https://github.com/houbb/keyword/blob/master/LICENSE.txt)\n[![Open Source Love](https://badges.frapsoft.com/os/v2/open-source.svg?v=103)](https://github.com/houbb/nlp-keyword)\n\n## 特性\n\n- 灵活的条件指定\n\n- 基于 TF-IDF 算法的关键字算法\n\n- 支持 auto-summary 自动摘要生成\n\n- 文本相似度计算支持\n\n\u003e [变更日志](https://github.com/houbb/nlp-keyword/blob/master/CHANGELOG.md)\n\n# 关键词快速开始\n\n## maven 引入\n\n```xml\n\u003cdependency\u003e\n    \u003cgroupId\u003ecom.github.houbb\u003c/groupId\u003e\n    \u003cartifactId\u003enlp-keyword-core\u003c/artifactId\u003e\n    \u003cversion\u003e1.2.0\u003c/version\u003e\n\u003c/dependency\u003e\n```\n\n## 快速开始\n\n- 获取关键词\n\n```java\nfinal String text = \"这是一个伸手不见五指的黑夜，夜太美。\";\n\nList\u003cIKeywordResult\u003e keywords =  KeywordHelper.keyword(text);\nAssert.assertEquals(\"[伸手不见|0.86879235325, 太美|0.70848301840625, 伸手不见五指|0.63371734601875, 黑夜|0.509854654010625, 伸手|0.43563643037625]\", keywords.toString());\n```\n\n- 指定返回的个数\n\n```java\nfinal String text = \"这是一个伸手不见五指的黑夜，夜太美。\";\n\nList\u003cIKeywordResult\u003e keywords =  KeywordHelper.keyword(text, 1);\nAssert.assertEquals(\"[伸手不见|0.86879235325]\", keywords.toString());\n```\n\n- 指定返回的格式\n\n```java\nfinal String text = \"这是一个伸手不见五指的黑夜，夜太美。\";\n\nList\u003cString\u003e keywords =  KeywordHelper.keyword(text, 1, KeywordResultHandlers.words());\nAssert.assertEquals(\"[伸手不见]\", keywords.toString());\n```\n\n\n# 自动摘要\n\n## maven 引入\n\n```xml\n\u003cdependency\u003e\n    \u003cgroupId\u003ecom.github.houbb\u003c/groupId\u003e\n    \u003cartifactId\u003enlp-keyword-summary\u003c/artifactId\u003e\n    \u003cversion\u003e1.2.0\u003c/version\u003e\n\u003c/dependency\u003e\n```\n\n## 入门例子\n\n```java\nfinal String text = \"你好啊，我的老伙计。最近过得怎么样？我实在是觉得太久没有看见你了。听说北海的花开了，太久没去了。故事的最后会怎么样呢？我也不清楚。\";\n\nString summary = AutoSummaryHelper.autoSummary(text);\n```\n\n# 文本相似度\n\n## maven 引入\n\n```xml\n\u003cdependency\u003e\n    \u003cgroupId\u003ecom.github.houbb\u003c/groupId\u003e\n    \u003cartifactId\u003enlp-keyword-similarity\u003c/artifactId\u003e\n    \u003cversion\u003e1.2.0\u003c/version\u003e\n\u003c/dependency\u003e\n```\n\n## 入门例子\n\n```java\nfinal String source = \"我喜欢看电影，读书和旅游。\";\nfinal String target = \"我不喜欢看电影。我爱唱跳、RAP、Music~\";\n\ndouble rank = SimilarityHelper.similarity(source, target);\n```\n\n结果：\n\n```\n0.677537337470188\n```\n\n# ROAD-MAP\n\n- [] 字典数据独立，便于用户自定义选择\n\n- [x] 文本相似度\n\n- [x] auto-summary 自动摘要\n\n- [ ] 完善文档+测试用例","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fhoubb%2Fnlp-keyword","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fhoubb%2Fnlp-keyword","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fhoubb%2Fnlp-keyword/lists"}