{"id":21067151,"url":"https://github.com/idf/rake4j","last_synced_at":"2025-12-16T23:01:58.305Z","repository":{"id":24801593,"uuid":"28215596","full_name":"idf/rake4j","owner":"idf","description":"A Java implementation of the Rapid Automatic Keyword Extraction (RAKE)","archived":false,"fork":false,"pushed_at":"2017-05-30T15:16:27.000Z","size":211,"stargazers_count":7,"open_issues_count":2,"forks_count":5,"subscribers_count":4,"default_branch":"master","last_synced_at":"2025-04-03T19:38:15.356Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":"Java","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/idf.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2014-12-19T05:23:28.000Z","updated_at":"2023-03-10T09:41:20.000Z","dependencies_parsed_at":"2022-08-23T07:10:45.261Z","dependency_job_id":null,"html_url":"https://github.com/idf/rake4j","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/idf/rake4j","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/idf%2Frake4j","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/idf%2Frake4j/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/idf%2Frake4j/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/idf%2Frake4j/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/idf","download_url":"https://codeload.github.com/idf/rake4j/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/idf%2Frake4j/sbom","scorecard":null,"host":{"name":"
GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":27772609,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-12-16T02:00:10.477Z","response_time":57,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-19T18:04:40.929Z","updated_at":"2025-12-16T23:01:58.277Z","avatar_url":"https://github.com/idf.png","language":"Java","funding_links":[],"categories":[],"sub_categories":[],"readme":"rake4j\n======\nThis is a re-write of [Python RAKE](https://github.com/aneesha/RAKE) in Java.  \n\nAn implementation of the Rapid Automatic Keyword Extraction (RAKE) algorithm as described in:  [Rose, S., Engel, D., Cramer, N., \u0026 Cowley, W. (2010). 
Automatic Keyword Extraction from Individual Documents](http://scholar.google.com.sg/scholar?q=Automatic+Keyword+Extraction+from+Individual+Documents\u0026btnG=\u0026hl=en\u0026as_sdt=0%2C5\u0026as_vis=1)\n\n# Run\n## Sample\nBasic run, without offset information:\n```java\n        Document doc = new Document(text);\n        RakeAnalyzer rake = new RakeAnalyzer();\n        rake.loadDocument(doc);\n        rake.runWithoutOffset();\n        System.out.println(doc.termListToString());\n```\nRun with offset information and stemming:\n```java\n        Document doc = new Document(text);\n        RakeAnalyzer rake = new RakeAnalyzer();\n        rake.loadDocument(doc);\n        rake.run();\n        System.out.println(doc.termMapToString());\n```\n# Features\nKeyword recognition follows the stop-word-based RAKE algorithm, plus:\n* Adjoining keywords, to recognize phrases such as \"axis of evil\".\n* KStemming algorithm ported from Lucene, to stem \"university students\" to \"university student\".\n* An index of keywords with term frequency `tf` and document frequency `df`.\n\n# Dependencies\nThe `pom.xml` requires one additional custom Maven module as a dependency:\n```xml\n        \u003cdependency\u003e\n            \u003cgroupId\u003eio.deepreader.java.commons\u003c/groupId\u003e\n            \u003cartifactId\u003ecommons-util\u003c/artifactId\u003e\n            \u003cversion\u003e1.0-SNAPSHOT\u003c/version\u003e\n        \u003c/dependency\u003e\n```\nThe module is hosted [here](https://github.com/idf/commons-util). You can get it manually with:\n```\ngit clone https://github.com/idf/commons-util\n```\n\n# References\n[Python RAKE](https://github.com/aneesha/RAKE)  \n[Python RAKE (forked)](https://github.com/idf/RAKE)  \n[Java RAKE](https://github.com/Neuw84/RAKE-Java)  \n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fidf%2Frake4j","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fidf%2Frake4j","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fidf%2Frake4j/lists"}