{"id":25056382,"url":"https://github.com/clojurenlp/core","last_synced_at":"2025-04-13T10:44:03.872Z","repository":{"id":2003214,"uuid":"2936829","full_name":"clojurenlp/core","owner":"clojurenlp","description":"Clojure wrapper for the Stanford CoreNLP Java library","archived":false,"fork":false,"pushed_at":"2021-01-04T18:56:47.000Z","size":1973,"stargazers_count":105,"open_issues_count":3,"forks_count":32,"subscribers_count":11,"default_branch":"master","last_synced_at":"2025-04-10T04:46:50.016Z","etag":null,"topics":["clojure","machine-learning","natural-language-processing","parsing"],"latest_commit_sha":null,"homepage":"","language":"Clojure","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/clojurenlp.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2011-12-08T00:29:26.000Z","updated_at":"2024-05-31T07:47:05.000Z","dependencies_parsed_at":"2022-07-19T12:54:08.513Z","dependency_job_id":null,"html_url":"https://github.com/clojurenlp/core","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/clojurenlp%2Fcore","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/clojurenlp%2Fcore/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/clojurenlp%2Fcore/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/clojurenlp%2Fcore/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/clojurenlp","download_url":"https://codeload.github.com/clojurenlp/core/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248702058,"owners_count":21148114,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["clojure","machine-learning","natural-language-processing","parsing"],"created_at":"2025-02-06T13:23:36.089Z","updated_at":"2025-04-13T10:44:03.851Z","avatar_url":"https://github.com/clojurenlp.png","language":"Clojure","funding_links":["https://paypal.me/clojurenlp"],"categories":[],"sub_categories":[],"readme":"# org.clojurenlp.core\n\n[![Clojars Project](https://img.shields.io/clojars/v/org.clojurenlp/core.svg)](https://clojars.org/org.clojurenlp/core)\n[![Build Status](https://travis-ci.org/clojurenlp/core.svg?branch=master)](https://travis-ci.org/clojurenlp/core)\n[![Gitter Lobby](https://badges.gitter.im/gitterHQ/gitter.png)](https://gitter.im/clojurenlp/Lobby)\n\nNatural language processing in Clojure based on the Stanford-CoreNLP parser.\n\n# 👋 MAINTAINERS WANTED!\n\nWe need help getting this project moving. Please feel free to email to leontalbot@gmail.com to join the org, or drop a line in the chat room.\n\n\nThis is a work in progress, currently in the POC phase.\n\n## Usage\n\n### Tokenization\n\n    (use 'org.clojurenlp.core)\n    (tokenize \"This is a simple sentence.\")\n    ;; =\u003e '({:token \"This\", :start-offset 0, :end-offset 4}\n            {:token \"is\", :start-offset 5, :end-offset 7}\n            {:token \"a\", :start-offset 8, :end-offset 9}\n            {:token \"simple\", :start-offset 10, :end-offset 16}\n            {:token \"sentence\", :start-offset 17, :end-offset 25}\n            {:token \".\", :start-offset 25, :end-offset 26}) \n        \n        \n### Part-of-Speech Tagging\n\nTo get a list of `TaggedWord` objects:\n\n    (use 'org.clojurenlp.core)\n    ;;  use any of these:\n    (-\u003e \"Short and sweet.\" tokenize pos-tag)\n    (-\u003e \"Short and sweet.\" split-sentences first pos-tag)\n    (-\u003e [\"Short\" \"and\" \"sweet\" \".\"] pos-tag)\n    (-\u003e \"Short and sweet.\" pos-tag)\n    \n    ;; =\u003e [#\u003cTaggedWord Short/JJ\u003e #\u003cTaggedWord and/CC\u003e ...]\n\nTo return a tag string from TaggedWord object:\n    \n    (-\u003e\u003e \"Short and sweet.\" tokenize pos-tag first .tag)\n    ;; =\u003e JJ\n    (-\u003e\u003e \"Short and sweet.\" tokenize pos-tag (map #(.tag %)))\n    ;; =\u003e (\"JJ\" \"CC\" \"JJ\" \".\")\n\nFor more information, see the [relevant Javadoc](http://nlp.stanford.edu/nlp/javadoc/javanlp/edu/stanford/nlp/ling/TaggedWord.html)\n\n### Named Entity Recognition\n\nTo tag named entities utilizing standard Stanford NER model:\n\n    (use 'org.clojurenlp.core)\n    (def pipeline (initialize-pipeline))\n    (def text \"The United States of America will be tagged as a location\")\n    (tag-ner pipeline text)\n\nTraining your own model [How to Train Your Own Model](https://nlp.stanford.edu/software/crf-faq.html#a)\n\nTo tag named entities utilizing custom trained model: \n    \n    (use 'org.clojurenlp.core)\n    (def pipeline (initialize-pipeline \"path-to-serialized-model\"))\n    (def text \"The United States of America will be tagged as a location\")\n    (tag-ner pipeline text)\n    \nUtilizing either NER tagging strategy, a map containing the original text, sentences, tokens, and ner tags will be returned.\n    \n### Parsing\n\nTo parse a sentence:\n\n\t(use 'org.clojurenlp.core)\n\t(parse (tokenize text))\n\nYou will get back a LabeledScoredTreeNode which you can plug in to\nother Stanford CoreNLP functions or can convert to a standard Treebank\nstring with:\n\n\t(str (parse (tokenize text)))\n\n### Stanford Dependencies\n\n\t(dependency-graph \"I like cheese.\")\n\nwill parse the sentence and return the dependency graph as a\n[loom](https://github.com/jkk/loom) graph, which you can then traverse with\nstandard graph algorithms like shortest path, etc. You can also view it:\n\n\t(def graph (dependency-graph \"I like cheese.\"))\n\t(use 'loom.io)\n\t(view graph)\n\nThis requires GraphViz to be installed.\n\n## License\n\n© 2018 The ClojureNLP Organization and Contributors\n\nDistributed under the Apache 2.0 License. See LICENSE for details.\n\n## The ClojureNLP Organization\n- Leon Talbot @leontalbot\n- Andrew McLoud @andrewmcloud\n\n## Contributors\n- Cory Giles\n- Hans Engel\n- Damien Stanton\n- Andrew McLoud\n- Leon Talbot\n- Marek Owsikowski\n\n\n\n[![donation](https://img.shields.io/badge/Donate_-to_this_project-green.svg)](https://paypal.me/clojurenlp)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fclojurenlp%2Fcore","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fclojurenlp%2Fcore","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fclojurenlp%2Fcore/lists"}