{"id":14987964,"url":"https://github.com/apache/opennlp-sandbox","last_synced_at":"2025-04-07T12:09:21.359Z","repository":{"id":65379638,"uuid":"70037034","full_name":"apache/opennlp-sandbox","owner":"apache","description":"Apache OpenNLP Sandbox","archived":false,"fork":false,"pushed_at":"2025-03-31T06:05:43.000Z","size":34567,"stargazers_count":42,"open_issues_count":4,"forks_count":32,"subscribers_count":17,"default_branch":"main","last_synced_at":"2025-03-31T07:22:31.153Z","etag":null,"topics":["apache","compling","languagetechnology","nlp","opennlp","textprocessing"],"latest_commit_sha":null,"homepage":"https://opennlp.apache.org/","language":"Java","has_issues":false,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/apache.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":".github/CONTRIBUTING.md","funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2016-10-05T07:00:08.000Z","updated_at":"2025-03-31T06:05:15.000Z","dependencies_parsed_at":"2024-04-30T07:03:42.689Z","dependency_job_id":"38fb8ddd-fefb-4926-af3c-59cee3f9ae60","html_url":"https://github.com/apache/opennlp-sandbox","commit_stats":{"total_commits":610,"total_committers":23,"mean_commits":26.52173913043478,"dds":0.3508196721311475,"last_synced_commit":"b7ff83fd8be28c0e2f2f2fcc5091da3b84e3f6a6"},"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/apache%2Fopennlp-sandbox","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/apache%2Fopennlp-sandbox/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/apache%2Fopennlp-sandbox/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/apache%2Fopennlp-sandbox/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/apache","download_url":"https://codeload.github.com/apache/opennlp-sandbox/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247648977,"owners_count":20972945,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["apache","compling","languagetechnology","nlp","opennlp","textprocessing"],"created_at":"2024-09-24T14:15:50.339Z","updated_at":"2025-04-07T12:09:21.340Z","avatar_url":"https://github.com/apache.png","language":"Java","readme":"\u003c!--\nLicensed to the Apache Software Foundation (ASF) under one or more\ncontributor license agreements.  See the NOTICE file distributed with\nthis work for additional information regarding copyright ownership.\nThe ASF licenses this file to You under the Apache License, Version 2.0\n(the \"License\"); you may not use this file except in compliance with\nthe License.  You may obtain a copy of the License at\n\n    http://www.apache.org/licenses/LICENSE-2.0\n\nUnless required by applicable law or agreed to in writing, software\ndistributed under the License is distributed on an \"AS IS\" BASIS,\nWITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.\nSee the License for the specific language governing permissions and\nlimitations under the License.\n--\u003e\n\nWelcome to Apache OpenNLP!\n===========\n\n[![Build Status](https://github.com/apache/opennlp-sandbox/workflows/Java%20CI/badge.svg)](https://github.com/apache/opennlp-sandbox/actions)\n[![Contributors](https://img.shields.io/github/contributors/apache/opennlp-sandbox)](https://github.com/apache/opennlp-sandbox/graphs/contributors)\n[![GitHub pull requests](https://img.shields.io/github/issues-pr-raw/apache/opennlp-sandbox.svg)](https://github.com/apache/opennlp-sandbox/pulls)\n[![Stack Overflow](https://img.shields.io/badge/stack%20overflow-opennlp-f1eefe.svg)](https://stackoverflow.com/questions/tagged/opennlp)\n\nThe Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.\n\nThis sandbox of the toolkit is written mostly in Java and provides support for special NLP tasks, such as \nword sense disambiguation, coreference resolution, text summarization, and more!\nThese tasks are usually required to build text processing services.\n\nThe goal of the OpenNLP sandbox is to provide extra components, potentially in an experimental stage.\n\nOpenNLP sandbox code can be used both programmatically through its Java API, some components even from a terminal through its CLI.\n\n## Useful Links\n\nFor additional information, visit the [OpenNLP Home Page](http://opennlp.apache.org/)\n\nYou can use OpenNLP with any language, demo models are provided [here](https://downloads.apache.org/opennlp/models/).\nThe models are fully compatible with the latest release, they can be used for testing or getting started.\n\n\u003e [!NOTE]  \n\u003e Please train your own models for all other use cases.\n\nDocumentation, including JavaDocs, code usage and command-line interface examples are available [here](http://opennlp.apache.org/docs/)\n\nYou can also follow our [mailing lists](http://opennlp.apache.org/mailing-lists.html) for news and updates.\n\n## Overview\n\nCurrently, the library has different components:\n\n* `caseeditor-corpus-server-plugin`: A set of Java classes for [Apache UIMA](https://uima.apache.org) as Eclipse plugin to integrate corpora.\n* `caseeditor-opennlp-plugin`: An OpenNLP plugin for [Apache UIMA](https://uima.apache.org).\n* `corpus-server`: A multi-module component to create, search, remove, and serve multiple corpora.\n* `mahout-addon`: An addon for [Apache Mahout](https://mahout.apache.org).\n* `mallet-addon`: An addon for [Mallet](https://mimno.github.io/Mallet/topics.html) targeting topic modelling techniques.\n* `modelbuilder-addon`: A set of classes to build models.\n* `nlp-utils`: A set of OpenNLP util classes.\n* `opennlp-coref`: A component to conduct co-reference resolution.\n* `opennlp-dl`: An adapter component for [deeplearning4j](https://deeplearning4j.konduit.ai).\n* `opennlp-grpc`: An implementation of a gRPC backend for OpenNLP.\n* `opennlp-similarity`: A set of components that solve a number of text processing and search tasks, see further details in this [README.md](opennlp-similarity/README.md).\n* `opennlp-wsd`: A set of components that allow for word sense disambiguation.\n* `summarizer`: A set of classes providing text summarization.\n* `tagging-server`: A RESTful webservice to allow for NER, POS tagging, sentence detection and tokenization.\n* `tf-ner-poc`: An adapter component for [Tensorflow](https://www.tensorflow.org), in an early proof-of-concept (poc) stage.\n* `wikinews-importer`: A set of classes to process and annotate text formatted in [MediaWiki markup](https://www.mediawiki.org/wiki/Help:Formatting).\n\n## Getting Started\n\nYou can import the core toolkit directly from Maven, SBT or Gradle after you have build it locally:\n\n#### Maven\n\n```\n\u003cdependency\u003e\n    \u003cgroupId\u003eorg.apache.opennlp\u003c/groupId\u003e\n    \u003cartifactId\u003eopennlp-sandbox\u003c/artifactId\u003e\n    \u003cversion\u003e${opennlp.version}\u003c/version\u003e\n\u003c/dependency\u003e\n```\n\n#### SBT\n\n```\nlibraryDependencies += \"org.apache.opennlp\" % \"opennlp-sandbox\" % \"${opennlp.version}\"\n```\n\n#### Gradle\n\n```\ncompile group: \"org.apache.opennlp\", name: \"opennlp-sandbox\", version: \"${opennlp.version}\"\n```\n\nFor more details please check our [documentation](http://opennlp.apache.org/docs/)\n\n## Building OpenNLP\n\nAt least JDK 21 and Maven 3.3.9 are required to build the sandbox components.\n\nAfter cloning the repository go into the destination directory and run:\n\n```\nmvn install\n```\n\n## Contributing\n\nThe Apache OpenNLP project is developed by volunteers and is always looking for new contributors to work on all parts of the project. \nEvery contribution is welcome and needed to make it better. \nA contribution can be anything from a small documentation typo fix to a new component.\n\nIf you would like to get involved please follow the instructions [here](https://github.com/apache/opennlp/blob/main/.github/CONTRIBUTING.md)\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fapache%2Fopennlp-sandbox","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fapache%2Fopennlp-sandbox","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fapache%2Fopennlp-sandbox/lists"}