Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/26hzhang/clausie
A mavenized ClauseIE project (in Java) of Max Planck Institute.
https://github.com/26hzhang/clausie
clausie information-extraction java-8 stanford-parser
Last synced: about 1 month ago
JSON representation
A mavenized ClauseIE project (in Java) of Max Planck Institute.
- Host: GitHub
- URL: https://github.com/26hzhang/clausie
- Owner: 26hzhang
- License: mit
- Created: 2017-11-22T11:45:34.000Z (about 7 years ago)
- Default Branch: master
- Last Pushed: 2017-12-01T03:27:19.000Z (about 7 years ago)
- Last Synced: 2024-05-15T18:44:35.795Z (8 months ago)
- Topics: clausie, information-extraction, java-8, stanford-parser
- Language: Java
- Homepage:
- Size: 47.5 MB
- Stars: 10
- Watchers: 0
- Forks: 6
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# ClausIE -- Clause-Based Open Information Extraction
It is a mavenized version of [ClauseIE](https://www.mpi-inf.mpg.de/departments/databases-and-information-systems/software/clausie/) project in [Max Planck Institute](https://www.mpi-inf.mpg.de/home/) (configuration and resource files as well as some of the codes structure are modified).ClausIE is an open information extractor, it identifies and extracts relations and their arguments in natural language text. ClausIE first detects "useful" pieces of information expressed in a sentence, and then represents this information in terms of one or more extractions. The representation of these extractions can be flexibly customized to the underlying application (e.g., binary or n-ary propositions).
Here is the online test provided by Max Planck Institute: **[ClausIE online demo!](https://gate.d5.mpi-inf.mpg.de/ClausIEGate/ClausIEGate)**.
ClausIE codes download: [[link]](http://resources.mpi-inf.mpg.de/d5/clausie/clausie-0-0-1.zip).
ClausIE tutorials: [[link]](http://resources.mpi-inf.mpg.de/d5/clausie/tutorial_.html).
## Requirements
- [Java 1.8](http://www.oracle.com/technetwork/java/javase/downloads/jdk8-downloads-2133151.html).
- JOpt Simple (A Java library for parsing command line options, `version>=4.4`), its maven snippet: [[link]](https://mvnrepository.com/artifact/net.sf.jopt-simple/jopt-simple).
- [Stanford Parser](https://stanfordnlp.github.io/CoreNLP/) (It is a program that works out the grammatical structure of sentences, `version==2.0.4`), its maven snippet: [[link]](https://mvnrepository.com/artifact/edu.stanford.nlp/stanford-parser/2.0.4), since ClausIE uses the pre-trained parser model of Stanford Parser, please also add the following model dependency:
```XMLedu.stanford.nlp
stanford-parser
${stanford-parser.version}
models```
## Usage
A pre-built ClausIE `jar` file is located at [`target/ClausIE.jar`](/target/), it includes the `JOpt` and `Stanford Parser` and its model already, which can be add into your project and used directly. (Use IntelliJ to build `jar` file please refer: [[link]](https://stackoverflow.com/questions/12508180/how-do-you-mavenize-a-project-using-intellij))Or, you can clone this source and follow the article [Maven in 5 Minutes](http://maven.apache.org/guides/getting-started/maven-in-five-minutes.html) to make some changes and mavenize it to use in your own project. To add it as a maven dependency:
```XML...
de.mpii.clausie
ClausIE
1.0-SNAPSHOT
...```