Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/thammegowda/tika-geo-ner-model

This maven project puts all required resources at once place for experimenting with Tika's GeoParser
https://github.com/thammegowda/tika-geo-ner-model

Last synced: about 1 month ago
JSON representation

This maven project puts all required resources at once place for experimenting with Tika's GeoParser

Awesome Lists containing this project

README

        

# Tika Geo NER Model

When this jar is added to classpath, it supplies NER model to Tika's GeoParser.

## Test Run
`mvn exec:java -Dexec.args=data/usc.geot`

## Update NER model:

Replace file at `src/main/resources/org/apache/tika/parser/geo/topic/en-ner-location.bin`.

## Register Customm Types:
Update file `src/main/resources/org/apache/tika/mime/custom-mimetypes.xml`

## Additional Setup
Requires `lucene-geo-gazetteer` in `PATH` as described in http://wiki.apache.org/tika/GeoTopicParser.

> In ubuntu, I had to set `PATH` in `/etc/profile` because the java process launched from IDE didn't
inherit `~/.bashrc`