Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/thammegowda/tika-geo-ner-model
This maven project puts all required resources at once place for experimenting with Tika's GeoParser
https://github.com/thammegowda/tika-geo-ner-model
Last synced: about 1 month ago
JSON representation
This maven project puts all required resources at once place for experimenting with Tika's GeoParser
- Host: GitHub
- URL: https://github.com/thammegowda/tika-geo-ner-model
- Owner: thammegowda
- License: apache-2.0
- Created: 2015-10-23T06:30:08.000Z (about 9 years ago)
- Default Branch: master
- Last Pushed: 2015-10-23T07:03:38.000Z (about 9 years ago)
- Last Synced: 2024-04-18T02:58:24.313Z (7 months ago)
- Language: Java
- Size: 4.91 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Tika Geo NER Model
When this jar is added to classpath, it supplies NER model to Tika's GeoParser.
## Test Run
`mvn exec:java -Dexec.args=data/usc.geot`## Update NER model:
Replace file at `src/main/resources/org/apache/tika/parser/geo/topic/en-ner-location.bin`.
## Register Customm Types:
Update file `src/main/resources/org/apache/tika/mime/custom-mimetypes.xml`## Additional Setup
Requires `lucene-geo-gazetteer` in `PATH` as described in http://wiki.apache.org/tika/GeoTopicParser.> In ubuntu, I had to set `PATH` in `/etc/profile` because the java process launched from IDE didn't
inherit `~/.bashrc`