https://github.com/arne-cl/balie-ner-cli
Balie (Named Entity Recognition) commandline interface
https://github.com/arne-cl/balie-ner-cli
Last synced: 4 months ago
JSON representation
Balie (Named Entity Recognition) commandline interface
- Host: GitHub
- URL: https://github.com/arne-cl/balie-ner-cli
- Owner: arne-cl
- Created: 2012-01-08T22:11:21.000Z (over 13 years ago)
- Default Branch: master
- Last Pushed: 2013-12-30T11:01:15.000Z (over 11 years ago)
- Last Synced: 2025-01-06T09:12:24.719Z (6 months ago)
- Language: Python
- Homepage:
- Size: 109 KB
- Stars: 1
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
Balie Commandline Interface
===========================`balie-ner-cli` provides a commandline interface to the named-entity
recognition features of [Balie](http://balie.sourceforge.net/
"baseline information extraction") — a Java library that can be used
to perform various NLP tasks.INSTALLATION
============Download and install Balie if you haven't done so. Switch back to the
`balie-ner-cli` directory. Edit the `balie_dir` variable in `config.yml` to
point to your Balie installation directory.USAGE
=====`python balie-cli.py -i input.txt` will read text from *input.txt* and print a
list of named entities to STDOUT (one NE per line).`python balie-cli.py -i input.txt -o output.txt` will read text from
*input.txt* and write a list of named entities to *output.txt* (one NE per
line).Example
-------$ cat input.txt
Barack Obama, Hillary Clinton and George Bush met in a bar in Wisconsin. They
were discussing issues regarding the Netherlands, the Queen of England and
Boy George.$ python ./balie-cli.py -i input.txt
('Hillary Clinton', 'PERSON')
('George Bush', 'PERSON')
('Wisconsin', 'LOCATION')
('Queen', 'PERSON')
('England', 'LOCATION')
('Boy George', 'PERSON')Note that *Barack Obama* wasn't recognized, as the training data that Balie
comes with is already a few years old (ca. 2007).LICENCE
=======GPL 2 or later.
CONTACT
=======Arne Neumann