https://github.com/mdda/berkeley-entity-server
Simple Java REST API wrapper for the Berkeley Entity Resolution System
https://github.com/mdda/berkeley-entity-server
Last synced: 22 days ago
JSON representation
Simple Java REST API wrapper for the Berkeley Entity Resolution System
- Host: GitHub
- URL: https://github.com/mdda/berkeley-entity-server
- Owner: mdda
- License: gpl-3.0
- Created: 2015-05-24T09:28:02.000Z (about 11 years ago)
- Default Branch: master
- Last Pushed: 2015-06-01T09:52:05.000Z (about 11 years ago)
- Last Synced: 2023-03-11T07:58:05.934Z (over 3 years ago)
- Language: Scala
- Size: 164 KB
- Stars: 1
- Watchers: 3
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# berkeley-entity-server
Simple Java REST API wrapper for the Berkeley Entity Resolution System
## This is simply a placeholder for the moment
The plan is to make it operate in a similar fashion to https://github.com/mdda/corenlp-java-server
### Downloading the Entity Data
From : http://nlp.cs.berkeley.edu/projects/entity.shtml
(which appears to be a very temperamental / *spurty* server):
```
curl -O http://nlp.cs.berkeley.edu/downloads/berkeley-entity-models.tgz
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 886M 100 886M 0 0 564k 0 0:26:46 0:26:46 --:--:-- 5044k
# Upon completion :
ls -l berkeley-entity-models.tgz
-rw-rw-r--. 929073602 May 21 02:01 berkeley-entity-models.tgz
```
### Downloading the Berkeley Entity Resolution System source
```
git clone https://github.com/gregdurrett/berkeley-entity.git
# This downloads 20.63Mb of git objects
```
Unless you need to alter the source code itself, all that this project
requires is the pre-compiled (15.2Mb) jar file contained in the
root of the 'berkeley-entity' repo :
```
ls -l berkeley-entity/*.jar
-rw-rw-r--. 15287983 May 25 13:19 berkeley-entity/berkeley-entity-1.0.jar
```
If you do change the source, or would like to rebuild the jar from scratch,
simply execute ```sbt``` in the 'berkeley-entity' repo root, then, in the ```sbt``` CLI :
```
assembly
# [info] Compiling 108 Scala sources and 15 Java sources to
# ... /target/scala-2.11/classes ...
# ... berkeley-entity/target/scala-2.11/berkeley-entity-assembly-1.jar
```
This takes ~1 minute on a moderately specified laptop.
```
# Original, packaged jar :
ls -l *.jar
-rw-rw-r--. 15287983 May 25 13:19 berkeley-entity-1.0.jar
# New, rebuilt jar :
ls -l target/scala-2.11/berkeley-entity-assembly-1.jar
-rw-rw-r--. 15288485 May 25 13:35 target/scala-2.11/berkeley-entity-assembly-1.jar
```
Differences in scala versions, and downloaded libraries likely account for the differences.
Then, copy ```berkeley-entity-1.0.jar``` into this repo's ```lib``` folder, next
to the placeholder file.
### Useful resources
https://github.com/chonger/ScalaFrontend