https://github.com/slub/entityfactssheetsharvester
a commandline command (Python3 program) that retrieves EntityFacts sheets from a given CSV with GND identifiers and returns them as line-delimited JSON records
https://github.com/slub/entityfactssheetsharvester
command-line-tool dnb entityfacts entityfacts-sheets gnd json line-delimited-json python
Last synced: 2 months ago
JSON representation
a commandline command (Python3 program) that retrieves EntityFacts sheets from a given CSV with GND identifiers and returns them as line-delimited JSON records
- Host: GitHub
- URL: https://github.com/slub/entityfactssheetsharvester
- Owner: slub
- License: apache-2.0
- Created: 2019-08-09T13:01:12.000Z (almost 6 years ago)
- Default Branch: master
- Last Pushed: 2019-08-15T11:56:47.000Z (almost 6 years ago)
- Last Synced: 2025-01-25T07:09:04.563Z (4 months ago)
- Topics: command-line-tool, dnb, entityfacts, entityfacts-sheets, gnd, json, line-delimited-json, python
- Language: Python
- Homepage:
- Size: 12.7 KB
- Stars: 2
- Watchers: 8
- Forks: 2
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE.md
Awesome Lists containing this project
README
# entityfactssheetsharvester - EntityFacts sheets harvester
entityfactssheetsharvester is a commandline command (Python3 program) that retrieves [EntityFacts](https://www.dnb.de/EN/Professionell/Metadatendienste/Datenbezug/Entity-Facts/entity-facts_node.html) sheets* from a given CSV with GND identifiers and returns them as line-delimited JSON records
*) EntityFacts are "fact sheets" on entities of the Integrated Authority File ([GND](https://www.dnb.de/EN/Professionell/Standardisierung/GND/gnd_node.html)), which is provided by German National Library ([DNB](https://www.dnb.de/EN/Home/home_node.html))
## Usage
It eats CSV with GND identifiers (i.e. GND identifier per line) from *stdin*.
It puts the EntityFacts sheets one by one as line-delimited JSON record to *stdout*.
```
entityfactssheetsharvesteroptional arguments:
-h, --help show this help message and exit
```* example:
```
entityfactssheetsharvester < [INPUT CSV FILE WITH GND IDENTIFIERS] > [PATH TO THE OUTPUT LINE-DELIMITED JSON RECORDS FILE]
```
## Run* clone this git repo or just download the [entityfactssheetsharvester.py](entityfactssheetsharvester/entityfactssheetsharvester.py) file
* run ./entityfactssheetsharvester.py
* for a hackish way to use entityfactssheetsharvester system-wide, copy to /usr/local/bin### Install system-wide via pip
```
sudo -H pip3 install --upgrade [ABSOLUTE PATH TO YOUR LOCAL GIT REPOSITORY OF ENTITYFACTSSHEETSHARVESTER]
```
(which provides you ```entityfactssheetsharvester``` as a system-wide commandline command)## See Also
* [entityfactspicturesharvester](https://github.com/slub/entityfactspicturesharvester) - a commandline command (Python3 program) that reads depiction information (images URLs) from given EntityFacts sheets (as line-delimited JSON records) and retrieves and stores the pictures and thumbnails contained in this information
* [entityfactspicturesmetadataharvester](https://github.com/slub/entityfactspicturesmetadataharvester) - a commandline command (Python3 program) that reads depiction information (images URLs) from given EntityFacts sheets (as line-delimited JSON records) and retrieves the (Wikimedia Commons file) metadata of these pictures (as line-delimited JSON records)