https://github.com/intranda/goobi-iiif-downloader
Downloads images and OCR results using IIIF manifest URLs
- Host: GitHub
- URL: https://github.com/intranda/goobi-iiif-downloader
- Owner: intranda
- License: gpl-2.0
- Created: 2019-06-24T07:18:06.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2021-08-15T11:00:14.000Z (about 4 years ago)
- Last Synced: 2025-02-13T18:49:23.807Z (8 months ago)
- Topics: iiif
- Language: Java
- Size: 69.3 KB
- Stars: 3
- Watchers: 3
- Forks: 0
- Open Issues: 2
- Metadata Files:
  - Readme: README.md
  - License: LICENSE
README
# Goobi IIIF downloader
The Goobi IIIF downloader is a command-line utility that downloads images and OCR results using IIIF manifests. Downloads can also be limited to the pages of particular structures. Structures are filtered by including or excluding metadata label/value pairs.
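How such a label/value filter might behave can be sketched in Java. This is a hypothetical illustration, not code from the project. The class and method names, the sample value `Kapitel`, and the matching rule are all assumptions. The assumed rule is that a structure passes when it matches at least one include filter (or no include filter is given) and matches no exclude filter. Only the `Label::Value` notation, as in `Strukturtyp::Abbildung`, comes from the option help.

```java
import java.util.List;
import java.util.Map;

// Hypothetical sketch; not taken from the goobi-iiif-downloader sources.
final class StructureFilterSketch {

    /** One "Label::Value" pair, for example "Strukturtyp::Abbildung". */
    record LabelValue(String label, String value) {

        /** Splits the raw option value at the first "::" separator. */
        static LabelValue parse(String raw) {
            int sep = raw.indexOf("::");
            if (sep < 0) {
                throw new IllegalArgumentException("expected Label::Value but got: " + raw);
            }
            return new LabelValue(raw.substring(0, sep), raw.substring(sep + 2));
        }

        /** True if the structure's metadata holds exactly this value under this label. */
        boolean matches(Map<String, String> metadata) {
            return value.equals(metadata.get(label));
        }
    }

    // Assumed semantics: any include filter may match (or none are given),
    // and a single matching exclude filter rejects the structure.
    static boolean accept(Map<String, String> metadata,
                          List<LabelValue> includes,
                          List<LabelValue> excludes) {
        boolean included = includes.isEmpty()
                || includes.stream().anyMatch(f -> f.matches(metadata));
        boolean excluded = excludes.stream().anyMatch(f -> f.matches(metadata));
        return included && !excluded;
    }

    public static void main(String[] args) {
        List<LabelValue> includes = List.of(LabelValue.parse("Strukturtyp::Abbildung"));
        Map<String, String> figure = Map.of("Strukturtyp", "Abbildung");
        Map<String, String> chapter = Map.of("Strukturtyp", "Kapitel"); // hypothetical value

        System.out.println(accept(figure, includes, List.of()));  // prints "true"
        System.out.println(accept(chapter, includes, List.of())); // prints "false"
    }
}
```

The real tool may combine several filters differently, so treat the rule in `accept` as one plausible reading rather than documented behaviour.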
## Usage:
The binaries can be downloaded from the GitHub releases section. The CLI can then be called as follows:
```
Usage: java -jar goobi-iiif-downloader.jar [-da] [-ri] -d= -m= [-max=] [-sm=] [-es=]... [-is=]...
  -m, --manifest=
        the manifest URL to parse and download from
  -d, --destination=
        the destination folder to download to
  -is, --include_structure=
        structure to include - example: "Strukturtyp::Abbildung". The option is repeatable.
  -es, --exclude_structure=
        structure to exclude - example: "Strukturtyp::Abbildung". The option is repeatable.
  -sm, --structure_mode=
        structure mode. Possible values: "firstpage" and "all"
  -max, --maximum_images=
        the maximum number of images to download
  -ri, --random_images
        select random images
  -da, --download_alto
        download alto (if present)
```
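As an illustration of how the options above combine, the following Java sketch assembles one possible invocation as an argument list. It is a minimal example under stated assumptions: the helper `buildCommand`, the manifest URL `https://example.org/iiif/manifest.json`, and the `downloads` folder are hypothetical placeholders. Only the flags themselves are taken from the usage text.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper; only the flag names come from the usage text above.
final class DownloaderCommandSketch {

    static List<String> buildCommand(String jarPath,
                                     String manifestUrl,
                                     String destination,
                                     List<String> includeStructures,
                                     int maxImages,
                                     boolean downloadAlto) {
        List<String> cmd = new ArrayList<>(List.of("java", "-jar", jarPath));
        cmd.add("-m=" + manifestUrl);   // required: manifest URL
        cmd.add("-d=" + destination);   // required: destination folder
        for (String structure : includeStructures) {
            cmd.add("-is=" + structure); // repeatable option, one flag per filter
        }
        if (maxImages > 0) {
            cmd.add("-max=" + maxImages);
        }
        if (downloadAlto) {
            cmd.add("-da");
        }
        return cmd;
    }

    public static void main(String[] args) {
        List<String> cmd = buildCommand(
                "goobi-iiif-downloader.jar",
                "https://example.org/iiif/manifest.json", // placeholder URL
                "downloads",                               // placeholder folder
                List.of("Strukturtyp::Abbildung"),
                10,
                true);

        // Prints: java -jar goobi-iiif-downloader.jar -m=https://example.org/iiif/manifest.json -d=downloads -is=Strukturtyp::Abbildung -max=10 -da
        System.out.println(String.join(" ", cmd));

        // To launch it for real, the list could be handed to
        // new ProcessBuilder(cmd).inheritIO().start().waitFor();
    }
}
```

Passing the arguments as a list avoids shell quoting, so a filter value such as `Strukturtyp::Abbildung` reaches the tool unchanged.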