An open API service indexing awesome lists of open source software.

https://github.com/intranda/goobi-iiif-downloader

Downloads images and OCR results using IIIF manifest URLs
https://github.com/intranda/goobi-iiif-downloader

iiif

Last synced: 6 months ago
JSON representation

Downloads images and OCR results using IIIF manifest URLs

Awesome Lists containing this project

README

          

# Goobi IIIF downloader

The Goobi IIIF downloader is a command line utility that allows one to
download images and OCR results using IIIF manifests.

It is also possible to limit the downloaded pages by structures. The structures can be filtered by including or excluding metadata label/value pairs.

## Usage:

The binaries can be downloaded in the github releases section. The CLI can then be called as follows:

```
Usage: java -jar goobi-iiif-downloader.jar [-da] [-ri] -d= -m= [-max=] [-sm=] [-es=]... [-is=]...
-m, --manifest=
the manifest URL to parse and download from
-d, --destination=
the destination folder to download to
-is, --include_structure=
structure to include - example: "Strukturtyp::Abbildung". The option is repeatable.
-es, --exclude_structure=
structure to exclude - example: "Strukturtyp::Abbildung". The option is repeatable.
-sm, --structure_mode=
structure mode. Possible values: "firstpage" and "all"
-max, --maximum_images=
the maximum number of images to download
-ri, --random_images select random images
-da, --download_alto download alto (if present)
```