Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Ground truth for Jakob Grimm / Weisthümer
- Host: GitHub
- URL: https://github.com/ub-mannheim/weisthuemer
- Owner: UB-Mannheim
- License: cc0-1.0
- Created: 2020-03-16T12:16:41.000Z (almost 5 years ago)
- Default Branch: master
- Last Pushed: 2023-12-21T05:49:26.000Z (about 1 year ago)
- Last Synced: 2023-12-21T08:37:08.573Z (about 1 year ago)
- Topics: antiqua, ground-truth, ocr
- Homepage:
- Size: 204 KB
- Stars: 6
- Watchers: 6
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# Weisthümer
This repository contains transcriptions of 35 pages from Jacob Grimm's seven-volume work "Weisthümer", which can be used for training or validation of OCR models.

#### Typeface class:
Antiqua

#### Languages:
different variants of Middle High German, Latin

#### Special characters:
Roman numerals, exponents, section break (§), long s (ſ), circumflex (â), caron (ǎ), acute accent (á), ring diacritic (å), diacritic umlauts (aͤ), cursive Greek letters Theta (ϑ), Beta (β), Pi (Π).

#### Sources:
The transcriptions refer to digitised material available on archive.org:
Volume 1: https://archive.org/details/bub_gb_2J0ZKYG7on8C
Volume 2: https://archive.org/details/bub_gb_LFpLZSYYg34C
Volume 3: https://archive.org/details/bub_gb_o6S3yrj9TkwC
Volume 4: https://archive.org/details/bub_gb_eAqsmQrcWcQC
Volume 5: https://archive.org/details/bub_gb_MMcFAAAAQAAJ
Volume 6: https://archive.org/details/weisthmer02drongoog
Volume 7: https://archive.org/details/weisthmer09maurgoog

#### Further details on the transcription and training workflow can be found in the Wiki
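
The special characters listed above mix precomposed letters (such as â) with combining marks (such as the superscript e in aͤ), so it can help to normalize a transcription line before counting its characters. The following is a minimal sketch, not part of the repository: the function name `special_char_inventory` and the sample line are hypothetical, and the choice of NFC normalization is an assumption rather than a documented convention of this ground truth.

```python
import unicodedata
from collections import Counter


def special_char_inventory(text: str) -> Counter:
    """Count non-ASCII code points in a line after NFC normalization.

    Assumption: NFC is used so that 'a' + COMBINING CIRCUMFLEX ACCENT is
    counted together with the precomposed 'â'. NFC leaves the long s (ſ)
    and the combining small e (U+0364) untouched.
    """
    normalized = unicodedata.normalize("NFC", text)
    return Counter(ch for ch in normalized if ord(ch) > 127)


# Hypothetical sample line for illustration; not taken from the repository.
# It contains: a + combining e, long s, precomposed â, section sign,
# and a + combining circumflex (which NFC composes to â).
sample = "da\u0364 \u017ft\u00e2t \u00a7 a\u0302"

inventory = special_char_inventory(sample)
for ch, count in sorted(inventory.items()):
    print(f"U+{ord(ch):04X} {unicodedata.name(ch)} x{count}")
```

An inventory like this could be used to check that an OCR model's character set covers every symbol that occurs in the ground truth. Compatibility normalization (NFKC) would fold the long s into a plain s, which is usually not wanted for this kind of material.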