https://github.com/tboenig/17_frak_complex
This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.
https://github.com/tboenig/17_frak_complex
ground-truth ocr-d
Last synced: 5 months ago
JSON representation
This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.
- Host: GitHub
- URL: https://github.com/tboenig/17_frak_complex
- Owner: tboenig
- License: cc-by-sa-4.0
- Created: 2022-09-28T10:34:49.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2023-10-23T14:35:46.000Z (over 2 years ago)
- Last Synced: 2025-04-25T14:14:51.735Z (about 1 year ago)
- Topics: ground-truth, ocr-d
- Homepage: https://tboenig.github.io/17_frak_complex/
- Size: 54 MB
- Stars: 0
- Watchers: 1
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE.md
Awesome Lists containing this project
README
17_frak_complex
Ground Truth with a collection of documents with the following characteristics:
font blackletter, musical notation, hand-written additions, with title page
Metadata
- Language:
- deu, lat
- Format:
- Page-XML
- Time:
- 1600-1700
- GT Type:
- data_structure_and_text
- License:
- CC0 1.0
- Transcription Guidelines:
- OCR-D Ground Truth Guidelines https://ocr-d.de/en/gt-guidelines/trans/
- Project:
- OCR-D
- Project-URL:
- https://ocr-d.de/
Sources
The volume of transcriptions:
TextLine
Page
TxtRegion
GraphRegion
SepRegion
MusicRegion
NoiseRegion
215
8
63
4
11
4
2
List of transcriptions
document
TxtRegion
ImgRegion
LineDrawRegion
GraphRegion
TabRegion
ChartRegion
SepRegion
MathRegion
ChemRegion
MusicRegion
AdRegion
NoiseRegion
UnkownRegion
CustomRegion
TextLine
Page
silesius_seelenlust01_1657
38
1
7
4
137
5
huebner_handbuch_1696
25
3
4
2
78
3
Extent
In this section they can insert additional information, instructions or notes.