https://github.com/tboenig/17_fontmix_simple
This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.
https://github.com/tboenig/17_fontmix_simple
ground-truth ocr-d
Last synced: 5 months ago
JSON representation
This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.
- Host: GitHub
- URL: https://github.com/tboenig/17_fontmix_simple
- Owner: tboenig
- License: cc-by-sa-4.0
- Created: 2022-09-28T10:41:37.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2023-10-23T14:28:19.000Z (over 2 years ago)
- Last Synced: 2025-04-25T14:14:53.339Z (about 1 year ago)
- Topics: ground-truth, ocr-d
- Homepage: https://tboenig.github.io/17_fontmix_simple/
- Size: 97.4 MB
- Stars: 0
- Watchers: 1
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE.md
Awesome Lists containing this project
README
17_fontmix_simple
Ground Truth with a collection of documents with the following characteristics:
fonts blackletter and antiqua, ancient Greek, Hebrew, initials, with title page, colour chart
Metadata
- Language:
- grc, heb, deu
- Format:
- Page-XML
- Time:
- 1600-1700
- GT Type:
- data_structure_and_text
- License:
- CC0 1.0
- Transcription Guidelines:
- OCR-D Ground Truth Guidelines https://ocr-d.de/en/gt-guidelines/trans/
- Project:
- OCR-D
- Project-URL:
- https://ocr-d.de/
Sources
The volume of transcriptions:
TextLine
Page
TxtRegion
GraphRegion
SepRegion
332
12
81
4
3
List of transcriptions
document
TxtRegion
ImgRegion
LineDrawRegion
GraphRegion
TabRegion
ChartRegion
SepRegion
MathRegion
ChemRegion
MusicRegion
AdRegion
NoiseRegion
UnkownRegion
CustomRegion
TextLine
Page
bohse_helicon_1696
35
3
2
121
5
weigel_gnothi02_1618
24
1
130
4
rollenhagen_reysen_1603
22
1
81
3
Extent
In this section they can insert additional information, instructions or notes.