https://github.com/tboenig/16_ant_complex
This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.
https://github.com/tboenig/16_ant_complex
ground-truth ocr-d
Last synced: 4 months ago
JSON representation
This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.
- Host: GitHub
- URL: https://github.com/tboenig/16_ant_complex
- Owner: tboenig
- License: cc-by-sa-4.0
- Created: 2022-09-27T17:58:24.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2023-10-23T14:19:46.000Z (over 2 years ago)
- Last Synced: 2025-07-19T21:28:45.954Z (11 months ago)
- Topics: ground-truth, ocr-d
- Homepage: https://tboenig.github.io/16_ant_complex/
- Size: 14.2 MB
- Stars: 0
- Watchers: 1
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE.md
Awesome Lists containing this project
README
16_ant_complex
Ground Truth with a collection of documents with the following characteristics:
font antiqua, marginal notes [both printed and hand written]
Metadata
- Language:
- lat, deu
- Format:
- Page-XML
- Time:
- 1500-1600
- GT Type:
- data_structure_and_text
- License:
- CC0 1.0
- Transcription Guidelines:
- OCR-D Ground Truth Guidelines https://ocr-d.de/en/gt-guidelines/trans/
- Project:
- OCR-D
- Project-URL:
- https://ocr-d.de/
Sources
The volume of transcriptions:
TextLine
Page
TxtRegion
NoiseRegion
94
3
22
1
List of transcriptions
document
TxtRegion
ImgRegion
LineDrawRegion
GraphRegion
TabRegion
ChartRegion
SepRegion
MathRegion
ChemRegion
MusicRegion
AdRegion
NoiseRegion
UnkownRegion
CustomRegion
TextLine
Page
alberti_pictura_1540
22
1
94
3
Extent
In this section they can insert additional information, instructions or notes.