
https://github.com/tboenig/16_frak_complex

This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.

ground-truth ocr-d


          


16_frak_complex


Ground Truth for a collection of documents with the following characteristics:
blackletter font, two-column layout, handwritten additions, stamps, partly table-like structures, labelled illustrations, initials


Metadata



Language: deu, lat
Format: Page-XML
Time: 1500-1600
GT Type: data_structure_and_text
License: CC0 1.0
Transcription Guidelines: OCR-D Ground Truth Guidelines https://ocr-d.de/en/gt-guidelines/trans/
Project: OCR-D
Project-URL: https://ocr-d.de/


Sources


The volume of transcriptions:





| TextLine | Page | TxtRegion | GraphRegion |
|---|---|---|---|
| 279 | 10 | 55 | 28 |




List of transcriptions






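| document | TxtRegion | ImgRegion | LineDrawRegion | GraphRegion | TabRegion | ChartRegion | SepRegion | MathRegion | ChemRegion | MusicRegion | AdRegion | NoiseRegion | UnknownRegion | CustomRegion | TextLine | Page |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| luther_babstum_1526 | 7 |  |  | 2 |  |  |  |  |  |  |  |  |  |  | 51 | 2 |
| aventinus_grammatica_1515 | 29 |  |  | 20 |  |  |  |  |  |  |  |  |  |  | 129 | 3 |
| nn_historia_1500 | 6 |  |  | 1 |  |  |  |  |  |  |  |  |  |  | 35 | 2 |
| petrarca_psalmi_1506 | 13 |  |  | 5 |  |  |  |  |  |  |  |  |  |  | 64 | 3 |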
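
Counts like the ones listed above could be reproduced from the Page-XML files with a short script. The following is a minimal sketch, not the tooling used to build this repository. It assumes that the column headers abbreviate PAGE-XML element names (for example TxtRegion for TextRegion and GraphRegion for GraphicRegion). The function names and the sample document are hypothetical and exist only for illustration.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def local_name(tag: str) -> str:
    """Strip the '{namespace}' prefix that ElementTree puts in front of tag names."""
    return tag.rsplit("}", 1)[-1]


def count_page_elements(xml_text: str) -> Counter:
    """Count Page, TextLine and all *Region elements in one PAGE-XML document."""
    root = ET.fromstring(xml_text)
    counts = Counter()
    for element in root.iter():
        if not isinstance(element.tag, str):
            continue  # skip comments or processing instructions, if present
        name = local_name(element.tag)
        if name in ("Page", "TextLine") or name.endswith("Region"):
            counts[name] += 1
    return counts


def count_files(paths) -> Counter:
    """Sum the element counts over several PAGE-XML files."""
    total = Counter()
    for path in paths:
        with open(path, encoding="utf-8") as handle:
            total += count_page_elements(handle.read())
    return total


# Hypothetical, heavily reduced PAGE-XML document used only as a demonstration.
SAMPLE = """
<PcGts xmlns="http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15">
  <Page imageFilename="example.tif" imageWidth="100" imageHeight="100">
    <TextRegion id="r1">
      <TextLine id="l1"/>
      <TextLine id="l2"/>
    </TextRegion>
    <GraphicRegion id="r2"/>
  </Page>
</PcGts>
"""

if __name__ == "__main__":
    for name, number in count_page_elements(SAMPLE).items():
        print(name, number)
```

Matching on the local element name rather than on a fixed namespace keeps the sketch independent of the PAGE schema version used in the files. To count a whole document folder, `count_files` can be given the paths of its XML files, for example from `pathlib.Path(folder).glob("*.xml")`.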






Extent



Additional information, instructions, or notes can be added in this section.