An open API service indexing awesome lists of open source software.

https://github.com/tboenig/17_fontmix_simple

This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.
https://github.com/tboenig/17_fontmix_simple

ground-truth ocr-d

Last synced: 5 months ago
JSON representation

This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.

Awesome Lists containing this project

README

          


17_fontmix_simple


Ground Truth with a collection of documents with the following characteristics:
fonts blackletter and antiqua, ancient Greek, Hebrew, initials, with title page, colour chart


Metadata



Language:

grc, heb, deu

Format:

Page-XML

Time:

1600-1700

GT Type:

data_structure_and_text

License:

CC0 1.0

Transcription Guidelines:

OCR-D Ground Truth Guidelines https://ocr-d.de/en/gt-guidelines/trans/

Project:

OCR-D

Project-URL:

https://ocr-d.de/


Sources


The volume of transcriptions:





TextLine
Page
TxtRegion
GraphRegion
SepRegion




332
12
81
4
3




List of transcriptions






document
TxtRegion
ImgRegion
LineDrawRegion
GraphRegion
TabRegion
ChartRegion
SepRegion
MathRegion
ChemRegion
MusicRegion
AdRegion
NoiseRegion
UnkownRegion
CustomRegion
TextLine
Page




bohse_helicon_1696
35


3


2







121
5


weigel_gnothi02_1618
24


1










130
4


rollenhagen_reysen_1603
22





1







81
3






Extent



In this section they can insert additional information, instructions or notes.