Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/PedroBarcha/old-books-dataset

Old book pages (with groundtruth), formerly used for OCR studies. There are several versions of the set (concerning resolution and binarization). Noised and denoised sets (done by several methods) are eventually going to be uploaded.

binarization binarized-dataset books-dataset dataset ground-truth groundtruth ocr-database ocr-dataset old-books old-documents text text-data text-database

Last synced: 21 Apr 2024