Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with alto-xml
A curated list of projects in awesome lists tagged with alto-xml .
https://github.com/UglyToad/PdfPig
Read and extract text and other content from PDFs in C# (port of PDFBox)
alto-xml csharp document-analysis hocr layout-analysis netstandard page-xml pdf pdf-document pdf-document-processor pdf-extractor pdf-files pdf-generation pdfbox
Last synced: 31 Jul 2024
https://github.com/uglytoad/pdfpig
Read and extract text and other content from PDFs in C# (port of PDFBox)
alto-xml csharp document-analysis hocr layout-analysis netstandard page-xml pdf pdf-document pdf-document-processor pdf-extractor pdf-files pdf-generation pdfbox
Last synced: 01 Oct 2024
https://github.com/mittagessen/kraken
OCR engine for all the languages
alto-xml handwritten-text-recognition hocr htr layout-analysis neural-networks ocr optical-character-recognition page-xml
Last synced: 30 Jul 2024
https://github.com/BobLd/DocumentLayoutAnalysis
Document Layout Analysis resources repos for development with PdfPig.
alto alto-xml csharp docstrum document-layout-analysis hocr hocr-documents layout-analysis page-segmentation page-xml pdf pdfpig recursive-xy-cut table-extraction tei xy-cut xycut
Last synced: 03 Aug 2024
https://github.com/qurator-spk/dinglehopper
An OCR evaluation tool
alto alto-xml ocr ocr-d ocr-evaluation page page-xml qurator
Last synced: 03 Aug 2024
https://github.com/altoxml/schema
ALTO XML schema - latest and all former versions
alto alto-xml alto-xml-schema ocr optical-character-recognition schema
Last synced: 30 Jul 2024
https://github.com/cneud/alto-tools
Python tools for performing various operations on ALTO XML files
alto-xml digital-library optical-character-recognition
Last synced: 30 Jul 2024
https://github.com/altomator/en-data_mining
Data Mining Historical Newspaper Metadata (METS/ALTO formats)
alto alto-xml basex data-mining digital-humanities digital-libraries digital-library metadata mets-xml ocr perl-script xml
Last synced: 28 Sep 2024
https://github.com/altomator/EN-data_mining
Data Mining Historical Newspaper Metadata (METS/ALTO formats)
alto alto-xml basex data-mining digital-humanities digital-libraries digital-library metadata mets-xml ocr perl-script xml
Last synced: 03 Aug 2024
https://github.com/Living-with-machines/alto2txt
Convert ALTO XML to plain text + minimal metadata
alto-xml digital-humanities historical-newspapers
Last synced: 03 Aug 2024
https://github.com/altomator/ALTO-HTML
Conversion of ALTO files (including tags) to HTML
Last synced: 03 Aug 2024