Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

awesome-historical-newspaper-analysis

Awesome historical newspaper analysis tools and literature
https://github.com/Duke-Chronicle-Project/awesome-historical-newspaper-analysis

Last synced: 2 days ago
JSON representation

  • Data standards

    • hOCR

      • hocr-tools
      • hocrjs - Visualization of hOCR files.
      • PAGEviewer - Visualization of page layout and OCR segmentation for PAGE XML, ALTO XML, FineReader XML and hOCR.
  • Optical character recognition

    • hOCR

      • Tesseract - Open source C++ api and command line tool. Provides basic layout analysis.
      • Ocrad - The GNU OCR.
  • Document layout analysis, text enrichment and semantic segmentation

  • Text analysis

  • Quality evaluation

    • hOCR

      • Aletheia - Ground truth annotation tool.