https://github.com/paulpower1/pdf_scraper_poc


# pdf_scraper
Proof of concept (POC) of a PDF scraper using the Syncfusion libraries.

Author: Paul Power

Date: 19 April 2022

Description: This simple program extracts text and annotations from a PDF. It is essentially a proof of concept for using the
Syncfusion libraries (https://www.syncfusion.com).

Notes:

1. Further tests are needed to confirm that changes can be written back to the PDF.

2. Test that drawings and graphics can be extracted and written back with their integrity preserved
(AutoCAD can import a PDF containing drawings).

3. The Syncfusion libraries appear to work well. Open question: do files need to be uncompressed before they can be manipulated?
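
One rough way to look into the compression question in note 3, independently of Syncfusion, is to check whether a file's streams are tagged `/FlateDecode` and inflate them by hand. The sketch below uses only the Python standard library and is not part of this project or of the Syncfusion API. The function name `inflate_streams` and the in-memory sample are hypothetical. The regular expression is a simplification and not a real PDF parser, so it will miss or misread many real-world files.

```python
import re
import zlib

# Simplified pattern: a stream dictionary, then the raw bytes between the
# "stream" and "endstream" keywords. Not a full PDF parser.
STREAM_RE = re.compile(rb"<<(.*?)>>\s*stream\r?\n(.*?)\nendstream", re.DOTALL)


def inflate_streams(pdf_bytes: bytes) -> list[bytes]:
    """Return the decoded bytes of every stream found in pdf_bytes.

    Streams whose dictionary mentions /FlateDecode are inflated with zlib;
    all other streams are returned unchanged.
    """
    decoded = []
    for dictionary, body in STREAM_RE.findall(pdf_bytes):
        if b"/FlateDecode" in dictionary:
            # decompressobj tolerates stray bytes after the compressed data
            decoded.append(zlib.decompressobj().decompress(body))
        else:
            decoded.append(body)
    return decoded


# Hypothetical sample: one compressed content stream built in memory.
text_ops = b"BT /F1 12 Tf (Hello) Tj ET"
packed = zlib.compress(text_ops)
sample = (
    b"1 0 obj\n<< /Length %d /Filter /FlateDecode >>\nstream\n" % len(packed)
    + packed
    + b"\nendstream\nendobj\n"
)

print(inflate_streams(sample))  # [b'BT /F1 12 Tf (Hello) Tj ET']
```

If the streams of a test file come back readable only after inflation, that suggests the text is stored compressed. Whether Syncfusion handles that step itself still needs to be confirmed against its documentation.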