Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/paulpower1/pdf_scraper_poc
POC of a pdf scraper using syncfusion libraries
- Host: GitHub
- URL: https://github.com/paulpower1/pdf_scraper_poc
- Owner: PaulPower1
- Created: 2022-04-19T11:22:24.000Z (almost 3 years ago)
- Default Branch: main
- Last Pushed: 2022-04-19T11:38:51.000Z (almost 3 years ago)
- Last Synced: 2024-11-09T07:46:30.344Z (3 months ago)
- Language: C#
- Size: 10.7 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# pdf_scraper
POC of a PDF scraper using Syncfusion libraries.

Author: Paul Power
Date: 19-April-2022
Description: This simple program extracts text and annotations from a PDF and is essentially a POC of using the Syncfusion libraries. https://www.syncfusion.com

Notes:
1. Further tests should be done to ensure changes can be written back to the PDF.
2. Test that drawings and graphics can be extracted and written back with their integrity preserved (AutoCAD can import a PDF containing drawings).
3. The Syncfusion libraries appear to work OK. Open question: do files need to be uncompressed before manipulation?
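
The text and annotation extraction described above might look roughly like the following C# sketch. It is not the repository's actual code. It assumes the Syncfusion PDF NuGet package (for example `Syncfusion.Pdf.Net.Core`) and a valid Syncfusion license. The class name `PdfScrapeSketch`, the helper `FormatAnnotation`, and the path `input.pdf` are hypothetical names chosen for illustration.

```csharp
// Sketch only: assumes the Syncfusion PDF library is referenced via NuGet.
using System;
using System.IO;
using Syncfusion.Pdf;
using Syncfusion.Pdf.Interactive;
using Syncfusion.Pdf.Parsing;

public static class PdfScrapeSketch
{
    // Hypothetical helper: formats one annotation as a single output line.
    public static string FormatAnnotation(int pageNumber, string author, string text)
    {
        return $"[page {pageNumber}] {author}: {text}";
    }

    public static void Main(string[] args)
    {
        // "input.pdf" is a placeholder, not a file from the repository.
        string path = args.Length > 0 ? args[0] : "input.pdf";

        using (FileStream stream = new FileStream(path, FileMode.Open, FileAccess.Read))
        {
            // Load an existing PDF for reading.
            PdfLoadedDocument document = new PdfLoadedDocument(stream);
            try
            {
                for (int i = 0; i < document.Pages.Count; i++)
                {
                    PdfPageBase page = document.Pages[i];

                    // Plain text content of the page.
                    Console.WriteLine($"--- page {i + 1} text ---");
                    Console.WriteLine(page.ExtractText());

                    // Annotations (comments, highlights, and so on) on the page.
                    if (page is PdfLoadedPage loadedPage)
                    {
                        foreach (PdfAnnotation annotation in loadedPage.Annotations)
                        {
                            Console.WriteLine(
                                FormatAnnotation(i + 1, annotation.Author, annotation.Text));
                        }
                    }
                }
            }
            finally
            {
                // Release the document's resources.
                document.Close(true);
            }
        }
    }
}
```

This sketch only reads from the PDF. Writing changes back and preserving drawings, as raised in the notes, are not covered and would need separate testing.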