An open API service indexing awesome lists of open source software.

https://github.com/skitsanos/extract-pdf-tables

PDF Tables extraction with Java and Tabula
https://github.com/skitsanos/extract-pdf-tables

cli cli-app command-line command-line-tool java pdf pdf-extractor pdf-table pdf-table-extract pdf-table-extraction

Last synced: 4 months ago
JSON representation

PDF Tables extraction with Java and Tabula

Awesome Lists containing this project

README

        

# PDF Tables Extractor

Showcasing the use of `tabula` to extract tables from PDF documents

```shell
JSON=$(java -jar "target/extract-pdf-tables-1.0.2-jar-with-dependencies.jar" "{{.FILE}}" | jq '[.[] | select(length > 0)]')

# store JSON in extracts.json file
echo $JSON > out/extracts.json

# display the first table found
echo $JSON | jq '.[0]'
```