https://github.com/statcan/slicemypdf

This project uses SLICE algorithm to extract information from a text-based PDF page containing financial statements (tabular data). It can also be used to extract regular tables but will contain all text on a page.
https://github.com/statcan/slicemypdf

Last synced: 12 months ago
JSON representation

Host: GitHub
URL: https://github.com/statcan/slicemypdf
Owner: StatCan
License: other
Created: 2021-08-11T13:52:06.000Z (almost 5 years ago)
Default Branch: main
Last Pushed: 2021-08-11T14:11:42.000Z (almost 5 years ago)
Last Synced: 2023-03-02T22:23:16.370Z (over 3 years ago)
Language: Jupyter Notebook
Homepage:
Size: 714 KB
Stars: 21
Watchers: 4
Forks: 7
Open Issues: 3

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/statcan/slicemypdf

Awesome Lists containing this project