https://github.com/doc-analysis/DocBank
DocBank: A Benchmark Dataset for Document Layout Analysis
https://github.com/doc-analysis/DocBank
Last synced: about 1 year ago
JSON representation
DocBank: A Benchmark Dataset for Document Layout Analysis
- Host: GitHub
- URL: https://github.com/doc-analysis/DocBank
- Owner: doc-analysis
- License: apache-2.0
- Created: 2020-05-29T08:33:31.000Z (about 6 years ago)
- Default Branch: master
- Last Pushed: 2024-08-12T04:18:10.000Z (almost 2 years ago)
- Last Synced: 2024-08-12T05:29:31.364Z (almost 2 years ago)
- Language: Python
- Homepage:
- Size: 328 MB
- Stars: 548
- Watchers: 19
- Forks: 71
- Open Issues: 27
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-pdf - DocBank Dataset - A large-scale dataset built with weak supervision, enabling models to integrate textual and layout information for downstream tasks. (Datasets)