Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/ckorzen/pdf-text-extraction-benchmark
A benchmark that evaluates existing PDF extraction tools on their ability to extract the body text of PDF documents, especially scientific articles.
arxiv benchmark evaluation extraction pdf tex text-extraction
Last synced: 2 months ago
- Host: GitHub
- URL: https://github.com/ckorzen/pdf-text-extraction-benchmark
- Owner: ckorzen
- License: mit
- Created: 2015-12-15T09:48:20.000Z (about 9 years ago)
- Default Branch: master
- Last Pushed: 2020-11-07T16:13:35.000Z (about 4 years ago)
- Last Synced: 2024-08-03T17:08:13.531Z (6 months ago)
- Topics: arxiv, benchmark, evaluation, extraction, pdf, tex, text-extraction
- Language: TeX
- Homepage:
- Size: 505 MB
- Stars: 63
- Watchers: 6
- Forks: 11
- Open Issues: 2
Awesome Lists containing this project
- awesome-document-understanding - pdf-text-extraction-benchmark - PDF tools benchmark (Resources)