Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/vidiptvashist/document-parsing-ocr-computer-vision
- Host: GitHub
- URL: https://github.com/vidiptvashist/document-parsing-ocr-computer-vision
- Owner: vidiptvashist
- License: MIT
- Created: 2024-02-09T15:16:30.000Z
- Default Branch: main
- Last Pushed: 2024-02-12T07:47:30.000Z
- Last Synced: 2024-11-13T23:30:35.451Z
- Language: Jupyter Notebook
- Size: 6.84 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# Document-Parsing-OCR-Computer-Vision
Current open-source solutions for converting documents (PDFs or XDOCs) into machine-readable data cannot extract content while preserving the original layout; they are limited to plain-text extraction. Multi-column text, tables, and graphs, which are common in Ford documents, are extracted by existing solutions without regard to their format.
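
To make the multi-column problem concrete, here is a minimal sketch in plain Python. It is not this repository's method. The word boxes, their coordinates, the `gap` threshold, and the function names `naive_order` and `column_aware_order` are all assumptions made up for illustration. It contrasts a format-unaware reading order with one that first groups words into columns.

```python
from typing import List, Tuple

# Hypothetical word box: (x_left, y_top, text). All values are invented for illustration.
Word = Tuple[float, float, str]


def naive_order(words: List[Word]) -> str:
    """Read strictly top-to-bottom, then left-to-right, ignoring columns."""
    return " ".join(w[2] for w in sorted(words, key=lambda w: (w[1], w[0])))


def column_aware_order(words: List[Word], gap: float = 100.0) -> str:
    """Group words into columns, then read each column top to bottom.

    A new column starts wherever the horizontal jump between consecutive
    x positions exceeds `gap` (an assumed, hand-picked threshold).
    """
    if not words:
        return ""
    by_x = sorted(words, key=lambda w: w[0])
    columns: List[List[Word]] = [[by_x[0]]]
    for prev, cur in zip(by_x, by_x[1:]):
        if cur[0] - prev[0] > gap:
            columns.append([cur])       # large horizontal jump: start a new column
        else:
            columns[-1].append(cur)     # same column as the previous word
    ordered: List[Word] = []
    for col in columns:
        ordered.extend(sorted(col, key=lambda w: (w[1], w[0])))
    return " ".join(w[2] for w in ordered)


# A toy two-column page with two lines per column.
page: List[Word] = [
    (0, 0, "Left"), (50, 0, "one"), (300, 0, "Right"), (360, 0, "one"),
    (0, 20, "Left"), (50, 20, "two"), (300, 20, "Right"), (360, 20, "two"),
]

naive_text = naive_order(page)
column_text = column_aware_order(page)
print(naive_text)
print(column_text)
```

The naive order interleaves the two columns line by line, while the column-aware order keeps each column's text together. Real documents would need a more robust column detector than a fixed gap threshold.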