https://github.com/marcusmyers/docued
- Host: GitHub
- URL: https://github.com/marcusmyers/docued
- Owner: marcusmyers
- Created: 2014-02-13T04:08:52.000Z (over 11 years ago)
- Default Branch: master
- Last Pushed: 2014-02-13T04:16:12.000Z (over 11 years ago)
- Last Synced: 2025-01-15T14:00:03.831Z (5 months ago)
- Size: 97.7 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# DocuEd
DocuEd is the start of a document management application for digitizing records for retention.
Below is the initial README, which is not totally outdated. This process still needs to be cleaned up.
Install Ubuntu Server (select LAMP and OpenSSH Server during installation)
Run install.sh
To split a file with pdftk
pdftk in.pdf burst
Optionally, set the output file name pattern
pdftk in.pdf burst output newname_%02d.pdf
Use Ghostscript to convert PDF to TIFF
gs -dNOPAUSE -sDEVICE=tiffg4 -r600x600 -dBATCH -sPAPERSIZE=letter -sOutputFile=Output_File_Name.tif Name_of_PDF.pdf
Use Tesseract to OCR the TIFF
tesseract in-file.tif outfile -l eng
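The three commands above can be chained into one script. The following is only a rough sketch, assuming pdftk, gs and tesseract are on the PATH. The names `run`, `ocr_pdf` and `DRY_RUN` are made up for illustration and are not part of DocuEd or install.sh.

```shell
#!/bin/sh
# Sketch: burst a PDF into pages, convert each page to TIFF, then OCR it.
# run, ocr_pdf and DRY_RUN are hypothetical names used for illustration.

# run: print the command when DRY_RUN=1, otherwise execute it.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        printf '%s\n' "$*"
    else
        "$@"
    fi
}

# ocr_pdf INPUT.pdf PREFIX
ocr_pdf() {
    input=$1
    prefix=$2

    # One PDF per page: PREFIX_01.pdf, PREFIX_02.pdf, ...
    run pdftk "$input" burst output "${prefix}_%02d.pdf"

    for page in "${prefix}"_*.pdf; do
        [ -e "$page" ] || continue   # glob matched nothing
        base=${page%.pdf}
        # Same Ghostscript options as above: 600 dpi, Group 4 TIFF, letter size.
        run gs -dNOPAUSE -sDEVICE=tiffg4 -r600x600 -dBATCH -sPAPERSIZE=letter \
            -sOutputFile="${base}.tif" "$page"
        # Tesseract appends .txt to the output base name.
        run tesseract "${base}.tif" "$base" -l eng
    done
}
```

After sourcing the script, `ocr_pdf in.pdf newname` should leave one newname_NN.tif and one newname_NN.txt per page. Setting `DRY_RUN=1` first only prints the commands, which is handy for checking file names before a long 600 dpi run.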