https://github.com/bradsec/pdftext
PDFText is a web app developed with JavaScript, HTML, and CSS to convert standard PDF documents to text.
pdf pdf-document pdf-text pdf-viewer
- Host: GitHub
- URL: https://github.com/bradsec/pdftext
- Owner: bradsec
- License: mit
- Created: 2023-12-09T04:21:03.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2023-12-10T21:08:33.000Z (about 1 year ago)
- Last Synced: 2023-12-11T10:31:45.629Z (about 1 year ago)
- Topics: pdf, pdf-document, pdf-text, pdf-viewer
- Language: CSS
- Homepage: https://pdftext.me/
- Size: 126 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
# PDFText
PDFText is a web application that extracts text from PDF files entirely on the client side. It uses [PDF.js](https://mozilla.github.io/pdf.js/) to read PDF documents and lets you view the extracted text, download it as a text file, or copy it to the clipboard.
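To illustrate the extraction step described above, here is a minimal sketch of how text could be pulled out of a PDF with PDF.js. It is not taken from the PDFText source. The function names `itemsToText` and `extractText` are hypothetical, and the sketch assumes that `pdfjsLib` is loaded globally in the browser with its worker configured, and that each text item exposes `str` and `hasEOL` as in recent PDF.js releases.

```javascript
// Join PDF.js text items into plain text.
// Assumption: each text item looks like { str: string, hasEOL: boolean },
// the shape returned by page.getTextContent() in recent PDF.js releases.
function itemsToText(items) {
  let text = "";
  for (const item of items) {
    // Skip entries without a string payload (for example marked-content markers).
    if (typeof item.str !== "string") continue;
    text += item.str;
    if (item.hasEOL) text += "\n";
  }
  return text;
}

// Hypothetical browser-side wiring. Assumes `pdfjsLib` is available globally
// and `file` is a File object taken from an <input type="file"> element.
async function extractText(file) {
  const data = await file.arrayBuffer();
  const pdf = await pdfjsLib.getDocument({ data }).promise;
  const pages = [];
  for (let n = 1; n <= pdf.numPages; n++) {
    const page = await pdf.getPage(n);
    const content = await page.getTextContent();
    pages.push(itemsToText(content.items));
  }
  // Separate pages with a blank line.
  return pages.join("\n\n");
}
```

Keeping `itemsToText` as a pure function means the joining logic can be checked without a browser or a real PDF; only `extractText` depends on PDF.js.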
### Try it out at [pdftext.me](https://pdftext.me)
## Features
- **PDF Upload**: Users can upload a PDF file to extract text.
- **Text Display**: Extracted text is displayed in the application interface.
- **Download Extracted Text**: Users can download the extracted text as a `.txt` file.
- **Copy to Clipboard**: The extracted text can be copied directly to the clipboard.
- **Alert Notifications**: Provides feedback on successful operations or errors.

## Known Limitations
- Works on standard PDF documents that contain printed, text-based content. It does not include OCR, so handwriting is not recognized.
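
A practical consequence of this limitation is that a page without a text layer yields an empty string. The following sketch shows one way an application could detect that case and report it to the user. It is an illustration only, not PDFText's actual behavior; the function name `summarizeExtraction` and the message wording are hypothetical.

```javascript
// Summarize an extraction result given one extracted string per page.
// Hypothetical helper: reports which pages produced no text, which is what
// happens when a page has no text layer and no OCR is applied.
function summarizeExtraction(pageTexts) {
  const emptyPages = [];
  pageTexts.forEach((text, index) => {
    if (text.trim() === "") emptyPages.push(index + 1); // pages are 1-based
  });

  if (pageTexts.length === 0 || emptyPages.length === pageTexts.length) {
    return { ok: false, emptyPages, message: "No extractable text found." };
  }
  if (emptyPages.length > 0) {
    return {
      ok: true,
      emptyPages,
      message: "Some pages contained no extractable text: " + emptyPages.join(", "),
    };
  }
  return { ok: true, emptyPages, message: "Text extracted from all pages." };
}
```

The returned `message` could feed the alert notifications listed under Features.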
## Credits
- **PDF.js** by Mozilla | [Apache 2.0 License](https://github.com/mozilla/pdf.js/blob/master/LICENSE)