An open API service indexing awesome lists of open source software.

https://github.com/jorgecj/extract-text-from-pdf

Project made to extract text from PDF files.
https://github.com/jorgecj/extract-text-from-pdf

javascript js node-js nodejs

Last synced: 3 months ago
JSON representation

Project made to extract text from PDF files.

Awesome Lists containing this project

README

        

# Extract-text-from-PDF
![Thumb do projeto](https://github.com/JorgeCJ/Extract-text-from-PDF/assets/127647774/8b345859-37da-4554-a894-c7ae954f569d)

`Project made to extract text from PDF files. First, type in the console, inside the PDF converter folder, "npm init", to create a new NPM package. After that, type "npm install pdfreader", to install our module responsible for extracting text from the PDF. Inside the "package.json", don't forget to type "type":"module", :`

![type module](https://github.com/JorgeCJ/Extract-text-from-PDF/assets/127647774/8b1c5d9a-0de3-443a-9f51-4c61b2eaa2db)

`Place your PDF file inside the "pdfToText" folder. Don't forget to rename it to "sample.pdf":`

![sample do pdf](https://github.com/JorgeCJ/Extract-text-from-PDF/assets/127647774/0d8bee87-5fd2-4698-891b-bea3e2479714)

`And finally, just type the following code in your terminal: "node pdfToText/pdfReader.js pdfToText/sample.pdf".`

## Technologies used:
- [Javascript](https://developer.mozilla.org/en-US/docs/Web/JavaScript)
- [NodeJs](https://nodejs.org/en/docs)

If you want to know more about how to extract text from PDF, click [HERE](https://www.npmjs.com/package/pdfreader).