https://github.com/ananduremanan/file_search
File Search Using Node.js, Express.js and React.js
expressjs nodejs reactjs tailwindcss tesseract-ocr xlxs
- Host: GitHub
- URL: https://github.com/ananduremanan/file_search
- Owner: ananduremanan
- Created: 2023-11-20T05:34:26.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2023-11-29T04:46:20.000Z (about 1 year ago)
- Last Synced: 2024-11-09T06:06:53.674Z (about 2 months ago)
- Topics: expressjs, nodejs, reactjs, tailwindcss, tesseract-ocr, xlxs
- Language: TypeScript
- Homepage:
- Size: 23.3 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 1
Metadata Files:
- Readme: README.md
# File Search Using Node.js, Express.js and React.js
### Introduction
A search system for locating files by keyword. The search scans the contents of various file types, such as PDFs, Excel sheets, Word documents, and images. The extracted contents are indexed, and matching results are displayed in the front end. Because the system uses OCR, even scanned PDFs can be indexed for searching.
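The index-then-search idea described above can be illustrated with a small in-memory inverted index. This is a minimal sketch, not the project's actual code: the `ExtractedFile` type and the `tokenize`, `buildIndex`, and `search` functions are hypothetical names, and the sketch assumes that text has already been extracted from each file (by a parser or by OCR).

```typescript
// Hypothetical shape: text already extracted from a file (by parsing or OCR).
interface ExtractedFile {
  path: string;
  text: string;
}

// Maps each lowercase keyword to the set of file paths that contain it.
type InvertedIndex = Map<string, Set<string>>;

// Split text into lowercase alphanumeric tokens.
function tokenize(text: string): string[] {
  return text.toLowerCase().match(/[a-z0-9]+/g) ?? [];
}

// Build the keyword -> files mapping once, up front.
function buildIndex(files: ExtractedFile[]): InvertedIndex {
  const index: InvertedIndex = new Map();
  for (const file of files) {
    for (const token of tokenize(file.text)) {
      let paths = index.get(token);
      if (!paths) {
        paths = new Set<string>();
        index.set(token, paths);
      }
      paths.add(file.path);
    }
  }
  return index;
}

// Return the files that contain every keyword in the query.
function search(index: InvertedIndex, query: string): string[] {
  const tokens = tokenize(query);
  if (tokens.length === 0) return [];
  // Start from the first keyword's files, then keep only those matching the rest.
  let result = Array.from(index.get(tokens[0]) ?? new Set<string>());
  for (const token of tokens.slice(1)) {
    const paths = index.get(token) ?? new Set<string>();
    result = result.filter((p) => paths.has(p));
  }
  return result;
}

const index = buildIndex([
  { path: "invoice.pdf", text: "Invoice total for November" },
  { path: "report.docx", text: "Quarterly report: total revenue" },
]);
console.log(search(index, "total"));         // [ 'invoice.pdf', 'report.docx' ]
console.log(search(index, "total revenue")); // [ 'report.docx' ]
```

Building the index once means each query is a handful of map lookups rather than a rescan of every file, which matters when extraction (especially OCR) is the slow step.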
### How to Run
##### Warning: This project uses the npm package [pdf-poppler](https://www.npmjs.com/package/pdf-poppler), which supports only Windows and macOS.
Clone this repository by running
```bash
git clone git@github.com:ananduremanan/File-Search.git
```
Run the project with:
```bash
npm start
```
### Time Taken for Processing
| Task | Duration | Pages | Word Count | Notes |
| ------------------------------- | ------------------ | ----- | ---------- | ----------- |
| Convert 1 Page PDF (14 Words) | 1.70 s | 1 | 14 | |
| Convert 4 Page PDF (1880 Words) | 19.05 s | 4 | 1880 | |
| Convert 4 Page PDF (Image) | 13.10 s | 4 | - | Scanned PDF |
| Convert 618 Page PDF (Image)    | 15 min 1.44 s      | 618   | -          | Scanned PDF |
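
To compare the rows above, it can help to normalize each duration to seconds per page. The sketch below does that arithmetic. It assumes the last row's duration means 15 minutes 1.44 seconds (901.44 s), which is an interpretation of the table rather than something the README states; the `Timing` type and `secondsPerPage` function are hypothetical names used only for illustration.

```typescript
interface Timing {
  task: string;
  seconds: number;
  pages: number;
}

// Durations and page counts copied from the table.
const timings: Timing[] = [
  { task: "1-page PDF (14 words)", seconds: 1.7, pages: 1 },
  { task: "4-page PDF (1880 words)", seconds: 19.05, pages: 4 },
  { task: "4-page scanned PDF", seconds: 13.1, pages: 4 },
  // Assumption: the last row's duration is read as 15 min 1.44 s.
  { task: "618-page scanned PDF", seconds: 15 * 60 + 1.44, pages: 618 },
];

// Average processing time per page for one table row.
function secondsPerPage(t: Timing): number {
  return t.seconds / t.pages;
}

for (const t of timings) {
  console.log(`${t.task}: ${secondsPerPage(t).toFixed(2)} s/page`);
}
```

Under that assumption, the 618-page scan works out to roughly 1.5 s per page, compared with about 3.3 s per page for the 4-page scan and about 4.8 s per page for the 4-page text PDF.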