Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/vievie31/podofo
A simple pdf search engine with flask
https://github.com/vievie31/podofo
flask not-optimized pdf-search-engine short-project sqlite tf-idf
Last synced: about 2 hours ago
JSON representation
A simple pdf search engine with flask
- Host: GitHub
- URL: https://github.com/vievie31/podofo
- Owner: VieVie31
- License: mit
- Created: 2016-05-09T20:13:48.000Z (over 8 years ago)
- Default Branch: master
- Last Pushed: 2024-03-20T10:43:03.000Z (8 months ago)
- Last Synced: 2024-03-20T11:54:02.931Z (8 months ago)
- Topics: flask, not-optimized, pdf-search-engine, short-project, sqlite, tf-idf
- Language: Python
- Size: 354 KB
- Stars: 23
- Watchers: 3
- Forks: 18
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# PoDoFo
My simple pdf search engine with flask running actually on my raspberry-pi, so I can access and search in my pdfs databe from everywhere !!
This code flask and sqlite3 for the web server and sql queries, for extracting pdfs informations it use pdfminer...
Developped on raspian jessie lite and OS X 10.9 .
## Video Demo
On youtube :
[![IMAGE ALT TEXT HERE](https://img.youtube.com/vi/iz0-QGdS9Rg/0.jpg)]
(https://www.youtube.com/watch?v=iz0-QGdS9Rg)## Azure Install in VM
Ubuntu 18.04 Server as VM Image. Open ports 22 and 80
- sudo apt update
- sudo apt install rustc libssl-dev python3-pip python3-setuptools sqlite3
- git clone https://github.com/VieVie31/podofo.git
- cd podofo
- pip3 install -r requirements.txtChange shebang to python3 in run.py
- cd app/sql
- ./reset-db.sh
- cd ../../
- ./run.py