Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/vievie31/podofo

A simple pdf search engine with flask
https://github.com/vievie31/podofo

flask not-optimized pdf-search-engine short-project sqlite tf-idf

Last synced: about 2 hours ago
JSON representation

A simple pdf search engine with flask

Awesome Lists containing this project

README

        

# PoDoFo

My simple pdf search engine with flask running actually on my raspberry-pi, so I can access and search in my pdfs databe from everywhere !!

This code flask and sqlite3 for the web server and sql queries, for extracting pdfs informations it use pdfminer...

Developped on raspian jessie lite and OS X 10.9 .

## Video Demo

On youtube :

[![IMAGE ALT TEXT HERE](https://img.youtube.com/vi/iz0-QGdS9Rg/0.jpg)]
(https://www.youtube.com/watch?v=iz0-QGdS9Rg)

## Azure Install in VM

Ubuntu 18.04 Server as VM Image. Open ports 22 and 80

- sudo apt update
- sudo apt install rustc libssl-dev python3-pip python3-setuptools sqlite3
- git clone https://github.com/VieVie31/podofo.git
- cd podofo
- pip3 install -r requirements.txt

Change shebang to python3 in run.py

- cd app/sql
- ./reset-db.sh
- cd ../../
- ./run.py