https://github.com/adibaba/barrel

Elasticsearch PDF indexing

# Run Elasticsearch with Docker

- See https://www.elastic.co/guide/en/elasticsearch/reference/7.5/docker.html
- `sudo docker pull docker.elastic.co/elasticsearch/elasticsearch:7.5.2`
- `sudo docker run -p 9200:9200 -p 9300:9300 -e "discovery.type=single-node" docker.elastic.co/elasticsearch/elasticsearch:7.5.2`
- List indexes (with column headings): http://localhost:9200/_cat/indices?v=true
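
Once the container is running, a quick check from the command line can confirm that the node answers on port 9200 and show the same index listing as the URL above. This is a minimal sketch, assuming `curl` is installed; the `ES_URL` variable and the helper functions `cat_indices_url` and `es_get` are hypothetical names introduced here, not part of Barrel.

```shell
# Base URL of the single-node container (port 9200 is published by -p 9200:9200).
# ES_URL is a hypothetical variable name used only in this sketch.
ES_URL="${ES_URL:-http://localhost:9200}"

# Build the URL of the _cat/indices listing; v=true adds column headings.
cat_indices_url() {
  printf '%s/_cat/indices?v=true' "$1"
}

# Fetch a URL and print a hint on stderr if the node is not reachable yet.
es_get() {
  curl --silent --show-error --fail --max-time 5 "$1" \
    || echo "Elasticsearch not reachable at $1 (is the container still starting?)" >&2
}

es_get "$ES_URL"                       # cluster name and version info as JSON
es_get "$(cat_indices_url "$ES_URL")"  # one row per index
```

Elasticsearch can take a little while to start, so the first call may fail right after `docker run`; repeating it after a short wait is usually enough.
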

# Run Barrel

Configure Barrel via `configuration.txt`.

```
Configuration file:
configuration.txt
Indexes:
[barrel]
Usage:
index                    Indexes unknown PDF files
index all                Indexes all PDF files
list                     Lists PDF files
search <query>           Searches for query
search <query> <filter>  Searches for query in filenames containing filter
```

You can also run Barrel from Java using `MainManual.java`.
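
To check what Barrel has written, the `barrel` index named in the output above can also be queried directly through Elasticsearch's URI search. This is a sketch under assumptions: it presumes the index exists after an indexing run, the search term `invoice` is only a placeholder, and `search_url` is a hypothetical helper. It does not reproduce how Barrel's own `search` command builds its queries.

```shell
ES_URL="${ES_URL:-http://localhost:9200}"  # hypothetical variable, as published by -p 9200:9200
INDEX="barrel"                             # index name taken from the "Indexes:" output
QUERY="invoice"                            # placeholder search term, replace with your own

# Build the _search endpoint URL for a given base URL and index.
search_url() {
  printf '%s/%s/_search' "$1" "$2"
}

# --get turns the data fields into query parameters; --data-urlencode escapes the term.
curl --silent --show-error --max-time 5 --get \
  --data-urlencode "q=$QUERY" \
  --data "pretty=true" \
  "$(search_url "$ES_URL" "$INDEX")" \
  || echo "Search failed: is Elasticsearch running at $ES_URL?" >&2
```

Without a field prefix, the `q` parameter searches the index's default fields, so the response shows which documents match without needing to know Barrel's field names.
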