https://github.com/adibaba/barrel
Elasticsearch PDF indexing
https://github.com/adibaba/barrel
Last synced: 2 months ago
JSON representation
Elasticsearch PDF indexing
- Host: GitHub
- URL: https://github.com/adibaba/barrel
- Owner: adibaba
- Created: 2019-12-09T23:48:52.000Z (over 5 years ago)
- Default Branch: master
- Last Pushed: 2021-12-14T21:37:37.000Z (over 3 years ago)
- Last Synced: 2023-08-12T18:38:30.618Z (almost 2 years ago)
- Language: Java
- Size: 23.4 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Run Elasticsearch with Docker
- See https://www.elastic.co/guide/en/elasticsearch/reference/7.5/docker.html
- `sudo docker pull docker.elastic.co/elasticsearch/elasticsearch:7.5.2`
- `sudo docker run -p 9200:9200 -p 9300:9300 -e "discovery.type=single-node" docker.elastic.co/elasticsearch/elasticsearch:7.5.2`
- List indexes (with column headings): http://localhost:9200/_cat/indices?v=true
# Run BarrelConfigure Barrel via configuration.txt.
```
Configuration file:
configuration.txt
Indexes:
[barrel]
Usage:
index Indexes unknown PDF files
index all Indexes all PDF files
list Lists PDF files
search Searches for query
search Searches for query in filenames containing filter
```You can also run it in Java using MainManual.java.