Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/lloydzhou/tika-parser

parse server based on tika
https://github.com/lloydzhou/tika-parser

Last synced: 1 day ago
JSON representation

parse server based on tika

Awesome Lists containing this project

README

        

# tika-parser
parse server based on tika

## usage

1. start server
```
pip install regex tika

python main.py
```

2. http client
```
curl http://127.0.0.1:8888 -F 'file=@/path/to/file'

-->
[
{
"page_content": "xxxx",
"metadata": {
"offset": 0,
"length": 1,
"strip": 1,
"source": "xx filename"
}
}
]
```