Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/shelfio/apache-tika-lambda-layer
AWS Lambda layer containing latest version of Apache Tika
https://github.com/shelfio/apache-tika-lambda-layer
apache-tika aws-lambda lambda-layer text-extraction
Last synced: 10 days ago
JSON representation
AWS Lambda layer containing latest version of Apache Tika
- Host: GitHub
- URL: https://github.com/shelfio/apache-tika-lambda-layer
- Owner: shelfio
- License: mit
- Created: 2019-12-16T11:35:24.000Z (almost 5 years ago)
- Default Branch: master
- Last Pushed: 2024-02-27T14:18:39.000Z (9 months ago)
- Last Synced: 2024-08-02T06:13:35.059Z (3 months ago)
- Topics: apache-tika, aws-lambda, lambda-layer, text-extraction
- Language: Shell
- Size: 327 MB
- Stars: 13
- Watchers: 14
- Forks: 5
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: license
Awesome Lists containing this project
- awesome-layers - `shelfio/apache-tika-lambda-layer` - tika](https://api.globadge.com/v1/badgen/aws/lambda/layer/latest-version/us-east-1/764866452798/apache-tika) | (Layers / Utilities)
README
# Apache Tika for AWS Lambda as a layer
> AWS Lambda layer containing the latest [Apache Tika Server](https://tika.apache.org/)
## Getting Started
Click on Layers and choose "Add a layer", and "Provide a layer version
ARN" and enter the following ARN.```
arn:aws:lambda:us-east-1:764866452798:layer:apache-tika:7
```Current version: [2.3.0](https://tika.apache.org/2.3.0/index.html)
## How can I use it with the Lambda runtime which doesn't have Java installed?
The latest Apache Tika version requires Java 8 to be able to work.
If your runtime doesn't have Java installed, you can use [Java 8 Lambda layer](https://github.com/shelfio/java-lambda-layer), which actually was designed to make Apache Tika work on Node.js 12.x runtime.
## Where can I find Apache Tika `.jar` file inside of Lambda when I attached the layer?
You can find it at `/opt/tika-server.jar` (`/opt` is where Lambda unpacks layers).
## Available regions
* ap-northeast-1: `arn:aws:lambda:ap-northeast-1:764866452798:layer:apache-tika:3`
* ap-northeast-2: `arn:aws:lambda:ap-northeast-2:764866452798:layer:apache-tika:3`
* ap-south-1: `arn:aws:lambda:ap-south-1:764866452798:layer:apache-tika:3`
* ap-southeast-1: `arn:aws:lambda:ap-southeast-1:764866452798:layer:apache-tika:3`
* ap-southeast-2: `arn:aws:lambda:ap-southeast-2:764866452798:layer:apache-tika:3`
* ca-central-1: `arn:aws:lambda:ca-central-1:764866452798:layer:apache-tika:3`
* eu-north-1: `arn:aws:lambda:eu-north-1:764866452798:layer:apache-tika:3`
* eu-central-1: `arn:aws:lambda:eu-central-1:764866452798:layer:apache-tika:3`
* eu-west-1: `arn:aws:lambda:eu-west-1:764866452798:layer:apache-tika:3`
* eu-west-2: `arn:aws:lambda:eu-west-2:764866452798:layer:apache-tika:3`
* eu-west-3: `arn:aws:lambda:eu-west-3:764866452798:layer:apache-tika:3`
* sa-east-1: `arn:aws:lambda:sa-east-1:764866452798:layer:apache-tika:3`
* us-east-1: `arn:aws:lambda:us-east-1:764866452798:layer:apache-tika:7`
* us-east-2: `arn:aws:lambda:us-east-2:764866452798:layer:apache-tika:3`
* us-west-1: `arn:aws:lambda:us-west-1:764866452798:layer:apache-tika:3`
* us-west-2: `arn:aws:lambda:us-west-2:764866452798:layer:apache-tika:3`## Update
1. Go to https://tika.apache.org/download.html, download Apache Tika server `runnable jar`
2. Rename the jar file from `tika-server-x.xx.jar` to `tika-server.jar`
3. Create zip archive of Apache Tika server with the filename `tika-server.zip`
4. Put zip archive into this repo
5. Put proper version inside of `publish.sh` & `README.md`
6. Increment layer versions in `README.md`
7. Commit & Create Pull Request## License
MIT © [Shelf](https://shelf.io)