Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/xenit-eu/solr-backup
Repository which creates an artifact allowing to backup solr on S3 / Caringo Swarm
https://github.com/xenit-eu/solr-backup
Last synced: 6 days ago
JSON representation
Repository which creates an artifact allowing to backup solr on S3 / Caringo Swarm
- Host: GitHub
- URL: https://github.com/xenit-eu/solr-backup
- Owner: xenit-eu
- License: apache-2.0
- Created: 2021-07-29T11:23:34.000Z (over 3 years ago)
- Default Branch: master
- Last Pushed: 2024-04-15T08:44:18.000Z (9 months ago)
- Last Synced: 2024-04-16T13:05:15.152Z (9 months ago)
- Language: Java
- Size: 157 KB
- Stars: 0
- Watchers: 4
- Forks: 0
- Open Issues: 11
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE
Awesome Lists containing this project
README
# Solr backup
This module builds an artifact which can be used to backup and restore solr indexes into S3 and (some of the)
S3-compatible storage endpoints.The original code for the S3 backup was taken
from: https://github.com/athrog/solr/tree/solr-15089/solr/contrib/s3-repository.
It has been packported to solr6, since this is the version used by Alfresco currently. It cannot be backported further
to solr4, because the only backup repository-storage supported back then was the filesystem.
Backported version is available in src/main/java/org/apache/solr/s3.Since that version does not work fully even with Amazon S3, an adapted version has been implemented in
src/main/java/eu/xenit/solr/backup/s3.This new version has been tested against S3-compatible backends:
* localstack s3
Integration tests follow the same line:
* wait for solr to be populated with data, tracking alfresco
* trigger a backup /solr/alfresco/replication?command=backup&repository=s3&location=s3:///&numberToKeep=3
* check if the backup finished in a certain timeout (3 minutes) by following the output of
/solr/alfresco/replication?command=details
* trigger a restore /solr/alfresco/replication?command=restore&repository=s3&location=s3:///
* check if the restore was successful in a certain timeout (3 minutes) by following the output of
/solr/alfresco/replication?command=restorestatus
## Setupyou need to put the solr.xml file under /opt/alfresco-search-services/solrhome/
```${adminHandler:org.alfresco.solr.AlfrescoCoreAdminHandler}
${S3_BUCKET_NAME:}
${S3_ENDPOINT:http://s3.eu-central-1.amazonaws.com}
${S3_REGION:eu-central-1}
${S3_ACCESS_KEY:}
${S3_SECRET_KEY:}
${S3_PROXY_HOST:}
${S3_PROXY_PORT:0}
${S3_PATH_STYLE_ACCESS_ENABLED:false}
```
and specify the Environment or Java variables
## Variables
all of these variable can be set as environment variable or as a system property so that it is substituted in solr.xml
| Environment variable | Java system property | Default | required |
|------------------------------|--------------------------------|--------------------------------------|----------|
| S3_ENDPOINT | -DS3_ENDPOINT | http://s3.eu-central-1.amazonaws.com | false |
| S3_BUCKET_NAME | -DS3_BUCKET_NAME | | true |
| S3_REGION | -DS3_REGION | eu-central-1 | false |
| S3_ACCESS_KEY | -DS3_ACCESS_KEY | | false |
| S3_SECRET_KEY | -DS3_SECRET_KEY | | false |
| S3_PROXY_HOST | -DS3_PROXY_HOST | | false |
| S3_PROXY_PORT | -DS3_PROXY_PORT | | false |
| S3_PATH_STYLE_ACCESS_ENABLED | -DS3_PATH_STYLE_ACCESS_ENABLED | false | false |## Testing against DataCore Swarm docker
./gradlew integration-tests:solr6:integrationTestSwarmDocker
Solr waits for the Swarm environment to be started and then requests an ACCESS_KEY via an init script, which is further
used to authenticate.## Testing LocalStack S3 or any other s3 bucket
Before running the tests, following variables need to be set to an S3 bucket and working credentials.
export S3_ENDPOINT=...
export S3_REGION=...
export S3_BUCKET_NAME=...
export S3_ACCESS_KEY=...
export S3_SECRET_KEY=...
./gradlew integration-tests:solr6:integrationTest