https://github.com/scrapy-plugins/scrapy-dotpersistence


=====================
scrapy-dotpersistence
=====================

Scrapy extension to sync `.scrapy` folder to an S3 bucket.

Installation
============

You can install scrapy-dotpersistence using pip::

    pip install scrapy-dotpersistence

You can then enable the extension in your `settings.py`::

    EXTENSIONS = {
        ...
        'scrapy_dotpersistence.DotScrapyPersistence': 0
    }

How to use it
=============

Enable the extension through `settings.py`::

    DOTSCRAPY_ENABLED = True

Configure the extension through `settings.py`::

    ADDONS_AWS_ACCESS_KEY_ID = "ABC"
    ADDONS_AWS_SECRET_ACCESS_KEY = "DEF"
    ADDONS_AWS_USERNAME = "username"
    ADDONS_S3_BUCKET = "test-bucket-name"
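
Hard-coded credentials in `settings.py` are easy to leak through version control. As a minimal sketch, assuming the values are exported as environment variables, they could be read when the settings module is imported. The variable names below simply mirror the setting names; that is an assumption, not something the extension requires, and `dotscrapy_settings` is a hypothetical helper::

    import os


    def dotscrapy_settings(environ=os.environ):
        """Build the extension's settings from a mapping of environment variables."""
        return {
            "DOTSCRAPY_ENABLED": True,
            "ADDONS_AWS_ACCESS_KEY_ID": environ.get("ADDONS_AWS_ACCESS_KEY_ID", ""),
            "ADDONS_AWS_SECRET_ACCESS_KEY": environ.get("ADDONS_AWS_SECRET_ACCESS_KEY", ""),
            "ADDONS_AWS_USERNAME": environ.get("ADDONS_AWS_USERNAME", ""),
            "ADDONS_S3_BUCKET": environ.get("ADDONS_S3_BUCKET", ""),
        }


    # In settings.py: expose the values as module-level names,
    # which is where Scrapy looks for settings.
    globals().update(dotscrapy_settings())

Missing variables fall back to empty strings here, so a misconfigured environment still needs to be caught separately.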

You can change the dotpersistence folder path with an environment variable::

    export DOTSCRAPY_DIR='/tmp/.scrapy'
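
When a crawl is started from a Python script rather than a shell, the same variable can be set in-process. This is only a sketch: it assumes the extension reads `DOTSCRAPY_DIR` from the process environment when it is initialised, which this README implies but does not spell out, and `configure_dotscrapy_dir` is a hypothetical helper::

    import os
    import tempfile


    def configure_dotscrapy_dir(path):
        """Create `path` if it is missing and export it as DOTSCRAPY_DIR."""
        path = os.path.abspath(path)
        os.makedirs(path, exist_ok=True)
        os.environ["DOTSCRAPY_DIR"] = path
        return path


    # Call this before the crawler process is started.
    configure_dotscrapy_dir(os.path.join(tempfile.gettempdir(), ".scrapy"))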