Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/Hound-fm/podcatcher

Audio media crawler for lbry.
https://github.com/Hound-fm/podcatcher

crawler lbry python

Last synced: 3 months ago
JSON representation

Audio media crawler for lbry.

Lists

README

        


Podcatcher





Audio media crawler for lbry.




PyPI

Discord

## Requirements

- Python [3.8](https://www.python.org/)
- Poetry [1.1.7](https://python-poetry.org/)
- Elasticsearch [7.14.0](https://www.elastic.co/downloads/elasticsearch)
- Lbry-sdk [0.99.0](https://github.com/lbryio/lbry-sdk)

## Development
This project uses [poetry](https://python-poetry.org/) as a dependency management tool.

### Install dependencies
Installs all defined dependencies of the project.
For more information please read the poetry [documentation](https://python-poetry.org/docs/basic-usage/#installing-dependencies).

```shell
poetry install
```
### Tasks
#### Update hooks
Setup and update pre-commit hooks. You should run this the first time after poetry install.
```shell
poetry run task update-hooks
```

#### Format code
For more information please read the black [documentation](https://github.com/psf/black)
```shell
poetry run task format
```

## Commands

### Basic usage

For more information please read the poetry [documentation](https://python-poetry.org/docs/basic-usage/#using-poetry-run).

```shell
poetry run podcatcher
```

### Sync
Scan all audio streams to find music and podcasts episodes, keeping elasticsearch in sync.

```shell
poetry run podcatcher sync
```

### Retry sync

Retry failed sync from last checkpoint. If no previous failed sync occured it will just run a normal sync.
```shell
poetry run podcatcher retry-sync
```

### Cache sync
Skip scan and sync existent cache data to elasticsearch.

```shell
poetry run podcatcher cache-sync
```

### Clear cache
Remove all files on the cache directory.
```shell
poetry run podcatcher clear-cache
```

### Drop
Remove all indices from elasticsearch and all files from the cache directory.

```shell
poetry run podcatcher drop
```