https://github.com/Hound-fm/podcatcher
Audio media crawler for lbry.
- Host: GitHub
- URL: https://github.com/Hound-fm/podcatcher
- Owner: Hound-fm
- License: mit
- Created: 2021-07-17T03:41:45.000Z (almost 5 years ago)
- Default Branch: main
- Last Pushed: 2022-12-08T13:13:34.000Z (over 3 years ago)
- Last Synced: 2024-11-18T08:55:39.801Z (over 1 year ago)
- Topics: crawler, lbry, python
- Language: Python
- Homepage: https://hound.fm
- Size: 355 KB
- Stars: 5
- Watchers: 2
- Forks: 5
- Open Issues: 22
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- Awesome-LBRY - Podcatcher - An open source audio media crawler for lbry. Helps you discover music and podcasts. (Tools)
README
Audio media crawler for lbry.

## Requirements
- Python [3.8](https://www.python.org/)
- Poetry [1.1.7](https://python-poetry.org/)
- Elasticsearch [7.14.0](https://www.elastic.co/downloads/elasticsearch)
- Lbry-sdk [0.99.0](https://github.com/lbryio/lbry-sdk)
## Development
This project uses [poetry](https://python-poetry.org/) as a dependency management tool.
### Install dependencies
Installs all defined dependencies of the project.
For more information please read the poetry [documentation](https://python-poetry.org/docs/basic-usage/#installing-dependencies).
```shell
poetry install
```
### Tasks
#### Update hooks
Set up and update the pre-commit hooks. Run this once right after the first `poetry install`.
```shell
poetry run task update-hooks
```
#### Format code
Formats the code with black. For more information please read the black [documentation](https://github.com/psf/black).
```shell
poetry run task format
```
## Commands
### Basic usage
For more information please read the poetry [documentation](https://python-poetry.org/docs/basic-usage/#using-poetry-run).
```shell
poetry run podcatcher
```
### Sync
Scan all audio streams to find music and podcast episodes, keeping Elasticsearch in sync.
```shell
poetry run podcatcher sync
```
### Retry sync
Retry a failed sync from the last checkpoint. If no previous sync failed, it just runs a normal sync.
```shell
poetry run podcatcher retry-sync
```
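
The idea of resuming from a checkpoint can be illustrated with a minimal Python sketch. This is not podcatcher's actual implementation: the function `run_sync`, the JSON checkpoint file, and its `next_index` field are all hypothetical names chosen for the example. It only shows the general pattern described above, where a failed run records its position and the next run either resumes from it or starts from the beginning.

```python
import json
from pathlib import Path


def run_sync(items, process, checkpoint_path):
    """Process items in order, resuming after a previous failure if one was recorded.

    Hypothetical sketch: the checkpoint is a small JSON file holding the index
    of the first item that still needs processing.
    """
    checkpoint_path = Path(checkpoint_path)
    start = 0
    if checkpoint_path.exists():
        # A leftover checkpoint means the previous run failed part-way through.
        start = json.loads(checkpoint_path.read_text())["next_index"]

    for index in range(start, len(items)):
        try:
            process(items[index])
        except Exception:
            # Record where to resume, then let the error propagate to the caller.
            checkpoint_path.write_text(json.dumps({"next_index": index}))
            raise

    # A clean finish removes the checkpoint, so the next run is a normal full run.
    checkpoint_path.unlink(missing_ok=True)
    return len(items) - start
```

With no checkpoint file present, `run_sync` behaves like a normal full run, which mirrors the fallback behaviour described for `retry-sync`.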
### Cache sync
Skip the scan and sync existing cached data to Elasticsearch.
```shell
poetry run podcatcher cache-sync
```
### Clear cache
Remove all files in the cache directory.
```shell
poetry run podcatcher clear-cache
```
### Drop
Remove all indices from Elasticsearch and all files from the cache directory.
```shell
poetry run podcatcher drop
```
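
### Scripting the commands
For scheduled or unattended runs, the commands above can be wrapped in a small Python script. The sketch below is only an illustration: the helper names (`build_command`, `run_podcatcher`, `sync_with_retry`) are hypothetical, and it assumes that a failed `sync` exits with a non-zero status, which this README does not confirm.

```python
import subprocess

# Subcommands listed in this README.
SUBCOMMANDS = ("sync", "retry-sync", "cache-sync", "clear-cache", "drop")


def build_command(subcommand):
    """Return the argument list for one podcatcher subcommand."""
    if subcommand not in SUBCOMMANDS:
        raise ValueError(f"unknown podcatcher subcommand: {subcommand!r}")
    return ["poetry", "run", "podcatcher", subcommand]


def run_podcatcher(subcommand, runner=subprocess.run):
    """Run one subcommand and return its exit code.

    `runner` is injectable so the helper can be exercised without poetry installed.
    """
    completed = runner(build_command(subcommand))
    return completed.returncode


def sync_with_retry(runner=subprocess.run):
    """Run `sync`, then `retry-sync` once if the first attempt fails."""
    # Assumption: a failed sync is signalled by a non-zero exit status.
    code = run_podcatcher("sync", runner)
    if code != 0:
        code = run_podcatcher("retry-sync", runner)
    return code


if __name__ == "__main__":
    raise SystemExit(sync_with_retry())
```

Run the script from the project directory so that `poetry run` resolves the project's environment.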