https://github.com/elipapa/pokemon_luigi

# pokemon_luigi
An example ETL pipeline task built with Luigi.

### Install
```shell
$ virtualenv pokemon_luigi.env
$ source pokemon_luigi.env/bin/activate
$ pip install -r requirements.txt
# create the SQLite database
$ python manage.py version_control
$ python manage.py upgrade
```

### Usage
```shell
# run the luigi task
# PYTHONPATH=. puts the current directory on the import path so luigi can find the pokemon_etl module
$ PYTHONPATH=. luigi --local-scheduler --module pokemon_etl PokemonAddTypeCounts
# use --LoadPokemonTask-csv-file to load a specific dataset
$ PYTHONPATH=. luigi --local-scheduler --module pokemon_etl \
> PokemonAddTypeCounts --LoadPokemonTask-csv-file datasets/pokemon_cleaned.csv
# verify the results in the db
$ sqlite3 -csv -batch pokemon.db "SELECT * FROM pokemon_type_counts ORDER BY type_1_count DESC"
```
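
To give a rough idea of what an extract, count, and load step of this kind involves, here is a minimal standalone sketch using only the Python standard library. It is not the code in `pokemon_etl`: the sample rows, the `type_1` column name, the table schema, and the helper functions `count_types`, `load_counts`, and `fetch_counts` are all assumptions made for illustration. Only the table name `pokemon_type_counts` and the `type_1_count` column come from the query above.

```python
import csv
import io
import sqlite3
from collections import Counter

# Hypothetical sample data; the real dataset's columns may differ.
SAMPLE_CSV = """name,type_1
Bulbasaur,Grass
Charmander,Fire
Squirtle,Water
Oddish,Grass
Vulpix,Fire
Bellsprout,Grass
"""


def count_types(csv_text):
    """Extract and transform: count rows per value of the assumed `type_1` column."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return Counter(row["type_1"] for row in reader)


def load_counts(conn, counts):
    """Load: write the counts into a pokemon_type_counts table (schema assumed)."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS pokemon_type_counts "
        "(type_1 TEXT PRIMARY KEY, type_1_count INTEGER NOT NULL)"
    )
    conn.executemany(
        "INSERT OR REPLACE INTO pokemon_type_counts VALUES (?, ?)",
        counts.items(),
    )
    conn.commit()


def fetch_counts(conn):
    """Verify: the same ordering as the sqlite3 command above, run from Python."""
    return conn.execute(
        "SELECT type_1, type_1_count FROM pokemon_type_counts "
        "ORDER BY type_1_count DESC"
    ).fetchall()


if __name__ == "__main__":
    # An in-memory database keeps the sketch from touching pokemon.db.
    conn = sqlite3.connect(":memory:")
    load_counts(conn, count_types(SAMPLE_CSV))
    for type_1, type_1_count in fetch_counts(conn):
        print(type_1, type_1_count)
```

In the actual project these steps are presumably split across Luigi tasks, with `LoadPokemonTask` reading the CSV and `PokemonAddTypeCounts` depending on it; the sketch collapses them into plain functions to keep it dependency-free.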