An open API service indexing awesome lists of open source software.

https://github.com/edgarrmondragon/meltano-s3-csv-example

Example Meltano project to extract data from files in S3
https://github.com/edgarrmondragon/meltano-s3-csv-example

etl-pipeline meltano

Last synced: about 1 month ago
JSON representation

Example Meltano project to extract data from files in S3

Awesome Lists containing this project

README

        

# Example Meltano project to extract data from files in S3

## Prerequisites

* [Docker][install-docker]
* [Meltano][install-meltano]

## Setup

1. Clone this repository
2. Run `meltano install` to install the project's dependencies
3. Run `docker compose up -d` to start the project's services (MinIO)
4. Run `meltano run tap-spreadsheets-anywhere target-duckdb` to run the pipeline

You should now be be able to query the data in the `output/warehouse.duckdb` file:

```console
$ duckdb output/warehouse.duckdb -c 'select id, email from tap_spreadsheets_anywhere.customers limit 5;'
┌────────┬───────────────────────────┐
│ id │ email │
│ int128 │ varchar │
├────────┼───────────────────────────┤
│ 1 │ [email protected]
│ 2 │ [email protected]
│ 3 │ [email protected]
│ 4 │ [email protected]
│ 5 │ [email protected]
└────────┴───────────────────────────┘
```

[install-docker]: https://docs.docker.com/engine/install/
[install-meltano]: https://docs.meltano.com/getting-started/installation/