Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/toyama0919/embulk-input-presto
Facebook Presto input plugin for Embulk.
https://github.com/toyama0919/embulk-input-presto
embulk embulk-input-plugin embulk-plugin facebook-presto presto
Last synced: 2 months ago
JSON representation
Facebook Presto input plugin for Embulk.
- Host: GitHub
- URL: https://github.com/toyama0919/embulk-input-presto
- Owner: toyama0919
- License: mit
- Created: 2015-12-07T05:21:49.000Z (about 9 years ago)
- Default Branch: master
- Last Pushed: 2017-03-23T00:18:15.000Z (almost 8 years ago)
- Last Synced: 2024-09-22T05:18:50.575Z (3 months ago)
- Topics: embulk, embulk-input-plugin, embulk-plugin, facebook-presto, presto
- Language: Ruby
- Homepage:
- Size: 18.6 KB
- Stars: 11
- Watchers: 3
- Forks: 2
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE.txt
Awesome Lists containing this project
README
# Facebook Presto input plugin for Embulk [![Gem Version](https://badge.fury.io/rb/embulk-input-presto.svg)](http://badge.fury.io/rb/embulk-input-presto) [![Build Status](https://secure.travis-ci.org/toyama0919/embulk-input-presto.png?branch=master)](http://travis-ci.org/toyama0919/embulk-input-presto)
Facebook Presto input plugin for Embulk.
[see](https://prestodb.io/).## Overview
* **Plugin type**: input
* **Resume supported**: yes
* **Cleanup supported**: yes
* **Guess supported**: no## Configuration
- **host**: host (string, default: `"localhost"`)
- **port**: port (integer, default: `8080`)
- **schema**: schema (string, default: `"default"`)
- **catalog**: catalog (string, default: `"native"`)
- **query**: query (string, required)
- **user**: user (string, default: `"embulk"`)
- **columns**(**deprecated**): columns (array, required)
- **name**: name (string, required)
- **type**: type (string, required)**Warning** : **columns** deprecated since over v0.2.0. Support auto fetch schema.
## Example
```yaml
in:
type: presto
host: presto-cordinator
catalog: store
schema: public
query: |
SELECT
trim(upper(url_decode(keyword))) AS keyword,
count(*) as count
FROM search
CROSS JOIN UNNEST(split(keywords, ',')) AS t (keyword)
WHERE log_date >= (CURRENT_DATE - INTERVAL '90' DAY)
AND length(keywords) != 256
group by keyword
having count(*) >= 10
order by count(*) desc
out:
type: stdout
```## Support type
* TIMESTAMP
* LONG
* DOUBLE
* BOOLEAN
* STRING
* JSON## Build
```
$ rake
```