Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/chezou/sparkavro

Load Avro data into Spark with sparklyr
https://github.com/chezou/sparkavro

avro r spark sparklyr

Last synced: 3 months ago
JSON representation

Load Avro data into Spark with sparklyr

Lists

README

        

[![Travis-CI Build Status](https://travis-ci.org/chezou/sparkavro.svg?branch=master)](https://travis-ci.org/chezou/sparkavro)

# sparkavro

Load Avro data into Spark with sparklyr. It is a wrapper of [spark-avro](https://github.com/databricks/spark-avro)

## Installation

Install using `{devtools}` as follows:

```r
devtools::install_github("chezou/sparkavro")
```

## Usage

```r
library(sparklyr)
library(sparkavro)
sc <- spark_connect(master = "spark://HOST:PORT")
df <- spark_read_avro(sc, "test_table", "/user/foo/test.avro")

spark_write_avro(df, "/tmp/output")
```

Example data are from https://github.com/miguno/avro-cli-examples