An open API service indexing awesome lists of open source software.

https://github.com/thisisnic/awesome-arrow-r

Awesome resources for learning more about Apache Arrow
https://github.com/thisisnic/awesome-arrow-r

List: awesome-arrow-r

awesome awesome-list

Last synced: 5 months ago
JSON representation

Awesome resources for learning more about Apache Arrow

Awesome Lists containing this project

README

        

# Awesome Arrow 🏹

Awesome resources for learning more about things relating to Apache Arrow, focussed on the R package [arrow](https://arrow.apache.org/docs/r/).

If you have any suggestions for other resources to add here, please submit a PR!

Key:

πŸ‘©β€πŸ« Workshop

πŸ“„ Blog post

πŸ“½οΈ Video

🎞️ Slides

## Official docs

* [pkgdown site](https://arrow.apache.org/docs/r/)
* [Arrow R cookbook](https://arrow.apache.org/cookbook/r/)

## General overview

* ["Larger-Than-Memory Data Workflows with Apache Arrow" - UseR! 2022 conference workshop](https://arrow-user2022.netlify.app/) πŸ‘©β€πŸ«
* ["Doing More with Data: An Introduction to Arrow for R Users" by Danielle Navarro](https://www.youtube.com/watch?v=O42LUmJZPx0) πŸ“½οΈ
* ["Getting started with Apache Arrow" by Danielle Navarro](https://blog.djnavarro.net/posts/2021-11-19_starting-apache-arrow-in-r/) πŸ“„
* ["Efficient Data Analysis on Larger-than-Memory Data with DuckDB and Arrow" by Tom Mock](https://www.youtube.com/watch?v=LvTX1ZAZy6M) πŸ“½οΈ
* ["Bigger data with arrow and duckdb" by Tom Mock & Edgar Ruiz](https://jthomasmock.github.io/bigger-data/#1) 🎞️
* ["New Directions for Apache Arrow" by Wes McKinney](https://www.youtube.com/watch?v=u7DecbDw3QE) πŸ“½οΈ
* ["Bigger Data With Ease Using Apache Arrow" by Neal Richardson](https://www.youtube.com/watch?v=zND-Wj2XPvc) πŸ“½οΈ
* ["Apache Arrow: Enabling Data Engineering Tasks in R" by Ian Cook](https://www.youtube.com/watch?v=SXbq4OYtsFA&t=2271s) πŸ“½οΈ

## Data types and Arrow objects

* ["Data serialisation in R" by Danielle Navarro](https://blog.djnavarro.net/posts/2021-11-15_serialisation-with-rds/) πŸ“„
* ["Data types in Arrow and R" by Danielle Navarro](https://blog.djnavarro.net/posts/2022-03-04_data-types-in-arrow-and-r/) πŸ“„
* ["Arrays and tables in Arrow" by Danielle Navarro](https://blog.djnavarro.net/posts/2022-05-25_arrays-and-tables-in-arrow/) πŸ“„

## Arrow bindings/Acero
* ["Binding Apache Arrow to R" by Danielle Navarro](https://blog.djnavarro.net/posts/2022-01-18_binding-arrow-to-r/) πŸ“„
* ["Arrow New Feature Showcase: show_exec_plan()" by Nic Crane](https://thisisnic.github.io/2022/08/26/arrow-new-feature-showcase-show-exec-plan/) πŸ“„

## File formats and partitioning

* ["Creating an Arrow dataset: An exploration of the file formats that Arrow can read and write." by FranΓ§ois Michonneau](https://francoismichonneau.net/2022/08/arrow-dataset-creation/) πŸ“„
* ["Creating an Arrow dataset (part 2): How does partitioning impact query performance?" by FranΓ§ois Michonneau](https://francoismichonneau.net/2022/09/arrow-dataset-part-2/) πŸ“„
* ["Understanding the Parquet file format" by Colin Gillespie](https://www.jumpingrivers.com/blog/parquet-file-format-big-data-r/) πŸ“„
* ["Folks, C’mon, Use Parquet" by Piotr StoroΕΌenko](https://appsilon.com/csv-to-parquet-transition/#csv-problematic) πŸ“„

## Geoarrow
* ["Building Bridges: Arrow, Parquet, and Geospatial Computing" by Dewey Dunnington](https://dewey.dunnington.ca/post/2022/building-bridges-arrow-parquet-and-geospatial-computing/) πŸ“„
* ["Accelerating geospatial computing using Apache Arrow" by Dewey Dunnington](https://www.rstudio.com/conference/2022/talks/accelerating-geospatial-computing-using-apache/) πŸ“½οΈ
* ["Accelerating Geospatial Computing in R and Python Using Apache Arrow" by Dewey Dunnington and Joris Van den Bossche](https://www.youtube.com/watch?v=PbO5FVcPUIQ) πŸ“½οΈ

## Community
* ["Contributing to the Arrow R Package - Get Involved!" by Nic Crane](https://www.youtube.com/watch?v=E__dvxv0Tyg&t=308s) πŸ“½οΈ

## Resources for Arrow devs
* [Series of blog posts around setting up more complex dev environments than those described in the vignettes, by Will Jones](https://www.datawill.io/) πŸ“„