https://github.com/thisisnic/awesome-arrow-r
Awesome resources for learning more about Apache Arrow
https://github.com/thisisnic/awesome-arrow-r
List: awesome-arrow-r
awesome awesome-list
Last synced: 5 months ago
JSON representation
Awesome resources for learning more about Apache Arrow
- Host: GitHub
- URL: https://github.com/thisisnic/awesome-arrow-r
- Owner: thisisnic
- Created: 2022-01-26T11:03:39.000Z (about 3 years ago)
- Default Branch: main
- Last Pushed: 2023-12-13T14:33:06.000Z (over 1 year ago)
- Last Synced: 2024-05-23T08:03:37.364Z (11 months ago)
- Topics: awesome, awesome-list
- Homepage:
- Size: 28.3 KB
- Stars: 72
- Watchers: 6
- Forks: 2
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- jimsghstars - thisisnic/awesome-arrow-r - Awesome resources for learning more about Apache Arrow (Others)
README
# Awesome Arrow πΉ
Awesome resources for learning more about things relating to Apache Arrow, focussed on the R package [arrow](https://arrow.apache.org/docs/r/).
If you have any suggestions for other resources to add here, please submit a PR!
Key:
π©βπ« Workshop
π Blog post
π½οΈ Video
ποΈ Slides
## Official docs
* [pkgdown site](https://arrow.apache.org/docs/r/)
* [Arrow R cookbook](https://arrow.apache.org/cookbook/r/)## General overview
* ["Larger-Than-Memory Data Workflows with Apache Arrow" - UseR! 2022 conference workshop](https://arrow-user2022.netlify.app/) π©βπ«
* ["Doing More with Data: An Introduction to Arrow for R Users" by Danielle Navarro](https://www.youtube.com/watch?v=O42LUmJZPx0) π½οΈ
* ["Getting started with Apache Arrow" by Danielle Navarro](https://blog.djnavarro.net/posts/2021-11-19_starting-apache-arrow-in-r/) π
* ["Efficient Data Analysis on Larger-than-Memory Data with DuckDB and Arrow" by Tom Mock](https://www.youtube.com/watch?v=LvTX1ZAZy6M) π½οΈ
* ["Bigger data with arrow and duckdb" by Tom Mock & Edgar Ruiz](https://jthomasmock.github.io/bigger-data/#1) ποΈ
* ["New Directions for Apache Arrow" by Wes McKinney](https://www.youtube.com/watch?v=u7DecbDw3QE) π½οΈ
* ["Bigger Data With Ease Using Apache Arrow" by Neal Richardson](https://www.youtube.com/watch?v=zND-Wj2XPvc) π½οΈ
* ["Apache Arrow: Enabling Data Engineering Tasks in R" by Ian Cook](https://www.youtube.com/watch?v=SXbq4OYtsFA&t=2271s) π½οΈ## Data types and Arrow objects
* ["Data serialisation in R" by Danielle Navarro](https://blog.djnavarro.net/posts/2021-11-15_serialisation-with-rds/) π
* ["Data types in Arrow and R" by Danielle Navarro](https://blog.djnavarro.net/posts/2022-03-04_data-types-in-arrow-and-r/) π
* ["Arrays and tables in Arrow" by Danielle Navarro](https://blog.djnavarro.net/posts/2022-05-25_arrays-and-tables-in-arrow/) π## Arrow bindings/Acero
* ["Binding Apache Arrow to R" by Danielle Navarro](https://blog.djnavarro.net/posts/2022-01-18_binding-arrow-to-r/) π
* ["Arrow New Feature Showcase: show_exec_plan()" by Nic Crane](https://thisisnic.github.io/2022/08/26/arrow-new-feature-showcase-show-exec-plan/) π## File formats and partitioning
* ["Creating an Arrow dataset: An exploration of the file formats that Arrow can read and write." by FranΓ§ois Michonneau](https://francoismichonneau.net/2022/08/arrow-dataset-creation/) π
* ["Creating an Arrow dataset (part 2): How does partitioning impact query performance?" by FranΓ§ois Michonneau](https://francoismichonneau.net/2022/09/arrow-dataset-part-2/) π
* ["Understanding the Parquet file format" by Colin Gillespie](https://www.jumpingrivers.com/blog/parquet-file-format-big-data-r/) π
* ["Folks, Cβmon, Use Parquet" by Piotr StoroΕΌenko](https://appsilon.com/csv-to-parquet-transition/#csv-problematic) π## Geoarrow
* ["Building Bridges: Arrow, Parquet, and Geospatial Computing" by Dewey Dunnington](https://dewey.dunnington.ca/post/2022/building-bridges-arrow-parquet-and-geospatial-computing/) π
* ["Accelerating geospatial computing using Apache Arrow" by Dewey Dunnington](https://www.rstudio.com/conference/2022/talks/accelerating-geospatial-computing-using-apache/) π½οΈ
* ["Accelerating Geospatial Computing in R and Python Using Apache Arrow" by Dewey Dunnington and Joris Van den Bossche](https://www.youtube.com/watch?v=PbO5FVcPUIQ) π½οΈ## Community
* ["Contributing to the Arrow R Package - Get Involved!" by Nic Crane](https://www.youtube.com/watch?v=E__dvxv0Tyg&t=308s) π½οΈ## Resources for Arrow devs
* [Series of blog posts around setting up more complex dev environments than those described in the vignettes, by Will Jones](https://www.datawill.io/) π