https://github.com/ev2900/iceberg_emr_athena

Resources from a virtual tech talk / workshop: Set Up and Use Apache Iceberg Tables on Your Data Lake
Topics: apache-iceberg, athena, aws, emr, spark


# Workshop Resources

This repository contains the resources used in the [Analytics Workshop: Set Up and Use Apache Iceberg Tables on Your Data Lake](https://pages.awscloud.com/Analytics-Workshop-Set-Up-and-Use-Apache-Iceberg-Tables-on-Your-Data-Lake_2023_VW-0406-ABD?trk=0986d705-daec-4430-8ab3-53e50c51ea99&sc_channel=el).

A copy of the presentation used in the workshop is available at [Iceberg_Athena_EMR.pptx](https://github.com/ev2900/Iceberg_EMR_Athena/blob/main/Iceberg_Athena_EMR.pptx).

The SQL used in the Athena demonstration is available in the following files:
* [Athena_Iceberg_SQL_1.sql](https://github.com/ev2900/Iceberg_EMR_Athena/blob/main/Athena_Iceberg_SQL_1.sql)
* [Athena_Iceberg_SQL_2.sql](https://github.com/ev2900/Iceberg_EMR_Athena/blob/main/Athena_Iceberg_SQL_2.sql)
* [Athena_Iceberg_SQL_3.sql](https://github.com/ev2900/Iceberg_EMR_Athena/blob/main/Athena_Iceberg_SQL_3.sql)

The EMR Jupyter notebook used in the demonstration is available at [EMR_Iceberg_PySpark.ipynb](https://github.com/ev2900/Iceberg_EMR_Athena/blob/main/EMR_Iceberg_PySpark.ipynb).