Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/ev2900/iceberg_emr_athena
Resources from an virtual tech talk / workshop - Set Up and Use Apache Iceberg Tables on Your Data Lake
https://github.com/ev2900/iceberg_emr_athena
apache-iceberg athena aws emr spark
Last synced: 3 months ago
JSON representation
Resources from an virtual tech talk / workshop - Set Up and Use Apache Iceberg Tables on Your Data Lake
- Host: GitHub
- URL: https://github.com/ev2900/iceberg_emr_athena
- Owner: ev2900
- Created: 2023-03-21T21:03:01.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2024-10-17T01:30:54.000Z (4 months ago)
- Last Synced: 2024-10-19T03:06:46.460Z (4 months ago)
- Topics: apache-iceberg, athena, aws, emr, spark
- Language: Jupyter Notebook
- Homepage:
- Size: 12.3 MB
- Stars: 1
- Watchers: 2
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Workshop Resources
![]()
This repository contains the resources used in the [Analytics Workshop: Set Up and Use Apache Iceberg Tables on Your Data Lake](https://pages.awscloud.com/Analytics-Workshop-Set-Up-and-Use-Apache-Iceberg-Tables-on-Your-Data-Lake_2023_VW-0406-ABD?trk=0986d705-daec-4430-8ab3-53e50c51ea99&sc_channel=el)
A copy of the presentation used in the workshop is available at [Iceberg_Athena_EMR.pptx](https://github.com/ev2900/Iceberg_EMR_Athena/blob/main/Iceberg_Athena_EMR.pptx)
The SQL used in the Athena demonstration is available
* [Athena_Iceberg_SQL_1.sql](https://github.com/ev2900/Iceberg_EMR_Athena/blob/main/Athena_Iceberg_SQL_1.sql)
* [Athena_Iceberg_SQL_2.sql](https://github.com/ev2900/Iceberg_EMR_Athena/blob/main/Athena_Iceberg_SQL_2.sql)
* [Athena_Iceberg_SQL_3.sql](https://github.com/ev2900/Iceberg_EMR_Athena/blob/main/Athena_Iceberg_SQL_3.sql)The EMR Jupyter notebook used in demonstration is avilable [EMR_Iceberg_PySpark.ipynb](https://github.com/ev2900/Iceberg_EMR_Athena/blob/main/EMR_Iceberg_PySpark.ipynb)