{"id":25010101,"url":"https://github.com/kyopark2014/aws-athena","last_synced_at":"2025-06-27T08:36:47.922Z","repository":{"id":56729269,"uuid":"524273370","full_name":"kyopark2014/aws-athena","owner":"kyopark2014","description":"It decribes how to use aws athena for data analytics","archived":false,"fork":false,"pushed_at":"2022-08-13T03:41:20.000Z","size":32,"stargazers_count":2,"open_issues_count":0,"forks_count":0,"subscribers_count":3,"default_branch":"main","last_synced_at":"2025-02-05T04:52:36.623Z","etag":null,"topics":["aws","aws-athena","query","sql"],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/kyopark2014.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2022-08-13T01:58:09.000Z","updated_at":"2022-11-20T10:24:22.000Z","dependencies_parsed_at":"2022-08-16T00:40:19.482Z","dependency_job_id":null,"html_url":"https://github.com/kyopark2014/aws-athena","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kyopark2014%2Faws-athena","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kyopark2014%2Faws-athena/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kyopark2014%2Faws-athena/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kyopark2014%2Faws-athena/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/kyopark2014","download_url":"https://codeload.github.com/kyopark2014/aws-athena/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"h
ttps://github.com","kind":"github","repositories_count":246268605,"owners_count":20750146,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["aws","aws-athena","query","sql"],"created_at":"2025-02-05T04:52:38.348Z","updated_at":"2025-03-30T02:29:45.413Z","avatar_url":"https://github.com/kyopark2014.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"# AWS Athena\n\nAthena provides interactive queries for a data lake in a serverless environment.\n\n![image](https://user-images.githubusercontent.com/52392004/184464605-c71155b2-b466-42e1-86e7-a17358322b25.png)\n\n\n## Features\n\n#### Simplicity\n\n- Because it is serverless, there is no initial setup and no infrastructure cost.\n- You can query data stored in S3 using SQL.\n- JDBC/ODBC drivers are provided.\n\n#### Pay per query\n\n- You are charged only for the data that a query scans.\n- Compression can reduce query costs by 30-90%.\n\n#### Decouple storage and compute\n\n- Data storage and the query infrastructure are separated.\n- Queries are supported against S3, data warehouses, and databases.\n- It is compatible with a variety of custom data and formats.\n\n#### Security\n\n- Data is encrypted in transit.\n- Authentication is supported through AWS IAM and SAML 2.0.\n- AWS PrivateLink is supported.\n\n## Common usage patterns\n\n[Common Patterns](https://github.com/kyopark2014/aws-athena/blob/main/common-patterns.md) describes patterns for using Athena.\n\n## Federated Query\n\n[Federated Query](https://github.com/kyopark2014/aws-athena/blob/main/federated-query.md) introduces how to use Athena to query not only AWS services such as S3 but also on-premises and other cloud data sources.
\n\n\n## Simple pricing\n\n- DDL operations - Free\n- SQL operations - Free\n- Query concurrency - Free\n- Data scanned - $5 / TB\n\n## Supported data formats\n\nSQL queries are executed by the Presto engine, which supports ANSI SQL. The supported formats are listed below.\n\n- CSV\n- Apache Weblogs\n- JSON\n- Parquet\n- ORC\n\n\n\n## Reference\n\n[Amazon Athena](https://aws.amazon.com/athena/?nc1=h_ls\u0026whats-new-cards.sort-by=item.additionalFields.postDateTime\u0026whats-new-cards.sort-order=desc)\n\n[MakeMyTrip: Leveraging Athena Federated Query to break data silos], re:Invent 2021 ANA301\n\n[How to use SQL to Query S3 files with AWS Athena | Step by Step Tutorial](https://www.youtube.com/watch?v=M5ptG0YaqAs)\n\n[Amazon Athena에 대해 알아보기 (Learn about Amazon Athena)](https://www.youtube.com/watch?v=MAgd-zeB4QU)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fkyopark2014%2Faws-athena","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fkyopark2014%2Faws-athena","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fkyopark2014%2Faws-athena/lists"}