An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with awsglue

A curated list of projects in awesome lists tagged with awsglue .

https://github.com/prestodb/prestorials

Tutorials and examples of how to deploy Presto and connect it to different data sources

aws awsglue data datalake docker example glue lakehouse mongodb presto presto-connector prestodb prestosql sql tutorial walkthrough

Last synced: 24 Oct 2025

https://github.com/undisputed-jay/spotifyapi-data-engineering-project

This projects uses ETL (Extract, Transform and Load) pipeline to extract data from Spotify using its API and loads the data to a data source(AWS Athena). The entire pipeline will be built using Amazon Web Services (AWS).

aws aws-athena aws-cloudformation aws-lambda aws-s3 awsglue python3 sql

Last synced: 06 Apr 2025

https://github.com/tanishkamarrott/real-time-streaming-analytics-with-kinesis-flink-and-opensearch

This project focuses on real-time data streaming with Kinesis, using Flink for advanced processing and OpenSearch for analytics. This architecture has succinctly handled the complete lifecycle of data from ingestion to actionable insights, making it a comprehensive solution.

apacheflink awsglue awslambda cloudcomputing dataengineering kinesisdatastreams opensearch realtimeanalytics

Last synced: 22 Mar 2025

https://github.com/harikishan-ai/harikishan-ai

I am dedicated to delivering innovative solutions that align with business objectives while ensuring optimal performance, reliability, and security. My strong analytical skills, attention to detail, and problem-solving abilities drive me to create effective and efficient solutions.

analystics automation aws-ec2 aws-lambda aws-s3 awsglue cnn deep-neural-networks logs lstm machine-learning oops-in-python powerbi python rnn transformers

Last synced: 20 Mar 2025

https://github.com/vidupriya/aws-glue--data-copy

The function for copying data like CSV, Parquet, avro etc., from a source S3 bucket to a destination S3 bucket using AWS Glue. It includes the necessary setup for the Glue job, logging, reading data from the source bucket, and writing it to the destination bucket

aws awsglue awss3 data data-copying glue glue-job pyspark python3 s3 s3-bucket s3-buckets s3-storage spark

Last synced: 07 Sep 2025

https://github.com/najmaelboutaheri/data-engineering-project-youtube

This project aims to securely manage, process, and analyze structured and semi-structured YouTube data based on video categories and trending metrics. The architecture leverages AWS services to ingest, store, transform, analyze, and visualize data efficiently and at scale.

aws aws-lambda awscli awsglue awsiam awss3 python3 shell

Last synced: 05 Jan 2026