Projects in Awesome Lists by dacort
A curated list of projects in awesome lists by dacort .
https://github.com/dacort/metabase-athena-driver
An Amazon Athena driver for Metabase 0.32 and later
amazon-athena athena athena-driver aws aws-athena metabase metabase-driver
Last synced: 17 Mar 2025
https://github.com/dacort/athena-sqlite
A SQLite driver for S3 and Amazon Athena 😳
amazon-athena athena aws lambda-layer s3 sar serverless sqlite vfs
Last synced: 16 Feb 2025
https://github.com/dacort/mwhich
Generic API to search for movies or TV shows across Netflix, Hulu, iTunes, and Amazon Video on Demand
Last synced: 16 Feb 2025
https://github.com/dacort/faker-cli
Command-line interface to quickly generate fake CSV and JSON data
aws csv deltalake faker-provider json parquet pyarrow
Last synced: 16 Feb 2025
https://github.com/dacort/duckdb-athena-extension
An experimental Athena extension for DuckDB 🐤
Last synced: 16 Feb 2025
https://github.com/dacort/modern-data-lake-storage-layers
Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work
amazon-emr apache-hudi apache-iceberg aws delta-lake hudi iceberg
Last synced: 16 Feb 2025
https://github.com/dacort/demo-code
Bits of code I use during live demos
amazon-athena amazon-emr aws-athena aws-cloudformation aws-cloudformation-templates aws-emr emr-cluster emr-notebooks live-demos
Last synced: 16 Feb 2025
https://github.com/dacort/damons-data-lake
All the code related to building my own data lake
Last synced: 16 Feb 2025
https://github.com/dacort/athena-federation-python-sdk
Unofficial Python SDK for Athena Federation
Last synced: 16 Feb 2025
https://github.com/dacort/golang-sse-demo
A brief demo of real-time plotting with Plotly, Go, and server-sent events
Last synced: 16 Feb 2025
https://github.com/dacort/ci-cd-serverless-spark
Demo for GitHub Universe 2022
Last synced: 16 Feb 2025
https://github.com/dacort/go-meerkat
Meerkat API documentation and Go client
Last synced: 16 Feb 2025
https://github.com/dacort/dm-whacker
A bookmarklet to automatically delete Twitter Direct Messages
Last synced: 16 Feb 2025
https://github.com/dacort/emr-serverless-sql-cli
An experimental tool for running SQL on EMR Serverless
Last synced: 16 Feb 2025
https://github.com/dacort/s3-diff-uploader
Python code to demonstrate differential uploading of files to S3.
Last synced: 16 Feb 2025
https://github.com/dacort/emr-cli-examples
Varied ways of deploying PySpark code to EMR and how the EMR CLI can make it all as easy as a single command.
Last synced: 12 Apr 2025
https://github.com/dacort/redpill
A simple script to get my base OS X system up and running
Last synced: 12 Apr 2025
https://github.com/dacort/s3mpty
A batteries-included tool for deleting the contents of versioned S3 buckets.
Last synced: 12 Apr 2025
https://github.com/dacort/notatsxsw
A combination of jealousy and rage resulted in a Google AppEngine proxy that would filter out SxSW tweets.
Last synced: 12 Apr 2025
https://github.com/dacort/spark-local-environment
An example of using EMR Serverless container image for local environment
Last synced: 12 Apr 2025
https://github.com/dacort/is-remote
A journal of my adventures in remote work
Last synced: 12 Apr 2025
https://github.com/dacort/tweepml
TweepML is an XML format used to represent a list of Tweeps (Twitter users)
Last synced: 16 Feb 2025
https://github.com/dacort/ugrep
Hacked up shell script to grep in UTF-16 files
Last synced: 12 Apr 2025
https://github.com/dacort/emr-job-templates
A sample repository of production-ready Spark code for use with Amazon EMR.
Last synced: 12 Apr 2025
https://github.com/dacort/athena-query-stats
Query your Athena query history using Athena 🙆♂️
Last synced: 12 Apr 2025
https://github.com/dacort/slugplot
Weather visualization to show change in average temperature over time.
Last synced: 12 Apr 2025
https://github.com/dacort/metabase-datasette-driver
A Datasette driver for Metabase
Last synced: 12 Apr 2025
https://github.com/dacort/emr-eks-airflow2-plugin
An experimental Airflow 2.0 plugin for EMR on EKS
Last synced: 12 Apr 2025
https://github.com/dacort/syslog-to-athena
Use Fluentd to send syslogs to Athena for great querying
Last synced: 12 Apr 2025
https://github.com/dacort/jupyter-static-website
A way to continuously deploy Jupyter notebooks to a static website backed by S3.
Last synced: 12 Apr 2025
https://github.com/dacort/airflow-example-dags
Example dags for airflow experimentation
Last synced: 12 Apr 2025
https://github.com/dacort/choirmaster
Go-based poller for dynamic data sources to make them sing with choir.io
Last synced: 12 Apr 2025
https://github.com/dacort/ziply-dsl-monitor
My DSL was severely broken...so I graphed it.
Last synced: 12 Apr 2025
https://github.com/dacort/spark-tweeter
I know ... you always wanted your Spark jobs to be able to tweet, right?
Last synced: 12 Apr 2025
https://github.com/dacort/cargo-crates
An easy way to build data extractors in Docker.
data-engineering docker python
Last synced: 12 Apr 2025
https://github.com/dacort/byteable-calc
A byte-size HTML/JS calculator for making big numbers human-readable
Last synced: 12 Apr 2025
https://github.com/dacort/forklift
Forklift your cargo into different places 🚚
Last synced: 12 Apr 2025
https://github.com/dacort/ha-storybutton
Storybutton integration for Home Assistant
hacs hacs-plugin home-assistant storybutton
Last synced: 03 Apr 2025
https://github.com/dacort/dotfiles
My (n)ever-changing dotfiles / bare git style
Last synced: 12 Apr 2025
https://github.com/dacort/bingdaily
Set your macOS desktop image to the Bing Image of the day
bing-image golang macos wallpaper-changer
Last synced: 12 Apr 2025
https://github.com/dacort/cypress_type_repro
Reproduces cypress-io/cypress#5480
Last synced: 12 Apr 2025
https://github.com/dacort/crashplan-audit
Make sure your files are _actually_ backed up by Crashplan
Last synced: 12 Apr 2025
https://github.com/dacort/homebrew-formulas
Personal homebrew formula/casks
Last synced: 12 Apr 2025
https://github.com/dacort/serverless-twitter-analytics
An example of using Amazon MSK and Amazon Athena to query Twitter data
Last synced: 12 Apr 2025
https://github.com/dacort/athena-gsheets
Athena Data Source Connector for 0day Google Sheet
Last synced: 12 Apr 2025