Projects in Awesome Lists tagged with data-eng
A curated list of projects in awesome lists tagged with data-eng .
https://github.com/arbaznazir/datalineagepy
86% faster data lineage tracking for pandas DataFrames with zero infrastructure. Real-time monitoring, ML anomaly detection, and enterprise compliance features.
anomaly-detection data-eng data-governance data-lineage data-quality data-science dataframes enterprise etl lineage-tracing machine-learning pandas python
Last synced: 28 Apr 2026
https://github.com/datacody/dbt-jaffle-shop
A hands-on project built to deepen understanding of dbt modeling, testing, and documentation. Based on the Jaffle Shop dataset, the project showcases best practices in transforming and validating source data for business analytics using the modern data stack.
analytics bigquery data-eng data-modeling dbt etl-pipeline sql transformation
Last synced: 02 Aug 2025
https://github.com/christophermoverton/fintech-market-ingestion
Production-grade market data pipeline: Alpaca (Daily & 1Min) → normalized schema → partitioned Parquet → DuckDB analytics + strict QA observability.
algorithmic-trading data-eng data-pipeline data-quality data-validation duckdb etl fintec idempotency market-data observability parquet partitioning python time-series
Last synced: 25 May 2026
https://github.com/shibam120302/whatsapp_chat_data_analysis
An Exhaustive Analysis of WhatsApp Chat Data for Extracting Real-Time Insights, Identifying Usage Patterns, Detecting Spam, and Understanding User Sentiment at Scale
data-eng data-engineering visualization whatsapp whatsapp-analysis
Last synced: 15 May 2026
https://github.com/tknishh/aws-location-service
End-to-end Implementation of the medium article leveraging services of AWS and Algolia.
algolia-api amazon-location-service aws-lambda data-eng dynamodb
Last synced: 02 Feb 2026