An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with data-eng

A curated list of projects in awesome lists tagged with data-eng .

https://github.com/arbaznazir/datalineagepy

86% faster data lineage tracking for pandas DataFrames with zero infrastructure. Real-time monitoring, ML anomaly detection, and enterprise compliance features.

anomaly-detection data-eng data-governance data-lineage data-quality data-science dataframes enterprise etl lineage-tracing machine-learning pandas python

Last synced: 28 Apr 2026

https://github.com/datacody/dbt-jaffle-shop

A hands-on project built to deepen understanding of dbt modeling, testing, and documentation. Based on the Jaffle Shop dataset, the project showcases best practices in transforming and validating source data for business analytics using the modern data stack.

analytics bigquery data-eng data-modeling dbt etl-pipeline sql transformation

Last synced: 02 Aug 2025

https://github.com/christophermoverton/fintech-market-ingestion

Production-grade market data pipeline: Alpaca (Daily & 1Min) → normalized schema → partitioned Parquet → DuckDB analytics + strict QA observability.

algorithmic-trading data-eng data-pipeline data-quality data-validation duckdb etl fintec idempotency market-data observability parquet partitioning python time-series

Last synced: 25 May 2026

https://github.com/shibam120302/whatsapp_chat_data_analysis

An Exhaustive Analysis of WhatsApp Chat Data for Extracting Real-Time Insights, Identifying Usage Patterns, Detecting Spam, and Understanding User Sentiment at Scale

data-eng data-engineering visualization whatsapp whatsapp-analysis

Last synced: 15 May 2026

https://github.com/tknishh/aws-location-service

End-to-end Implementation of the medium article leveraging services of AWS and Algolia.

algolia-api amazon-location-service aws-lambda data-eng dynamodb

Last synced: 02 Feb 2026