Projects in Awesome Lists by agile-lab-dev
A curated list of projects in awesome lists by agile-lab-dev.
https://github.com/agile-lab-dev/data-product-specification
An open specification for data products in Data Mesh
Last synced: 08 Mar 2025
https://github.com/agile-lab-dev/darwin
Avro Schema Evolution made easy
avro avro-schema hadoop hbase scala schema-evolution spark
Last synced: 09 Aug 2025
https://github.com/agile-lab-dev/wasp
WASP is a framework for building complex real-time big data applications. It relies on a Kappa/Lambda-style architecture, mainly leveraging Kafka and Spark. If you need to ingest huge amounts of heterogeneous data and analyze them through complex pipelines, this is the framework for you.
akka elasticsearch hadoop hbase hdfs jdbc kafka parquet scala solr spark spark-streaming yarn
Last synced: 09 Apr 2025
https://github.com/agile-lab-dev/witboost-starter-kit
Witboost is a versatile platform that addresses a wide range of sophisticated data engineering challenges. The Starter Kit showcases the integration capabilities and provides a "batteries-included" product.
Last synced: 08 Mar 2025
https://github.com/agile-lab-dev/sparksearchengine
Big Data search with Spark and Lucene
Last synced: 15 Apr 2025
https://github.com/agile-lab-dev/governance-decision-record
The Governance Decision Record (GDR) is a specification model for (computational) data governance policies inspired by the ADR (Architectural Decision Record).
architectural-decision-records data data-governance data-management data-management-platform data-mesh federated-computational-governance governance-decision-record platform policy-as-code
Last synced: 30 Jun 2025
https://github.com/agile-lab-dev/literate-programming-articles
Collection of articles, using the Literate Programming style, about Data Engineering and Software Tooling in general
literate-programming ruby spark spark-connect
Last synced: 26 Aug 2025
https://github.com/agile-lab-dev/honkit-plugin-holaspirit
Honkit plugin to embed Holaspirit roles inside rendered markdown pages
Last synced: 08 Mar 2025
https://github.com/agile-lab-dev/wasp-proxy
A REST interface to inject data into the WASP platform
Last synced: 08 Mar 2025
https://github.com/agile-lab-dev/witboost-azure-adls-output-port-tech-adapter
The Azure ADLS Output Port Tech Adapter. Part of the Witboost Starter Kit: https://github.com/agile-lab-dev/witboost-starter-kit
Last synced: 26 Aug 2025
https://github.com/agile-lab-dev/witboost-azure-datafactory-specific-provisioner
Last synced: 08 Mar 2025
https://github.com/agile-lab-dev/witboost-cdp-cdw-impala-output-port-template
Last synced: 08 Mar 2025
https://github.com/agile-lab-dev/scrupulous-config
A small library and command-line utility that parses and resolves two Typesafe configs and outputs their differences
Last synced: 08 Mar 2025
https://github.com/agile-lab-dev/witboost-aws-s3-tech-adapter
The AWS S3 Tech Adapter. Part of the Witboost Starter Kit: https://github.com/agile-lab-dev/witboost-starter-kit
Last synced: 06 Sep 2025
https://github.com/agile-lab-dev/witboost-azure-adls-storage-area-specific-provisioner
Last synced: 08 Mar 2025
https://github.com/agile-lab-dev/witboost-azure-adls-output-port-template
The Azure ADLS Output Port Template. Part of the Witboost Starter Kit: https://github.com/agile-lab-dev/witboost-starter-kit
Last synced: 15 Jun 2025
https://github.com/agile-lab-dev/witboost-azure-datafactory-workload-template
Last synced: 10 Sep 2025
https://github.com/agile-lab-dev/witboost-aws-athena-output-port-template
The AWS Athena Output Port Template. Part of the Witboost Starter Kit: https://github.com/agile-lab-dev/witboost-starter-kit
Last synced: 28 Jul 2025
https://github.com/agile-lab-dev/witboost-scala-mesh-commons
The scala-mesh-commons library. Part of the Witboost Starter Kit: https://github.com/agile-lab-dev/witboost-starter-kit
Last synced: 08 Mar 2025