https://github.com/hussein-awala/gdpr-compliant-lakehouse
This repository is a demonstration of how to handle GDPR export and delete requests in an Iceberg Lakehouse to make it GDPR-compliant.
https://github.com/hussein-awala/gdpr-compliant-lakehouse
apache-iceberg apache-spark datalake gdpr lakehouse
Last synced: about 2 months ago
JSON representation
This repository is a demonstration of how to handle GDPR export and delete requests in an Iceberg Lakehouse to make it GDPR-compliant.
- Host: GitHub
- URL: https://github.com/hussein-awala/gdpr-compliant-lakehouse
- Owner: hussein-awala
- Created: 2024-11-24T14:21:40.000Z (11 months ago)
- Default Branch: main
- Last Pushed: 2024-11-24T14:21:54.000Z (11 months ago)
- Last Synced: 2025-08-29T08:51:00.797Z (2 months ago)
- Topics: apache-iceberg, apache-spark, datalake, gdpr, lakehouse
- Language: Jupyter Notebook
- Homepage:
- Size: 9.77 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# How to create a GDPR-compliant Iceberg Lakehouse
This repository is a demonstration of how to handle GDPR export and delete requests in an Iceberg Lakehouse to make it
GDPR-compliant.
## Run the demo
To run the demo, you need to have Docker and Docker Compose installed on your machine. Then, you can run the following
command:
```bash
docker-compose up -d
```
This command will start a Docker container with a Jupyter notebook server, a container with minio (a S3-compatible
object storage) and a container with an Iceberg REST catalog.
In your browser, navigate to [http://localhost:8888/lab/tree/work/notebooks/GDPR-demo.ipynb](http://localhost:8888/lab/tree/work/notebooks/GDPR-demo.ipynb) and run the notebook.
## Shut down the demo
To shut down the demo, you can run the following command:
```bash
docker-compose down --volumes
```
This command will stop and remove the Docker containers and delete the data volumes, but it will keep all the notebooks
and data files created in the `work` directory.