https://github.com/gretelai/multi-table
Notebook and code to synthesize relational databases such as Postgres and MySQL.
anonymization pii-detection postgres synthetic-data
- Host: GitHub
- URL: https://github.com/gretelai/multi-table
- Owner: gretelai
- License: apache-2.0
- Archived: true
- Created: 2022-04-15T15:19:09.000Z (about 4 years ago)
- Default Branch: main
- Last Pushed: 2022-10-24T19:10:23.000Z (over 3 years ago)
- Last Synced: 2025-02-28T15:11:08.715Z (over 1 year ago)
- Topics: anonymization, pii-detection, postgres, synthetic-data
- Language: Jupyter Notebook
- Homepage:
- Size: 2.78 MB
- Stars: 8
- Watchers: 24
- Forks: 1
- Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# Gretel Multi-Table
This code is designed to run transforms or synthetics directly on a relational database while keeping the referential integrity of primary and foreign keys intact.
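To make "keeping referential integrity intact" concrete, here is a minimal sketch of the general idea. It is not taken from this repository: the `remap_keys` helper, the table layout, and the column names are all hypothetical. It replaces primary keys with new surrogate values and applies the same mapping to the foreign keys that reference them, so every child row still points at an existing parent row.

```python
def remap_keys(parent_rows, child_rows, pk="id", fk="user_id"):
    """Illustrative helper (hypothetical, not part of this repo).

    Assigns new sequential primary keys to the parent rows and rewrites
    the child rows' foreign keys with the same old-to-new mapping.
    """
    mapping = {}
    new_parents = []
    for new_id, row in enumerate(parent_rows, start=1):
        mapping[row[pk]] = new_id
        new_parents.append({**row, pk: new_id})

    # Every foreign key is translated through the same mapping, so the
    # parent/child relationships are unchanged even though the key values are.
    new_children = [{**row, fk: mapping[row[fk]]} for row in child_rows]
    return new_parents, new_children


# Made-up example data: two users and three orders that reference them.
users = [{"id": 101, "name": "a"}, {"id": 205, "name": "b"}]
orders = [
    {"order_id": 1, "user_id": 205},
    {"order_id": 2, "user_id": 101},
    {"order_id": 3, "user_id": 205},
]

new_users, new_orders = remap_keys(users, orders)

# Each order still refers to a user that exists in the remapped table.
valid_ids = {u["id"] for u in new_users}
assert all(o["user_id"] in valid_ids for o in new_orders)
```

The same principle applies whether the keys are transformed or synthesized: whatever happens to a primary key value has to happen identically to every foreign key that refers to it.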
# Get Started
Install, preferably into a virtual environment:
```
pip install -U .
```
## Running the notebooks
1. Launch either the transform or synthetics notebook
2. Update your database connection string
3. If running transforms, update the location of your policy files
4. When viewing data or the synthetic performance report, update the relevant cell to use your table names
5. Update where you'd like your final data stored
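
Before editing the connection string in a notebook, it can help to check that it parses as expected. The sketch below assumes a URL-style string of the form `dialect://user:password@host:port/database`; the README does not state the exact format the notebooks expect, so treat that form, the `describe_connection` helper, and the demo credentials as assumptions.

```python
from urllib.parse import urlsplit


def describe_connection(conn_str):
    """Split a URL-style connection string into its parts (hypothetical helper)."""
    parts = urlsplit(conn_str)
    database = parts.path.lstrip("/")
    if not parts.scheme or not parts.hostname or not database:
        raise ValueError(
            "expected something like dialect://user:password@host:port/database"
        )
    return {
        "dialect": parts.scheme,
        "user": parts.username,
        "host": parts.hostname,
        "port": parts.port,
        "database": database,
    }


# Placeholder values only; substitute your own connection details.
info = describe_connection("postgresql://demo_user:demo_pw@localhost:5432/demo_db")
print(info["dialect"], info["host"], info["port"], info["database"])
```

A check like this only validates the shape of the string; it does not open a connection or confirm that the credentials work.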