Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/mesmacosta/datacatalog-tag-template-processor

A package to manage Google Cloud Data Catalog Tag Template scripts.
https://github.com/mesmacosta/datacatalog-tag-template-processor

analytics bulk cloud csv datacatalog docker gcp metadata-management

Last synced: 3 months ago
JSON representation

A package to manage Google Cloud Data Catalog Tag Template scripts.

Awesome Lists containing this project

README

        

# Datacatalog Tag Template Processor

[![CircleCI][1]][2] [![PyPi][5]][6] [![License][7]][7] [![Issues][8]][9]

A package to manage Google Cloud Data Catalog Tag Template scripts.

**Disclaimer: This is not an officially supported Google product.**

## Table of Contents

- [Executing in Cloud Shell](#executing-in-cloud-shell)
- [1. Environment setup](#1-environment-setup)
* [1.1. Python + virtualenv](#11-python--virtualenv)
+ [1.1.1. Install Python 3.6+](#111-install-python-36)
+ [1.1.2. Get the source code](#112-get-the-source-code)
+ [1.1.3. Create and activate an isolated Python environment](#113-create-and-activate-an-isolated-python-environment)
+ [1.1.4. Install the package](#114-install-the-package)
* [1.2. Docker](#12-docker)
* [1.3. Auth credentials](#13-auth-credentials)
+ [1.3.1. Create a service account and grant it below roles](#131-create-a-service-account-and-grant-it-below-roles)
+ [1.3.2. Download a JSON key and save it as](#132-download-a-json-key-and-save-it-as)
+ [1.3.3. Set the environment variables](#133-set-the-environment-variables)
- [2. Load Templates from CSV file](#2-load-templates-from-csv-file)
* [2.1. Create a CSV file representing the Templates to be created](#21-create-a-csv-file-representing-the-templates-to-be-created)
* [2.2. Run the ddatacatalog-tag-template-processor script - Create the Tag Templates](#22-run-the-ddatacatalog-tag-template-processor-script---create-the-tag-templates)
* [2.3. Run the datacatalog-tag-template-processor script - Delete the Tag Templates](#23-run-the-datacatalog-tag-template-processor-script---delete-the-tag-templates)

-----

## Executing in Cloud Shell
````bash
# Set your SERVICE ACCOUNT, for instructions go to 1.3. Auth credentials
# This name is just a suggestion, feel free to name it following your naming conventions
export GOOGLE_APPLICATION_CREDENTIALS=~/datacatalog-tag-template-processor-sa.json

# Install datacatalog-tag-template-processor
pip3 install datacatalog-tag-template-processor --user

# Add to your PATH
export PATH=~/.local/bin:$PATH

# Look for available commands
datacatalog-tag-template-processor --help
````

## 1. Environment setup

### 1.1. Python + virtualenv

Using [virtualenv][3] is optional, but strongly recommended unless you use [Docker](#12-docker).

#### 1.1.1. Install Python 3.6+

#### 1.1.2. Get the source code
```bash
git clone https://github.com/mesmacosta/datacatalog-tag-template-processor
cd ./datacatalog-tag-template-processor
```

_All paths starting with `./` in the next steps are relative to the `datacatalog-tag-template-processor`
folder._

#### 1.1.3. Create and activate an isolated Python environment

```bash
pip install --upgrade virtualenv
python3 -m virtualenv --python python3 env
source ./env/bin/activate
```

#### 1.1.4. Install the package

```bash
pip install --upgrade .
```

### 1.2. Docker

Docker may be used as an alternative to run the script. In this case, please disregard the
[Virtualenv](#11-python--virtualenv) setup instructions.

### 1.3. Auth credentials

#### 1.3.1. Create a service account and grant it below roles

- Data Catalog Admin

#### 1.3.2. Download a JSON key and save it as
This name is just a suggestion, feel free to name it following your naming conventions
- `./credentials/datacatalog-tag-template-processor-sa.json`

#### 1.3.3. Set the environment variables

_This step may be skipped if you're using [Docker](#12-docker)._

```bash
export GOOGLE_APPLICATION_CREDENTIALS=~/credentials/datacatalog-tag-template-processor-sa.json
```

## 2. Load Templates from CSV file

### 2.1. Create a CSV file representing the Templates to be created

Templates are composed of as many lines as required to represent all of their fields. The columns are
described as follows:

| Column | Description | Mandatory |
| --- | --- | --- |
| **template_name** | Resource name of the Tag Template for the Tag. | Y |
| **display_name** | Resource name of the Tag Template for the Tag. | Y |
| **field_id** | Id of the Tag Template field. | Y |
| **field_display_name** | Display name of the Tag Template field. | Y |
| **field_type** | Type of the Tag Template field. | Y |
| **enum_values** | Values for the Enum field. | N |

### 2.2. Run the ddatacatalog-tag-template-processor script - Create the Tag Templates

- Python + virtualenv

```bash
datacatalog-tag-template-processor tag-templates create --csv-file CSV_FILE_PATH
```

### 2.3. Run the datacatalog-tag-template-processor script - Delete the Tag Templates

- Python + virtualenv

```bash
datacatalog-tag-template-processor tag-templates delete --csv-file CSV_FILE_PATH
```

*TIPS*
- [sample-input/create-tag-templates][4] for reference;

[1]: https://circleci.com/gh/mesmacosta/datacatalog-tag-template-processor.svg?style=svg
[2]: https://circleci.com/gh/mesmacosta/datacatalog-tag-template-processor
[3]: https://virtualenv.pypa.io/en/latest/
[4]: https://github.com/mesmacosta/datacatalog-tag-template-processor/tree/master/sample-input/create-tag-templates
[5]: https://img.shields.io/pypi/v/datacatalog-tag-template-processor.svg?force_cache=true
[6]: https://pypi.org/project/datacatalog-tag-template-processor/
[7]: https://img.shields.io/github/license/mesmacosta/datacatalog-tag-template-processor.svg
[8]: https://img.shields.io/github/issues/mesmacosta/datacatalog-tag-template-processor.svg
[9]: https://github.com/mesmacosta/datacatalog-tag-template-processor/issues