https://github.com/fpopic/bigquery-schema-select

(Script) Generates SQL query that selects all fields (recursively for nested fields) from the provided BigQuery schema file.
https://github.com/fpopic/bigquery-schema-select

bigquery bigquery-schema scala sql

Last synced: 2 months ago
JSON representation

(Script) Generates SQL query that selects all fields (recursively for nested fields) from the provided BigQuery schema file.

Host: GitHub
URL: https://github.com/fpopic/bigquery-schema-select
Owner: fpopic
License: apache-2.0
Created: 2020-05-23T22:00:15.000Z (almost 5 years ago)
Default Branch: master
Last Pushed: 2024-08-31T13:26:24.000Z (9 months ago)
Last Synced: 2025-01-21T22:10:26.892Z (4 months ago)
Topics: bigquery, bigquery-schema, scala, sql
Language: Scala
Homepage:
Size: 30.3 KB
Stars: 1
Watchers: 3
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

        # bigquery-schema-select

![Scala CI](https://github.com/fpopic/bigquery-schema-select/workflows/Scala%20CI/badge.svg) 

[](https://search.maven.org/#search%7Cga%7C1%7Cbigquery-schema-select_2.13)

Generates SQL query that selects all fields (recursively for nested fields) from the provided BigQuery schema file.

### Installation

Download latest version `bigquery-schema-select_2.13-X.Y.jar` from [maven releases UI](https://repo1.maven.org/maven2/com/github/fpopic/bigquery-schema-select_2.13/) or using CLI:

```shell script

# replace X.Y with the latest version

wget -O ~/bigquery-schema-select_2.13-X.Y.jar https://repo1.maven.org/maven2/com/github/fpopic/bigquery-schema-select_2.13/X.Y/bigquery-schema-select_2.13-X.Y.jar

```

### Usage

Using existing table: 

```shell script

bq show --schema --format=prettyjson my_project:my_dataset.my_table | java -jar ~/bigquery-schema-select_2.13-X.Y.jar

```

Using JSON schema file:

```shell script

cat my_schema.json | java -jar ~/bigquery-schema-select_2.13-X.Y.jar

```

```json

[

  {

    "name": "A",

    "type": "TIMESTAMP"

  },

  {

    "name": "B",

    "type": "TIMESTAMP"

  },

  {

    "name": "C",

    "type": "RECORD",

    "fields": [

      {

        "name": "D",

        "type": "RECORD",

        "fields": [

          {

            "name": "E",

            "type": "TIMESTAMP"

          },

          {

            "name": "F",

            "type": "RECORD",

            "mode": "REPEATED",

            "fields": [

              {

                "name": "G",

                "type": "STRING"

              }

            ]

          }

        ]

      },

      {

        "name": "H",

        "type": "TIMESTAMP"

      }

    ]

  },

  {

    "name": "I",

    "type": "RECORD",

    "fields": [

      {

        "name": "J",

        "type": "TIMESTAMP"

      },

      {

        "name": "K",

        "type": "TIMESTAMP"

      }

    ]

  },

  {

    "name": "L",

    "type": "RECORD",

    "mode": "REPEATED",

    "fields": [

      {

        "name": "M",

        "type": "TIMESTAMP"

      },

      {

        "name": "N",

        "type": "TIMESTAMP"

      },

      {

        "name": "O",

        "type": "RECORD",

        "fields": [

          {

            "name": "P",

            "type": "TIMESTAMP"

          }

        ]

      }

    ]

  },

  {

    "name": "Q",

    "type": "TIMESTAMP",

    "mode": "REPEATED"

  }

]

```

Would generate:

```sql

SELECT

  A,

  B,

  STRUCT(

    STRUCT(

      C.D.E,

      ARRAY(

        SELECT AS STRUCT

          F.G

        FROM

          UNNEST(C.D.F) AS F

        WITH

          OFFSET

        ORDER BY

          OFFSET

      ) AS F

    ) AS D,

    C.H

  ) AS C,

  STRUCT(

    I.J,

    I.K

  ) AS I,

  ARRAY(

    SELECT AS STRUCT

      L.M,

      L.N,

      STRUCT(

        L.O.P

      ) AS O

    FROM

      UNNEST(L) AS L

    WITH 

      OFFSET

    ORDER BY

      OFFSET

  ) AS L,

  Q

```

In case you would like to use snake_case for field names use flag `--use_snake_case`:

```shell script

cat my_schema.json | java -jar ~/bigquery-schema-select_2.13-X.Y.jar --use_snake_case

```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/fpopic/bigquery-schema-select

Awesome Lists containing this project

README