https://github.com/zhangyuan/dataworks-cli

Hope to save data engineers from Aliyun Dataworks.

# dataworks-cli

A tool that helps data engineers and data architects understand what is locked inside Dataworks more clearly and efficiently.

**FYI: Dataworks is a service provided by Aliyun. This project is not associated with Dataworks or Aliyun.**

## Usage

### Prepare the `.env` file

Add the following environment variables to the `.env` file:

```env
ACCESS_KEY_ID=
ACCESS_KEY_SECRET=
DATAWORKS_PROJECT_ID=
DATAWORKS_ENDPOINT=dataworks.cn-beijing.aliyuncs.com # or other endpoint
```
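
Before running any command, it can help to confirm that every key has a value. The function below is a minimal sketch and is not part of dataworks-cli: `missing_env_keys` is a hypothetical helper name, and it assumes all four keys listed above are required, which this README does not state explicitly.

```bash
# Hypothetical helper (not shipped with dataworks-cli): print each expected
# key that is absent from the .env file or has an empty value.
missing_env_keys() {
  env_file="${1:-.env}"
  for key in ACCESS_KEY_ID ACCESS_KEY_SECRET DATAWORKS_PROJECT_ID DATAWORKS_ENDPOINT; do
    # A key counts as set only when "KEY=" is followed by a character that is
    # neither whitespace nor the start of a comment.
    if ! grep -Eq "^${key}=[^[:space:]#]" "$env_file" 2>/dev/null; then
      echo "$key"
    fi
  done
}
```

Calling `missing_env_keys .env` prints nothing when all four keys are filled in, and one key name per line otherwise.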

### Get the file list by file types

```bash
dataworks-cli files list -t 10,23 -o files/manifest.json
```
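
The values passed to `-t` look like the numeric file type codes of the DataWorks `ListFiles` API, where, to the best of our knowledge, `6` is Shell, `10` is ODPS SQL and `23` is Data Integration. That mapping is an assumption about how this tool interprets `-t`, so check it against the Aliyun documentation. Under that assumption, a small hypothetical helper (`file_type_codes`, not part of dataworks-cli) can build the argument from readable names:

```bash
# Hypothetical helper: translate readable names into the comma-separated
# numeric codes expected by -t. The name-to-code mapping is an assumption
# based on the DataWorks ListFiles API; verify it before relying on it.
file_type_codes() {
  codes=""
  for name in "$@"; do
    case "$name" in
      shell)            code=6 ;;
      odps-sql)         code=10 ;;
      data-integration) code=23 ;;
      *) echo "unknown file type: $name" >&2; return 1 ;;
    esac
    # Append with a comma only when the list is already non-empty.
    codes="${codes:+$codes,}$code"
  done
  echo "$codes"
}
```

It could then be used as `dataworks-cli files list -t "$(file_type_codes odps-sql data-integration)" -o files/manifest.json`.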

### Download the files from the file list

```bash
dataworks-cli files download -i files/manifest.json -o ./files
```

### Get the table list by data source

```bash
dataworks-cli files download -i files.json -o ./files
```

### Get the DI sync tasks

```bash
dataworks-cli di list-sync-tasks -s data-source-name -o di-sync-tasks.json
```

### Get the node list by project env

```bash
dataworks-cli nodes list -e PROD -o nodes.json
```
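
### Run several exports in one go

The list commands above can be chained into one script. The sketch below is hypothetical and not part of dataworks-cli: `export_all` and the `DATAWORKS_CLI` variable are names introduced here for illustration, and the flags are copied unchanged from the examples in this README. Creating the output directory up front is a precaution, since it is not documented whether the tool creates it.

```bash
# Hypothetical wrapper: write the file list, DI sync tasks and node list into
# one output directory, stopping at the first command that fails.
# DATAWORKS_CLI may point at another binary, or at a stub such as "echo" for a
# dry run that only prints the arguments.
export_all() {
  out_dir="$1"
  data_source="$2"
  cli="${DATAWORKS_CLI:-dataworks-cli}"
  mkdir -p "$out_dir" || return 1
  "$cli" files list -t 10,23 -o "$out_dir/manifest.json" &&
    "$cli" di list-sync-tasks -s "$data_source" -o "$out_dir/di-sync-tasks.json" &&
    "$cli" nodes list -e PROD -o "$out_dir/nodes.json"
}
```

For example, `export_all ./export data-source-name` would leave three JSON files under `./export`.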