https://github.com/ekzhu/match-recognize-vldb2023
https://github.com/ekzhu/match-recognize-vldb2023
Last synced: 7 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/ekzhu/match-recognize-vldb2023
- Owner: ekzhu
- Created: 2022-09-26T22:23:29.000Z (about 3 years ago)
- Default Branch: main
- Last Pushed: 2022-09-26T22:32:31.000Z (about 3 years ago)
- Last Synced: 2025-01-23T06:32:34.216Z (9 months ago)
- Size: 5.86 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Note
This repository contains the MATCH_RECOGNIZE query templates and scripts for
importing raw datasets into [Trino](https://trino.io/) (Hive + HDFS) and [PostgreSQL 14](https://www.postgresql.org/docs/14/index.html).The raw CSV files for the datasets can be obtained from [here](https://github.com/sigmod2021-index-pattern/index-pattern).
Directory structure:
- query-templates:
- (benchmark dataset)
- (pattern definition)
- (pattern name).sql: MATCH_RECOGNIZE query with example parameters.
- scripts:
- hive_import_(dataset).sql: create tables in Hive after copying the raw CSV files into HDFS.
- trino_import_(dataset).sql: create tables in Trino from Hive tables.
- postgres_import_(dataset).sql: create tables in PostgreSQL from raw CSV files. [Citus](https://github.com/citusdata/citus) extension is required.