https://github.com/durner/crystal-queries
https://github.com/durner/crystal-queries
Last synced: 22 days ago
JSON representation
- Host: GitHub
- URL: https://github.com/durner/crystal-queries
- Owner: durner
- Created: 2021-06-04T13:56:48.000Z (almost 4 years ago)
- Default Branch: master
- Last Pushed: 2021-06-13T13:48:53.000Z (almost 4 years ago)
- Last Synced: 2023-11-23T21:26:26.835Z (over 1 year ago)
- Size: 339 KB
- Stars: 1
- Watchers: 2
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Queries used for Crystal Evaluation
This repository contains the queries used for the evaluation of different
caching strategies in Crystal. Each folder includes all the queries of the
respective data set. The queries stem from five different regions with each
region spanning 200 queries (separated by id; e.g., 1.sql to 200.sql belong
to region 1). Most queries are conjuncted either in 2 or 3 columns which results
in queries that consider ∼ 1% of the tuples for the computation. The complete
region (union of the queries) spans ~10% of the tuples. Thus, the queries
are overlapping.## Datasets
- TPC-H / Lineitem (data generated by the TPC-H dbgen tool)
- Stock Data from https://www.kaggle.com/ehallmar/daily-historical-stock-prices-1970-2018
- Taxi data from https://www1.nyc.gov/site/tlc/about/tlc-trip-record-data.page