Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/ververica/sql-training
https://github.com/ververica/sql-training
Last synced: 2 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/ververica/sql-training
- Owner: ververica
- Archived: true
- Created: 2019-03-11T16:33:08.000Z (almost 6 years ago)
- Default Branch: master
- Last Pushed: 2021-11-26T09:23:23.000Z (about 3 years ago)
- Last Synced: 2024-08-03T17:21:23.814Z (6 months ago)
- Language: Java
- Size: 27.5 MB
- Stars: 542
- Watchers: 59
- Forks: 229
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
**⚠️ This repository has been archived. ⚠️**
---
# Apache Flink® SQL Training
**This repository provides a training for Flink's SQL API.**
In this training you will learn to:
* run SQL queries on streams.
* use Flink's SQL CLI client.
* perform window aggregations, stream joins, and pattern matching with SQL queries.
* specify a continuous SQL query that maintain a dynamic result table.
* write the result of streaming SQL queries to Kafka and MySQL.Please find the [training instructions](https://github.com/ververica/sql-training/wiki) in the Wiki of this repository.
### Requirements
The training is based on Flink's SQL CLI client and uses Docker Compose to setup the training environment.
You **only need [Docker](https://www.docker.com/)** to run this training.
You don't need Java, Scala, or an IDE.## What is Apache Flink?
[Apache Flink](https://flink.apache.org) is a framework and distributed processing engine for stateful computations over unbounded and bounded data streams. Flink has been designed to run in all common cluster environments, perform computations at in-memory speed and at any scale.
## What is SQL on Apache Flink?
Flink features multiple APIs with different levels of abstraction. SQL is supported by Flink as a unified API for batch and stream processing, i.e., queries are executed with the same semantics on unbounded, real-time streams or bounded, recorded streams and produce the same results. SQL on Flink is commonly used to ease the definition of data analytics, data pipelining, and ETL applications.
The following example shows a SQL query that computes the number of departing taxi rides per hour.
```sql
SELECT
TUMBLE_START(rowTime, INTERVAL '1' HOUR) AS t,
COUNT(*) AS cnt
FROM Rides
WHERE
isStart
GROUP BY
TUMBLE(rowTime, INTERVAL '1' HOUR)
```----
*Apache Flink, Flink®, Apache®, the squirrel logo, and the Apache feather logo are either registered trademarks or trademarks of The Apache Software Foundation.*