{"id":13569537,"url":"https://github.com/twalthr/flink-api-examples","last_synced_at":"2025-04-04T05:33:03.176Z","repository":{"id":37754309,"uuid":"415888399","full_name":"twalthr/flink-api-examples","owner":"twalthr","description":"Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.","archived":false,"fork":false,"pushed_at":"2023-09-26T13:55:12.000Z","size":32,"stargazers_count":53,"open_issues_count":5,"forks_count":18,"subscribers_count":3,"default_branch":"main","last_synced_at":"2024-08-02T14:06:41.564Z","etag":null,"topics":["apache-flink","data-engineering","flink","flink-examples","flink-sql","stream-processing"],"latest_commit_sha":null,"homepage":"","language":"Java","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/twalthr.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2021-10-11T11:04:02.000Z","updated_at":"2024-07-20T00:11:37.000Z","dependencies_parsed_at":"2023-01-19T13:16:48.673Z","dependency_job_id":null,"html_url":"https://github.com/twalthr/flink-api-examples","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/twalthr%2Fflink-api-examples","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/twalthr%2Fflink-api-examples/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/twalthr%2Fflink-api-examples/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/twalthr%2Fflink-api-examples/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/twalthr","download_url":"https://codeload.github.com/twalthr/flink-api-examples/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":223100124,"owners_count":17087387,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["apache-flink","data-engineering","flink","flink-examples","flink-sql","stream-processing"],"created_at":"2024-08-01T14:00:41.144Z","updated_at":"2024-11-05T01:32:44.484Z","avatar_url":"https://github.com/twalthr.png","language":"Java","readme":"# Flink API Examples for DataStream API and Table API\n\nThe Table API is not a new kid on the block. But the community has worked hard on reshaping its future. Today, it is one\nof the core abstractions in Flink next to the DataStream API. The Table API can deal with bounded and unbounded streams\nin a unified and highly optimized ecosystem inspired by databases and SQL. Various connectors and catalogs integrate\nwith the outside world.\n\nBut this doesn't mean that the DataStream API will become obsolete any time soon. This repository demos what Table API\nis capable of today. We present how the API solves different scenarios:\n\n- as a batch processor,\n- a changelog processor,\n- a change data capture (CDC) hub,\n- or a streaming ETL tool\n\nwith many built-in functions and operators for deduplicating, joining, and aggregating data.\n\nIt shows hybrid pipelines in which both APIs interact in symbiosis and contribute their unique strengths.\n\n# How to Use This Repository\n\n1. Import this repository into your IDE (preferably IntelliJ IDEA). Select the `pom.xml` file during import to treat it\n   as a Maven project. The project uses the latest Flink 1.15.\n\n2. All examples are runnable from the IDE. You simply need to execute the `main()` method of every example class.\n\n3. In order to make the examples run within IntelliJ IDEA, it is necessary to tick\n   the `Add dependencies with \"provided\" scope to classpath` option in the run configuration under `Modify options`.\n\n4. For the Apache Kafka examples, download and unzip [Apache Kafka](https://kafka.apache.org/downloads). Start up Kafka\n   and Zookeeper:\n\n   ```\n   ./bin/zookeeper-server-start.sh config/zookeeper.properties \u0026\n   \n   ./bin/kafka-server-start.sh config/server.properties \u0026\n   ```\n\n   Run `FillKafkaWithCustomers` and `FillKafkaWithTransactions` to create and fill the Kafka topics with Flink.\n\n5. For the MySQL CDC example, run `StartMySqlContainer` with an available Docker setup to set up a dummy database\n   instance. `FillMySqlWithValues` provides a Flink job to update the database tables while the CDC example is running.\n\n\n","funding_links":[],"categories":["Java"],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ftwalthr%2Fflink-api-examples","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Ftwalthr%2Fflink-api-examples","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ftwalthr%2Fflink-api-examples/lists"}