Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.


Projects in Awesome Lists tagged with flink

A curated list of projects in awesome lists tagged with flink.

https://github.com/chaokunyang/athena

A task scheduler for spark, flink, mapreduce, java, python, bash

flink hadoop mapreduce spark task-manager task-scheduler

Last synced: 19 Nov 2024

https://github.com/nextbreakpoint/flink-workshop

Example of Flink jobs for workshop (work in progress)

flink java stream-processing

Last synced: 18 Jan 2025

https://github.com/nkonev/flink-pipe

A [single|uber|fat] jar standalone Apache Flink connected to PostgreSQL via the Ververica CDC connector. Uses Flink SQL to replicate data from PostgreSQL to Elasticsearch or another target.

cdc debezium debezium-connector debezium-embedded elastic elasticsearch flink flink-cdc flink-sql flink-stream-processing flink-streaming postgres postgresql replication ververica-connector

Last synced: 08 Nov 2024

https://github.com/hifly81/1brc_streaming

1brc challenge with streaming solutions for Apache Kafka

1brc apache camel-kafka flink kafka kafkastreams ksqldb nifi spark spring-kafka streaming

Last synced: 02 Nov 2024

https://github.com/zzy979/flink-tutorial

Example code for an introductory Flink tutorial

flink

Last synced: 15 Jan 2025

https://github.com/lightbend/flink-k8s-build

A simple project that builds a Flink image with the Prometheus jar from the "standard" Flink images

flink kubernetes

Last synced: 12 Nov 2024

https://github.com/jinsyin/datalink

⚡ Data integration | DataLink is a lightweight data integration framework built on top of DataX, Spark and Flink

batch big-data bigdata cdc data data-collection data-exchange data-integration data-pipeline data-synchronization datalink etl flink flink-cdc framework integration pipeline spark streaming

Last synced: 15 Nov 2024

https://github.com/jinsyin/flink-udfs

User-defined Functions for Flink

flink udf udfs

Last synced: 16 Jan 2025

https://github.com/thomas-bouvier/mastodon-flink-connector

Flink connector for the streaming Mastodon API

flink flink-connector mastodon

Last synced: 15 Dec 2024

https://github.com/fdmsantos/aws-twitter-data-analytics

Project to learn data analytics in AWS using Twitter data

aws data-analytics data-engineering data-science data-visualization flink spark terraform

Last synced: 28 Nov 2024

https://github.com/innogames/flink-real-time-crm-ui

Demo UI for the flink-real-time-crm project using Spring Boot with Thymeleaf. The Demo UI is able to send messages to and consume from the related Kafka topics.

apacheflink apachekafka flink flink-examples flink-stream-processing java kafka spring spring-boot thymeleaf

Last synced: 02 Dec 2024

https://github.com/ev2900/flink_late_arriving_date_event_order

Helps explain how Flink handles late arriving data and the effects on message order

apache-flink flink flink-sql

Last synced: 05 Nov 2024

https://github.com/build-on-aws/real-time-streaming-analytics-application-using-apache-kafka

Sample code repository to build a real-time streaming analytics application using Apache Kafka on AWS

amazon-ecs aws aws-opensearch flink kafka

Last synced: 26 Dec 2024

https://github.com/aiven-open/slack-connector-for-apache-flink

Apache Flink® connector for Slack

flink flink-connector slack

Last synced: 11 Nov 2024

https://github.com/datadome/public-flink-utils

Utilities for Flink

flink public

Last synced: 11 Jan 2025

https://github.com/mcddhub/mcdd-big-data-study

Study project for big data (Hadoop, Zookeeper, Kafka, Flink, Spark)

big-data data-processing docker flink hadoop kafka spark zookeeper

Last synced: 10 Oct 2024

https://github.com/romnn/flink-kafka-protobuf

Minimal working example of protobuf models streamed from Kafka into a Flink pipeline.

cep deserialization event-driven example-repo flink kafka kafkaconnector protobuf serialization stream-processing

Last synced: 19 Jan 2025

https://github.com/hwywl/flink-examples

Code written while learning Flink: demo tests of the main features, with detailed comments, organized by module for easier study.

flink kinesis

Last synced: 08 Jan 2025

https://github.com/codelionx/fastflinkstreams

A Flink demo project using Scala and SBT that analyzes HTTP log data from NASA.

flink flink-stream-processing sbt scala

Last synced: 28 Oct 2024

https://github.com/gordonmurray/apache_flink_using_checkpoints

I tried out Checkpointing in some Flink Jobs to see if it would help with Job restarts

apache docker docker-compose flink s3

Last synced: 04 Dec 2024

https://github.com/lapetitesouris/sensormetrics

Apache Flink Aggregation job

apache-flink flink stream-processing

Last synced: 16 Dec 2024

https://github.com/marceloneppel/apache-beam-golang-udf

Run UDFs (User Defined Functions) on Apache Beam Golang SDK.

apache-beam big-data cloud dataflow flink golang udf

Last synced: 03 Dec 2024

https://github.com/lerouxrgd/squirrel

Flink window word count example in Clojure

clojure flink stream-processing wordcount

Last synced: 12 Jan 2025

https://github.com/stubenhuang/monitop

👀 A non-intrusive system that monitors the numerical values of parameters

dubbo flink monitor rocketmq

Last synced: 04 Dec 2024

https://github.com/sebastianschmidl/fastflinkstreams

A Flink demo project using Scala and SBT that analyzes HTTP log data from NASA.

flink flink-stream-processing sbt scala

Last synced: 19 Nov 2024

https://github.com/nextbreakpoint/flink-on-docker

Scripts for executing Flink jobs on a local Docker Swarm cluster (obsolete)

docker docker-swarm flink infrastructure-as-code

Last synced: 18 Jan 2025

https://github.com/j3-signalroom/supercharge_streamlit-apache_flink

Engaging, interactive visualizations crafted with Streamlit, seamlessly powered by Apache Flink in batch mode to reveal deep insights from data.

apache-flink apache-iceberg aws-glue-data-catalog flink iceberg kafka pyflink streamlit streamlit-dashboard

Last synced: 20 Nov 2024

https://github.com/touk/nussknacker-benchmarks

Benchmarks for Nussknacker - A visual tool to define and run real-time decision algorithms. Brings agility to business teams, liberates developers to focus on technology.

benchmark flink kafka scala

Last synced: 22 Dec 2024

https://github.com/touk/nussknacker-scenario-examples-library

Nussknacker Scenario Examples Library

flink kafka lowcode nussknacker streaming

Last synced: 22 Dec 2024

https://github.com/jinsyin/flink-connector-mongo

Flink connector for MongoDB

apache-flink connector flink mongo mongodb

Last synced: 16 Jan 2025

https://github.com/mgoblin/streams

Pet project to grow Apache Flink skills

flink flink-stream-processing

Last synced: 20 Nov 2024

https://github.com/silvanheller/hackathon-scads

Stream processing to analyze bias and content with regard to religion in the GDELT Event Database. Developed at the 2nd Streamline Hackathon in Leipzig, July 2018

apache-flink flink hackathon maven scala stream-processing

Last synced: 28 Nov 2024

https://github.com/j3-signalroom/apache_flink-kickstarter

Examples of Apache Flink® applications showcasing the DataStream API and Table API in Java and Python, featuring AWS, GitHub, Terraform, and Apache Iceberg.

apache-flink apache-iceberg aws-glue aws-parameter-store aws-s3 aws-secrets-manager flink flink-examples flink-kafka flink-stream-processing github-actions iceberg snowflake streamlit-dashboard terraform-cloud

Last synced: 27 Oct 2024

https://github.com/aikuyun/flinkx

Some modifications to flinkx

data flink

Last synced: 17 Dec 2024

https://github.com/akornatskyy/sample-etl-flink-java

The sample ingests multiline gzipped files of popular books into postgres.

batch-processing etl flink ingestion java postgres sample

Last synced: 29 Nov 2024

https://github.com/worst001/note_bigdata

A collection of big data materials, notes, and manuals

bigdata cdh datawarehouse development flink flume guide hadoop hbase hive learning markdown mkdocs note notebook spark

Last synced: 12 Jan 2025

https://github.com/1ambda/lakehouse-cdc

Playground for Lakehouse CDC (Flink, Iceberg, Kafka and Debezium)

cdc debezium flink iceberg kafka

Last synced: 31 Dec 2024

https://github.com/vemonet/flink-on-openshift

🐿️ Deploy Apache Flink on OpenShift OKD 3.11

flink openshift

Last synced: 10 Jan 2025

https://github.com/martindai/flink-sample

This project contains examples from learning and summarizing Flink

flink kafka

Last synced: 02 Jan 2025

https://github.com/danimonsalve/steaming-with-kafka-flink

Real time Streaming with Apache Flink

docker elasticsearch flink kafka kibana postgresql

Last synced: 12 Jan 2025

https://github.com/build-on-aws/real-time-analytics-with-flink-sql

Sample code repository that contains the sample data set files and Zeppelin notes to run analytical queries with SQL.

aws flink kinesis-data-analytics sql streaming

Last synced: 26 Dec 2024

https://github.com/sernst/flink-local-docker

A simple containerization of Flink for running locally during development.

docker flink sbt scala

Last synced: 14 Jan 2025

https://github.com/yandex-cloud-examples/yc-data-streaming-flink-with-yds

Stream processing of data with Apache Flink and YDS.

flink yandex-cloud yandexcloud ydb yds

Last synced: 29 Dec 2024

https://github.com/p16i/flink-sorter-performance-evaluation

Investigation of code generation approach for improving sort performance

benchmark flink sorting

Last synced: 29 Nov 2024

https://github.com/night-fury-me/real-time-vehicle-data-processing

A repository containing an implementation of a real-time vehicle data processing pipeline that manages and analyzes vehicle data as a single cohesive system.

bigquery cpp data-engineering data-streaming flink grpc kafka python real-time-data-processing

Last synced: 22 Nov 2024

https://github.com/elkoyote07/kafkaflinkplayground

A friendly playground for experimenting with Kafka and Flink integration and development.

apache-flink apache-kafka big-data docker docker-compose flink java kafka opensource python streams web-ui

Last synced: 08 Jan 2025

https://github.com/jorgermduarte/real-time-data-architecture-kafka-flink-dw-k8s

Real-time data processing architecture using Apache Kafka, Flink, and Kubernetes. This project demonstrates how to build a scalable and resilient pipeline for streaming data, performing ETL with Flink, and storing the processed data in a Data Warehouse for analysis.

apache big-data data-pipeline data-warehouse distributed-systems etl flink kafka kubernetes real-time streaming

Last synced: 22 Dec 2024

https://github.com/akornatskyy/flink-compute-service

📍 Run a Flink application on AWS EC2.

aws-ec2 flink service

Last synced: 29 Nov 2024

https://github.com/handsomestwei/flink-word-count

Flink word count. Covers Table API and streaming usage, plus reading from and writing to common data sources

flink flink-examples flink-streaming java-8 mysql

Last synced: 19 Dec 2024

https://github.com/emelis-ptr/sabd2

Project: Systems and Architectures for Big Data

ais big-data docker flink java kafka

Last synced: 08 Jan 2025

https://github.com/lukashass/flink-devcontainer

Apache Flink development environment using Gitpod or VSCode Remote-Containers

apache-flink devcontainer flink gitpod remote-containers vscode

Last synced: 02 Dec 2024

https://github.com/yuhexiong/kafka-data-pipeline-flink-java

Data pipeline from Kafka to Kafka, Doris, MongoDB and Doris to Kafka using Flink Java.

datapipeline doris flink java jdbc kafka mongodb

Last synced: 02 Dec 2024

https://github.com/yuhexiong/kafka-data-pipeline-structured-flink-java

Data pipeline from Kafka to Kafka, Doris and Doris to Kafka using Flink Java.

apache-doris apache-flink apache-kafka doris flink flink-stream-processing kafka

Last synced: 02 Dec 2024

https://github.com/c-kiplimo/realtime-sales-stream

This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessary infrastructure components, including Apache Flink, Elasticsearch, and Postgres.

docker docker-compose elasticsearch flink java postgresql python spring-boot zookeeper

Last synced: 12 Jan 2025

https://github.com/j3-signalroom/mac_flink_with_iceberg

Apache Flink Docker image with Apache Iceberg support for Mac M2, M3, or M4 chips.

apache-flink apache-iceberg flink iceberg

Last synced: 14 Dec 2024

https://github.com/j3-signalroom/linux_flink_with_iceberg

Apache Flink Docker image with Apache Iceberg support for Linux (i.e., non-Mac M chip).

apache-flink apache-iceberg flink iceberg

Last synced: 14 Dec 2024

https://github.com/sowrabh-m/data_processing_using_spark_flink

This project demonstrates data cleaning and processing with Apache Spark and Apache Flink, both locally and on AWS EMR.

aws aws-emr aws-s3 emr-cluster flink flink-stream-processing spark spark-flink spark-streaming

Last synced: 11 Oct 2024

https://github.com/vedang/clink

All the examples in Flink Documentation, but in Clojure

clojure flink java

Last synced: 16 Dec 2024

https://github.com/lupusruber/rnmp_homework1

This project simulates message production and consumption using Kafka, with real-time data transformations via Flink, all running within a Docker environment. Requires: Docker, Git, and Python.

data-engineering docker flink kafka python

Last synced: 17 Dec 2024

https://github.com/getindata/flink-ververica-catalog-proxy

Proxy to the internal Ververica Catalog via the Ververica REST API

flink flink-sql sql ververica-platform

Last synced: 23 Nov 2024

https://github.com/rainbowatcher/notes

Study notes on computer science topics

cs flink hadoop java rust

Last synced: 23 Dec 2024

https://github.com/zag13/kfc

Log processing demo

clickhouse docker-compose filebeat flink kafka

Last synced: 16 Jan 2025

https://github.com/dobraczka/docker-swarm-flink

Deploy Flink in Docker Swarm mode

big-data cluster docker docker-swarm flink

Last synced: 01 Jan 2025

https://github.com/eemmiillyy/flowda

A system to stream database query results to users via access-controlled Kafka topics in near real time.

debezium flink kafka

Last synced: 10 Jan 2025

https://github.com/martishin/flink-examples

Examples of Apache Flink jobs

flink gradle java

Last synced: 20 Dec 2024

https://github.com/librity/rtjvm_flink

Rock The JVM - Apache Flink with Scala

flink flink-sql flink-stream-processing rockthejvm rtjvm scala

Last synced: 08 Jan 2025