Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with flink
A curated list of projects in awesome lists tagged with flink .
https://github.com/chaokunyang/athena
A task scheduler for spark, flink, mapreduce, java, python, bash
flink hadoop mapreduce spark task-manager task-scheduler
Last synced: 19 Nov 2024
https://github.com/nextbreakpoint/flink-workshop
Example of Flink jobs for workshop (work in progress)
Last synced: 18 Jan 2025
https://github.com/nkonev/flink-pipe
A [single|uber|fat] jar standalone Apache Flink connected to PostgreSQL via Ververica CDC connector. Usage Flink SQL to replicate data from PostgreSQL to Elasticsearch/something else.
cdc debezium debezium-connector debezium-embedded elastic elasticsearch flink flink-cdc flink-sql flink-stream-processing flink-streaming postgres postgresql replication ververica-connector
Last synced: 08 Nov 2024
https://github.com/hifly81/1brc_streaming
1brc challenge with streaming solutions for Apache Kafka
1brc apache camel-kafka flink kafka kafkastreams ksqldb nifi spark spring-kafka streaming
Last synced: 02 Nov 2024
https://github.com/icai/whybug
fundebug
bigdata bug-tracker elk-stack flink hadoop miniprogram tracking
Last synced: 08 Jan 2025
https://github.com/lightbend/flink-k8s-build
This is a simple project to build Flink image with Prometheus jar out of the "standard" Flink images
Last synced: 12 Nov 2024
https://github.com/hibuz/hadoop-docker
🐳 hadoop ecosystems docker image
data-engineering docker docker-compose flink hadoop hbase hive spark zeppelin
Last synced: 15 Nov 2024
https://github.com/jinsyin/datalink
⚡ 数据集成 | DataLink is a lightweight data integration framework build on top of DataX, Spark and Flink
batch big-data bigdata cdc data data-collection data-exchange data-integration data-pipeline data-synchronization datalink etl flink flink-cdc framework integration pipeline spark streaming
Last synced: 15 Nov 2024
https://github.com/apache/flink-connector-kudu
Apache flink
connector datastream flink kudu sql table
Last synced: 17 Oct 2024
https://github.com/thomas-bouvier/mastodon-flink-connector
Flink connector for the streaming Mastodon API
flink flink-connector mastodon
Last synced: 15 Dec 2024
https://github.com/wtanaka/ansible-role-apache-flink
Ansible role to install Apache Flink
ansible ansible-galaxy ansible-role ansible-roles apache-flink flink galaxy streaming
Last synced: 22 Nov 2024
https://github.com/fdmsantos/aws-twitter-data-analytics
Project to Learn Data analytics in AWS using twitter data
aws data-analytics data-engineering data-science data-visualization flink spark terraform
Last synced: 28 Nov 2024
https://github.com/innogames/flink-real-time-crm-ui
Demo UI for the flink-real-time-crm project using Spring Boot with Thymeleaf. The Demo UI is able to send messages to and consume from the related Kafka topics.
apacheflink apachekafka flink flink-examples flink-stream-processing java kafka spring spring-boot thymeleaf
Last synced: 02 Dec 2024
https://github.com/cemayan/earthquake_collector
earthquakes eureka expo flink golang nodejs react-native stream-processing zuul
Last synced: 17 Jan 2025
https://github.com/ev2900/flink_late_arriving_date_event_order
Helps explain how Flink handles late arriving data and the effects on message order
Last synced: 05 Nov 2024
https://github.com/build-on-aws/real-time-streaming-analytics-application-using-apache-kafka
Sample code repository to build a real-time streaming analytics application using Apache Kafka on AWS
amazon-ecs aws aws-opensearch flink kafka
Last synced: 26 Dec 2024
https://github.com/factorhouse/flex
Flex for Apache Flink
database developer-tools docker dockerfile dockerhub flink management monitoring tool ui
Last synced: 09 Nov 2024
https://github.com/aiven-open/slack-connector-for-apache-flink
Apache Flink® connector for Slack
Last synced: 11 Nov 2024
https://github.com/mcddhub/mcdd-big-data-study
Study project for big data (Hadoop, Zookeeper, Kafka, Flink, Spark)
big-data data-processing docker flink hadoop kafka spark zookeeper
Last synced: 10 Oct 2024
https://github.com/romnn/flink-kafka-protobuf
Minimal working example of protobuf models streamed from kafka into a flink pipeline.
cep deserialization event-driven example-repo flink kafka kafkaconnector protobuf serialization stream-processing
Last synced: 19 Jan 2025
https://github.com/hwywl/flink-examples
Flink 学习产生的代码,对主要的功能进行了demo测试,写了详细的注释,按模块分开方便学习。
Last synced: 08 Jan 2025
https://github.com/jinsyin/flink-formats
Custom Flink Formats
flink flink-formats flink-json-array json-array
Last synced: 16 Jan 2025
https://github.com/codelionx/fastflinkstreams
A Flink demo project using Scala and SBT that analyzes HTTP log data from NASA.
flink flink-stream-processing sbt scala
Last synced: 28 Oct 2024
https://github.com/gordonmurray/apache_flink_using_checkpoints
I tried out Checkpointing in some Flink Jobs to see if it would help with Job restarts
apache docker docker-compose flink s3
Last synced: 04 Dec 2024
https://github.com/lapetitesouris/sensormetrics
Apache Flink Aggregation job
apache-flink flink stream-processing
Last synced: 16 Dec 2024
https://github.com/marceloneppel/apache-beam-golang-udf
Run UDFs (User Defined Functions) on Apache Beam Golang SDK.
apache-beam big-data cloud dataflow flink golang udf
Last synced: 03 Dec 2024
https://github.com/lerouxrgd/squirrel
Flink window word count example in Clojure
clojure flink stream-processing wordcount
Last synced: 12 Jan 2025
https://github.com/stubenhuang/monitop
👀 A Non-intrusive system that monitor numerical value of params
Last synced: 04 Dec 2024
https://github.com/sebastianschmidl/fastflinkstreams
A Flink demo project using Scala and SBT that analyzes HTTP log data from NASA.
flink flink-stream-processing sbt scala
Last synced: 19 Nov 2024
https://github.com/nextbreakpoint/flink-on-docker
Scripts for executing Flink jobs on a local Docker Swarm cluster (obsolete)
docker docker-swarm flink infrastructure-as-code
Last synced: 18 Jan 2025
https://github.com/j3-signalroom/supercharge_streamlit-apache_flink
Engaging, interactive visualizations crafted with Streamlit, seamlessly powered by Apache Flink in batch mode to reveal deep insights from data.
apache-flink apache-iceberg aws-glue-data-catalog flink iceberg kafka pyflink streamlit streamlit-dashboard
Last synced: 20 Nov 2024
https://github.com/touk/nussknacker-benchmarks
Benchmarks for Nussknacker - A visual tool to define and run real-time decision algorithms. Brings agility to business teams, liberates developers to focus on technology.
Last synced: 22 Dec 2024
https://github.com/touk/nussknacker-scenario-examples-library
Nussknacker Scenario Examples Library
flink kafka lowcode nussknacker streaming
Last synced: 22 Dec 2024
https://github.com/jinsyin/flink-connector-mongo
Flink Connector for the MongoDB
apache-flink connector flink mongo mongodb
Last synced: 16 Jan 2025
https://github.com/mgoblin/streaming_examples
flink flink-stream-processing stream-processing
Last synced: 20 Nov 2024
https://github.com/silvanheller/hackathon-scads
Stream Processing to analyze bias and content with regards to religion in the GDELT Event Database. Developed in the 2nd Streamline Hackathon in Leipzig, July 2018
apache-flink flink hackathon maven scala stream-processing
Last synced: 28 Nov 2024
https://github.com/j3-signalroom/apache_flink-kickstarter
Examples of Apache Flink® applications showcasing the DataStream API and Table API in Java and Python, featuring AWS, GitHub, Terraform, and Apache Iceberg.
apache-flink apache-iceberg aws-glue aws-parameter-store aws-s3 aws-secrets-manager flink flink-examples flink-kafka flink-stream-processing github-actions iceberg snowflake streamlit-dashboard terraform-cloud
Last synced: 27 Oct 2024
https://github.com/akornatskyy/sample-etl-flink-java
The sample ingests multiline gzipped files of popular books into postgres.
batch-processing etl flink ingestion java postgres sample
Last synced: 29 Nov 2024
https://github.com/getindata/feast-kafka-postgres-demo
feast-framework feature-store flink streaming
Last synced: 23 Nov 2024
https://github.com/vemonet/flink-on-openshift
🐿️ Deploy Apache Flink on OpenShift OKD 3.11
Last synced: 10 Jan 2025
https://github.com/j3-signalroom/j3-techstack-lexicon
J3's techStack Lexicon.
apache-flink apache-iceberg flink iceberg terraform terraform-cloud
Last synced: 14 Dec 2024
https://github.com/rupeshtr78/blog
Big Data Spark Hadoop Kafka Flink Spark Streaming
aws bigdata cassandra elasticsearch emr-cluster flink hadoop hive hue kafka mapreduce mongodb oozie spark sparkstreaming yarn
Last synced: 12 Jan 2025
https://github.com/danimonsalve/steaming-with-kafka-flink
Real time Streaming with Apache Flink
docker elasticsearch flink kafka kibana postgresql
Last synced: 12 Jan 2025
https://github.com/build-on-aws/real-time-analytics-with-flink-sql
Sample code repository that contains the sample data set files and Zeppelin notes to run analytical queries with SQL.
aws flink kinesis-data-analytics sql streaming
Last synced: 26 Dec 2024
https://github.com/sernst/flink-local-docker
A simple containerization of flink for running locally during development.
Last synced: 14 Jan 2025
https://github.com/yandex-cloud-examples/yc-data-streaming-flink-with-yds
Потоковая обработка данных c помощью Apache Flink и YDS.
flink yandex-cloud yandexcloud ydb yds
Last synced: 29 Dec 2024
https://github.com/martishin/postgres-kafka-flink-streaming-example
PostgreSQL -> Kafka -> Flink streaming example
data-engineering flink golang gradle java kafka postgresql python3
Last synced: 20 Dec 2024
https://github.com/p16i/flink-sorter-performance-evaluation
Investigation of code generation approach for improving sort performance
Last synced: 29 Nov 2024
https://github.com/night-fury-me/real-time-vehicle-data-processing
A repository that contains implementation of a Real-Time Vehicle Data Processing Pipeline that efficiently manages and analyzes vehicle data through a cohesive system.
bigquery cpp data-engineering data-streaming flink grpc kafka python real-time-data-processing
Last synced: 22 Nov 2024
https://github.com/elkoyote07/kafkaflinkplayground
A friendly playground for experimenting with Kafka and Flink integration and development.
apache-flink apache-kafka big-data docker docker-compose flink java kafka opensource python streams web-ui
Last synced: 08 Jan 2025
https://github.com/jorgermduarte/real-time-data-architecture-kafka-flink-dw-k8s
Real-time data processing architecture using Apache Kafka, Flink, and Kubernetes. This project demonstrates how to build a scalable and resilient pipeline for streaming data, performing ETL with Flink, and storing the processed data in a Data Warehouse for analysis.
apache big-data data-pipeline data-warehouse distributed-systems etl flink kafka kubernetes real-time streaming
Last synced: 22 Dec 2024
https://github.com/akornatskyy/flink-compute-service
:round_pushpin: Run Flink application on AWS EC2.
Last synced: 29 Nov 2024
https://github.com/handsomestwei/flink-word-count
flink单词统计。包括table api和streaming使用,以及常见的数据源读写
flink flink-examples flink-streaming java-8 mysql
Last synced: 19 Dec 2024
https://github.com/lukashass/flink-devcontainer
Apache Flink development environment using Gitpod or VSCode Remote-Containers
apache-flink devcontainer flink gitpod remote-containers vscode
Last synced: 02 Dec 2024
https://github.com/yuhexiong/kafka-data-pipeline-flink-java
Data pipeline from Kafka to Kafka, Doris, MongoDB and Doris to Kafka using Flink Java.
datapipeline doris flink java jdbc kafka mongodb
Last synced: 02 Dec 2024
https://github.com/yuhexiong/kafka-data-pipeline-structured-flink-java
Data pipeline from Kafka to Kafka, Doris and Doris to Kafka using Flink Java.
apache-doris apache-flink apache-kafka doris flink flink-stream-processing kafka
Last synced: 02 Dec 2024
https://github.com/yuhexiong/deploy-flink-gcp-guide
apache-flink deployment flink gcp google-cloud-platform
Last synced: 02 Dec 2024
https://github.com/davidkhala/mq
message queue collection
activemq confluent flink kafka message-queue risingwave solace
Last synced: 08 Dec 2024
https://github.com/c-kiplimo/realtime-sales-stream
This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessary infrastructure components, including Apache Flink, Elasticsearch, and Postgres.
docker docker-compose elasticsearch flink java postgresql python spring-boot zookeeper
Last synced: 12 Jan 2025
https://github.com/j3-signalroom/mac_flink_with_iceberg
Apache Flink Docker image with Apache Iceberg support for Mac M2, M3, or M4 chips.
apache-flink apache-iceberg flink iceberg
Last synced: 14 Dec 2024
https://github.com/j3-signalroom/linux_flink_with_iceberg
Apache Flink Docker image with Apache Iceberg support for Linux (i.e., non-Mac M chip).
apache-flink apache-iceberg flink iceberg
Last synced: 14 Dec 2024
https://github.com/sowrabh-m/data_processing_using_spark_flink
This project demonstrates data cleaning, processing with Apache Spark and Apache Flink, both locally and on AWS EMR.
aws aws-emr aws-s3 emr-cluster flink flink-stream-processing spark spark-flink spark-streaming
Last synced: 11 Oct 2024
https://github.com/vedang/clink
All the examples in Flink Documentation, but in Clojure
Last synced: 16 Dec 2024
https://github.com/lupusruber/rnmp_homework1
This project simulates message production and consumption using Kafka, with real-time data transformations via Flink, all running within a Docker environment. Requires: Docker, Git, and Python.
data-engineering docker flink kafka python
Last synced: 17 Dec 2024
https://github.com/getindata/flink-ververica-catalog-proxy
Proxy to the internal Ververica Catalog via Ververica REST Api
flink flink-sql sql ververica-platform
Last synced: 23 Nov 2024
https://github.com/zag13/kfc
日志处理DEMO
clickhouse docker-compose filebeat flink kafka
Last synced: 16 Jan 2025
https://github.com/piotr-kalanski/big-data-dev-environment
Big Data development environment
ansible big-data elasticsearch flink hadoop kafka kibana mysql spark virtual-machine
Last synced: 15 Dec 2024
https://github.com/jinsyin/flink-handbook
《Flink 学习指南》
apache-flink bigdata flink flink-handbook hadoop
Last synced: 16 Jan 2025
https://github.com/dobraczka/docker-swarm-flink
Deploy flink in docker swarm mode
big-data cluster docker docker-swarm flink
Last synced: 01 Jan 2025
https://github.com/eemmiillyy/flowda
A system to stream database query results to users via access controlled kafka topics in near real time.
Last synced: 10 Jan 2025
https://github.com/rupeshtr78/machine_learning
Machine Learning TensorFlow Neural Networks Deep Learning
classification data-analysis deep-learning deep-neural-networks flink jupyter-notebook keras machine-learning machinelearning-python perceptron python3 spark tensorflow
Last synced: 12 Jan 2025
https://github.com/librity/rtjvm_flink
Rock The JVM - Apache Flink with Scala
flink flink-sql flink-stream-processing rockthejvm rtjvm scala
Last synced: 08 Jan 2025