Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.


Projects in Awesome Lists tagged with flink

A curated list of projects in awesome lists tagged with flink.

https://github.com/apache/flink

Apache Flink

big-data flink java python scala sql

Last synced: 30 Dec 2024

https://github.com/zhisheng17/flink-learning

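flink learning blog. http://www.54tianzhisheng.cn/ Covers Flink basics, concepts, principles, hands-on practice, performance tuning, and source code analysis. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, Table API & SQL, and more, plus case studies of large-scale Flink production projects (PV/UV, log storage, real-time deduplication of ten-billion-scale data, monitoring and alerting). Support for my column "Big Data Real-Time Computing Engine: Flink in Practice and Performance Optimization" is welcome.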

clickhouse elasticsearch flink hbase influxdb kafka loki mysql opentsdb rabbitmq redis rocketmq spark stream-processing streaming

Last synced: 30 Dec 2024

https://github.com/wangzhiwubigdata/god-of-bigdata

Focused on big data learning and interviews; the road to big data mastery starts here. Flink/Spark/Hadoop/Hbase/Hive...

azkaban bigdata flink flume hadoop hbase hdfs hive kafka spark zookeeper

Last synced: 31 Dec 2024

https://github.com/wangzhiwubigdata/God-Of-BigData

Focused on big data learning and interviews; the road to big data mastery starts here. Flink/Spark/Hadoop/Hbase/Hive...

azkaban bigdata flink flume hadoop hbase hdfs hive kafka spark zookeeper

Last synced: 30 Oct 2024

https://github.com/risingwavelabs/risingwave

Best-in-class stream processing, analytics, and management. Perform continuous analytics, or build event-driven applications, real-time ETL pipelines, and feature stores in minutes. Unified streaming and batch. PostgreSQL compatible.

analytics big-data cloud-native data-engineering database distributed-database etl flink kafka ksqldb materialized-view postgres postgresql real-time real-time-analytics rust serverless spark-streaming sql stream-processing

Last synced: 30 Dec 2024

https://github.com/RisingWaveLabs/risingwave

SQL stream processing, analytics, and management. We decouple storage and compute to offer instant failover, dynamic scaling, speedy bootstrapping, and efficient joins.

analytics big-data cloud-native data-engineering database distributed-database etl flink kafka ksqldb materialized-view postgres postgresql real-time real-time-analytics rust serverless spark-streaming sql stream-processing

Last synced: 01 Nov 2024

https://github.com/apache/zeppelin

Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.

big-data database flink java javascript nosql scala spark zeppelin

Last synced: 30 Dec 2024

https://github.com/zq2599/blog_demos

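The GitHub of 欣宸 (Xinchen), a programmer and CSDN blog expert. It holds a detailed categorization and summary of more than six hundred original articles, together with the corresponding source code, covering Java, Docker, Kubernetes, DevOps, and more.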

docker docker-java docker-jib flink java jenkins kubernetes kubernetes-java kubernetes-jenkins-maven spring spring-cloud spring-cloud-kubernetes springboot

Last synced: 31 Dec 2024

https://github.com/flink-china/flink-training-course

Flink video courses in Chinese (continuously updated...)

course flink streaming training

Last synced: 08 Nov 2024

https://github.com/water8394/flink-recommandsystem-demo

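:helicopter::rocket: A real-time product recommendation system built on Flink. Flink computes product popularity and stores it in a Redis cache, analyzes log data, and writes profile tags and real-time records to Hbase. When a user issues a recommendation request, the popularity ranking is re-sorted according to the user profile, and two recommendation modules (collaborative filtering and tag-based) attach related products to every product in the newly generated ranking; finally, the new user list is returned.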

flink flink-examples flink-hbase flink-kafka flink-redis recommand recommander-system

Last synced: 02 Jan 2025

https://github.com/water8394/flink-recommandSystem-demo

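:helicopter::rocket: A real-time product recommendation system built on Flink. Flink computes product popularity and stores it in a Redis cache, analyzes log data, and writes profile tags and real-time records to Hbase. When a user issues a recommendation request, the popularity ranking is re-sorted according to the user profile, and two recommendation modules (collaborative filtering and tag-based) attach related products to every product in the newly generated ranking; finally, the new user list is returned.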

flink flink-examples flink-hbase flink-kafka flink-redis recommand recommander-system

Last synced: 13 Sep 2024

https://github.com/dtstack/chunjun

A data integration framework

bigdata data-integration flink framework java

Last synced: 31 Dec 2024

https://github.com/DTStack/chunjun

A data integration framework

bigdata data-integration flink framework java

Last synced: 26 Oct 2024

https://github.com/alibaba/alink

Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.

apriori classification clustering data-mining feature-engineering flink flink-machine-learning flink-ml fm graph-algorithms graph-embedding kafka machine-learning recommender recommender-system regression statistics word2vec xgboost

Last synced: 31 Dec 2024

https://github.com/alibaba/Alink

Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.

apriori classification clustering data-mining feature-engineering flink flink-machine-learning flink-ml fm graph-algorithms graph-embedding kafka machine-learning recommender recommender-system regression statistics word2vec xgboost

Last synced: 26 Oct 2024

https://github.com/webankfintech/dataspherestudio

DataSphereStudio is a one-stop data application development & management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.

airflow atlas azkaban dataworks davinci dolphinscheduler flink governance griffin hadoop hive hue kettle linkis spark supperset tableau visualis workflow zeppelin

Last synced: 31 Dec 2024

https://github.com/WeBankFinTech/DataSphereStudio

DataSphereStudio is a one-stop data application development & management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.

airflow atlas azkaban dataworks davinci dolphinscheduler flink governance griffin hadoop hive hue kettle linkis spark supperset tableau visualis workflow zeppelin

Last synced: 26 Oct 2024

https://github.com/DataLinkDC/dinky

Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.

datalake datawarehouse flink flinkcdc flinksql olap real-time-computing-platform sql

Last synced: 30 Oct 2024

https://github.com/moran1607/bigdataguide

Big data learning: learn big data from scratch, including study videos and interview materials for each stage of big data learning.

bigdata flink flume hadoop hbase hive javase kafka scala spark zookeeper

Last synced: 01 Jan 2025

https://github.com/MoRan1607/BigDataGuide

Big data learning: learn big data from scratch, including study videos and interview materials for each stage of big data learning.

bigdata flink flume hadoop hbase hive javase kafka scala spark zookeeper

Last synced: 05 Nov 2024

https://github.com/apache/incubator-paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

big-data data-ingestion flink paimon real-time-analytics spark streaming-datalake table-store

Last synced: 18 Dec 2024

https://github.com/apache/paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

big-data data-ingestion flink paimon real-time-analytics spark streaming-datalake table-store

Last synced: 31 Dec 2024

https://github.com/lakesoul-io/LakeSoul

LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.

arrow big-data datafusion datalake flink huggingface lakehouse lakesoul postgresql python pytorch rust spark sql streaming vectorized velox

Last synced: 30 Oct 2024

https://github.com/lakesoul-io/lakesoul

LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.

arrow big-data datafusion datalake flink huggingface lakehouse lakesoul postgresql python pytorch rust spark sql streaming vectorized velox

Last synced: 01 Jan 2025

https://github.com/Qihoo360/Quicksql

A Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources

flink hive spark sql

Last synced: 30 Oct 2024

https://github.com/qihoo360/quicksql

A Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources

flink hive spark sql

Last synced: 03 Jan 2025

https://github.com/dtstack/flinkstreamsql

Extends real-time SQL on top of open-source Flink; mainly implements joins between streams and dimension tables, and supports all native Flink SQL syntax.

bigdata flink sql stream

Last synced: 03 Jan 2025

https://github.com/DTStack/flinkStreamSQL

Extends real-time SQL on top of open-source Flink; mainly implements joins between streams and dimension tables, and supports all native Flink SQL syntax.

bigdata flink sql stream

Last synced: 30 Oct 2024

https://github.com/birdlark/larkmidtable

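LarkMidTable is a one-stop open-source data middle platform that provides platform infrastructure, data governance, data development, monitoring and alerting, data services, and data visualization, efficiently empowering the data front end and delivering data services.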

datax flink flink-cdc flink-sql flinkx seatunnel

Last synced: 04 Jan 2025

https://github.com/alibaba/sreworks

Cloud Native DataOps & AIOps Platform | Cloud-native data-intelligence operations platform

aiops application cloudnative dataops devops engineering flink k8s kubernetes maintenance oam operation ops saas sre

Last synced: 03 Jan 2025

https://github.com/birdLark/LarkMidTable

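LarkMidTable is a one-stop open-source data middle platform that provides platform infrastructure, data governance, data development, monitoring and alerting, data services, and data visualization, efficiently empowering the data front end and delivering data services.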

datax flink flink-cdc flink-sql flinkx seatunnel

Last synced: 30 Oct 2024

https://github.com/zhp8341/flink-streaming-platform-web

A Flink-based web platform for real-time stream computing

flink flink-sql sql streaming

Last synced: 02 Jan 2025

https://github.com/alibaba/SREWorks

Cloud Native DataOps & AIOps Platform | Cloud-native data-intelligence operations platform

aiops application cloudnative dataops devops engineering flink k8s kubernetes maintenance oam operation ops saas sre

Last synced: 30 Oct 2024

https://github.com/bytedance/bitsail

BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data every day.

big-data data-integration data-lake data-pipeline data-synchronization flink high-performance real-time

Last synced: 04 Jan 2025

https://github.com/water8394/bigdata-interview

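:dart: :star2: [Big data interview questions] Big-data interview questions I collected from the web, along with my own answer summaries. Currently covers interview knowledge for the Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper frameworks.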

bigdata flink hadoop hbase hdfs interview interview-questions kafka mapreduce spark yarn

Last synced: 04 Jan 2025

https://github.com/water8394/BigData-Interview

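:dart: :star2: [Big data interview questions] Big-data interview questions I collected from the web, along with my own answer summaries. Currently covers interview knowledge for the Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper frameworks.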

bigdata flink hadoop hbase hdfs interview interview-questions kafka mapreduce spark yarn

Last synced: 30 Oct 2024

https://github.com/collabh/bigdata-growth

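A big data knowledge repository covering data warehouse modeling, real-time computing, big data, data middle platforms, system design, Java, algorithms, and more.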

bigdata bigdatalearning debezium flink hadoop hbase hdfs hive hudi kafka kudu mapreduce olap spark

Last synced: 02 Jan 2025

https://github.com/collabH/bigdata-growth

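A big data knowledge repository covering data warehouse modeling, real-time computing, big data, data middle platforms, system design, Java, algorithms, and more.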

bigdata bigdatalearning debezium flink hadoop hbase hdfs hive hudi kafka kudu mapreduce olap spark

Last synced: 31 Oct 2024

https://github.com/dtstack/taier

Taier is a big data development platform for submission, scheduling, operation and maintenance, and indicator information display

azkaban chunjun cronjob-scheduler dag data-schedule distributed-schedule-system flink hadoop hive job-scheduler scheduler spark task-schedule workflow-scheduling-system

Last synced: 02 Jan 2025

https://github.com/DTStack/Taier

Taier is a big data development platform for submission, scheduling, operation and maintenance, and indicator information display

azkaban chunjun cronjob-scheduler dag data-schedule distributed-schedule-system flink hadoop hive job-scheduler scheduler spark task-schedule workflow-scheduling-system

Last synced: 30 Oct 2024

https://github.com/uber-archive/AthenaX

SQL-based streaming analytics platform at scale

analytics calcite data flink sql stream streaming uber

Last synced: 30 Oct 2024

https://github.com/uber-archive/athenax

SQL-based streaming analytics platform at scale

analytics calcite data flink sql stream streaming uber

Last synced: 25 Sep 2024

https://github.com/datavane/tis

Supports agile DataOps based on Flink, DataX, Flink-CDC, and Chunjun, with a Web UI

cdc chunjun dataops datax etl flink flink-streaming java

Last synced: 03 Jan 2025

https://github.com/ververica/flink-sql-cookbook

The Apache Flink SQL Cookbook is a curated collection of examples, patterns, and use cases of Apache Flink SQL. Many of the recipes are completely self-contained and can be run in Ververica Platform as is.

apache-flink flink flink-sql sql stream-processing

Last synced: 12 Nov 2024

https://github.com/intsmaze/flink-boot

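The Lazy Squirrel (懒松鼠) Flink-Boot scaffold lets Flink fully embrace the Spring ecosystem, so that developers can build distributed stream-processing programs using the Java web development model; Lazy Squirrel makes crossing domains simpler. It aims to let developers write business code quickly at a lower onboarding cost (without needing to understand distributed computing theory or the details of the Flink framework). To further improve agility when building large projects with the scaffold, it integrates the Spring framework for bean management by default and bundles frameworks commonly used in microservices and web development, which speeds up development further. Examples include the MyBatis ORM framework, the Hibernate Validator validation framework, and the Spring Retry framework; see the scaffold features below for details.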

bigdata flink flink-boot java java-flink mcv mybatis sping spring-boot spring-retry

Last synced: 30 Oct 2024

https://github.com/apache/flink-kubernetes-operator

Apache Flink Kubernetes Operator

big-data flink java

Last synced: 05 Nov 2024

https://github.com/tugraph-family/tugraph-analytics

TuGraph Analytics is the fastest OLAP graph database.

analytics big-data calcite database flink graph java olap sql stream tugraph

Last synced: 01 Jan 2025

https://github.com/raray-chuan/xichuan_note

xichuan's study notes, covering Java, Spring, other commonly used Java frameworks, and big data components 📚

bigdata elk flink hadoop hbase hive java juc jvm kafaka kafka redis spark spring springcloud zabbix zookeeper

Last synced: 29 Dec 2024

https://github.com/spotify/featran

A Scala feature transformation library for data science and machine learning

algebird breeze data flink ml scala scalding scio spark tensorflow xgboost

Last synced: 03 Jan 2025

https://github.com/WeBankFinTech/Exchangis

Exchangis is a lightweight, highly extensible data exchange platform that supports data transmission between structured and unstructured heterogeneous data sources

dataspherestudio datax etl exchangis flink linkis sqoop transmission-engine wedatasphere

Last synced: 30 Oct 2024

https://github.com/harbby/sylph

Stream computing platform for bigdata

big-data flink java spark-streaming sql streamsql sylph

Last synced: 30 Dec 2024

https://github.com/apachecn/flink-doc-zh

Apache Flink documentation in Chinese

flink

Last synced: 04 Jan 2025

https://github.com/pierre94/flink-notes

Flink study notes

bigdata flink flink-notes flinkx

Last synced: 05 Nov 2024

https://github.com/flowerfine/scaleph

Open data platform based on Kubernetes. Scaleph supports SeaTunnel, Flink, and Doris, backed by the SeaTunnel on Flink engine, the Flink Kubernetes Operator, and the Doris operator.

dag data-platform dataops doris doris-manager doris-operator flink flink-kubernetes flink-kubernetes-operator flink-sql flink-sql-gateway seatunnel

Last synced: 05 Nov 2024

https://github.com/cubefs/compass

Compass is a task diagnosis platform for bigdata

airflow bigdata diagnose dolphinscheduler flink hadoop mapreduce scheduler spark sql

Last synced: 05 Jan 2025

https://github.com/itinycheng/flink-connector-clickhouse

Flink SQL connector for ClickHouse. Supports ClickHouseCatalog and reading/writing primary data, maps, and arrays to ClickHouse.

clickhouse connector flink flink-connector

Last synced: 05 Nov 2024

https://github.com/apache/flink-docker

Docker packaging for Apache Flink

docker flink

Last synced: 04 Jan 2025

https://github.com/lightbend/cloudflow

Cloudflow enables users to quickly develop, orchestrate, and operate distributed streaming applications on Kubernetes.

akka cloudflow flink kubernetes microservices-architectures spark streaming-applications streaming-data streaming-runtimes

Last synced: 03 Jan 2025

https://github.com/apache/flink-ml

Machine learning library of Apache Flink

big-data flink java machine-learning ml python

Last synced: 30 Dec 2024

https://github.com/kamu-data/kamu-cli

Next-generation decentralized data lakehouse and a multi-party stream processing network

blockchain data-as-code data-management data-science datafusion flink jupyter kamu open-data open-data-fabric spark sql

Last synced: 03 Jan 2025

https://github.com/binglind/alchemy

A web system built for Flink. Supports defining UDFs in the web UI and submitting SQL and JAR jobs; supports managing sources, sinks, and jobs; can manage Flink clusters on OpenShift.

flink jar sql stream udf ui web

Last synced: 30 Oct 2024

https://github.com/apache/doris-flink-connector

Flink Connector for Apache Doris

apache connector data-warehousing dbms doris flink mpp olap

Last synced: 05 Nov 2024

https://github.com/melin/superior-sql-parser

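A SQL parser for multiple databases, based on antlr4, that extracts metadata from SQL. It can be used in several data platform scenarios: extracting metadata from DDL statements, SQL permission checks, table-level lineage, SQL syntax validation, and so on. Supports Spark, Flink, Gauss, StarRocks, Oracle, MySQL, PostgreSQL, SQL Server, DB2, and others.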

flink gauss lineage metadata mysql parser postgres spark sql starrocks

Last synced: 05 Nov 2024

https://github.com/bytedance/cloudshuffleservice

Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.

flink hadoop-mapreduce spark

Last synced: 31 Dec 2024

https://github.com/luxiaoxun/eagle

Real time data processing system based on flink and CEP

cep complex-event-processing drools flink realtime-processing siddhi

Last synced: 30 Dec 2024

https://github.com/bytedance/CloudShuffleService

Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.

flink hadoop-mapreduce spark

Last synced: 05 Nov 2024

https://github.com/jeff-zou/flink-connector-redis

Asynchronous Flink connector based on Lettuce, supporting SQL join and sink, query caching, and debugging.

flink flink-connector flink-connector-redis flink-sql join lettuce redis

Last synced: 05 Nov 2024

https://github.com/huangfox/dpkb

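A compilation of big data material, including distributed storage engines, distributed compute engines, data warehouse construction, and more. Keywords: Hadoop, HBase, ES, Kudu, Hive, Presto, Spark, Flink, Kylin, ClickHouse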

flink hadoop hbase hive presto spark

Last synced: 30 Oct 2024

https://github.com/spotify/flink-on-k8s-operator

Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.

apache-beam apache-flink flink flink-operator kubernetes kubernetes-operator

Last synced: 04 Jan 2025

https://github.com/knaufk/flink-faker

A data generator source connector for Flink SQL based on data-faker.

apache-flink flink flink-sql

Last synced: 05 Nov 2024

https://github.com/flink-extended/flink-remote-shuffle

Remote Shuffle Service for Flink

flink shuffle

Last synced: 05 Nov 2024

https://github.com/getindata/flink-http-connector

HTTP Connector for Apache Flink. Provides sources and sinks for the DataStream, Table, and SQL APIs.

data-streaming flink flink-sql flink-stream-processing java

Last synced: 04 Jan 2025

https://github.com/nielsbasjes/logparser

Easy parsing of Apache HTTPD and NGINX access logs with Java, Hadoop, Hive, Flink, Beam, Storm, Drill, ...

apache beam drill flink hadoop hive httpd java logformat nginx parse parser

Last synced: 03 Jan 2025

https://github.com/ing-bank/flink-deployer

A tool that helps automate deployment to an Apache Flink cluster

apache-flink deployment docker flink go golang

Last synced: 08 Nov 2024

https://github.com/zuinnote/hadoopcryptoledger

Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive

bigdata bitcoin blockchain cryptoledger ethereum flink hadoop hive spark

Last synced: 02 Jan 2025

https://github.com/sansa-stack/sansa-stack

Big Data RDF Processing and Analytics Stack built on Apache Spark and Apache Jena http://sansa-stack.github.io/SANSA-Stack/

apache-jena apache-spark distributed-computing flink rdf semantic-web spark

Last synced: 03 Jan 2025

https://github.com/SANSA-Stack/SANSA-Stack

Big Data RDF Processing and Analytics Stack built on Apache Spark and Apache Jena http://sansa-stack.github.io/SANSA-Stack/

apache-jena apache-spark distributed-computing flink rdf semantic-web spark

Last synced: 20 Nov 2024

https://github.com/apache/flink-shaded

Apache Flink shaded artifacts repository

big-data flink java scala

Last synced: 02 Jan 2025

https://github.com/lichaojacobs/java_learning_practice

The road to advanced Java: frequently asked interview algorithms, akka, multithreading, NIO, Netty, SpringBoot, Spark && Flink, and more

algorithm flink java netty spark spring web

Last synced: 19 Dec 2024

https://github.com/HamaWhiteGG/flink-sql-security

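A Flink SQL data masking and row-level permission solution, with source code. It supports user-level data masking and row-level data access control, so that a given user can only access masked data or authorized rows. This is a solution for Flink in the real-time domain, analogous to the Row-level Filter and Column Masking features of Hive Ranger in offline data warehouses.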

flink flinksql security

Last synced: 05 Nov 2024

https://github.com/water8394/flink-simple-tutorial

:bell::pill: A simple Flink tutorial that exercises Flink's basic features, drawing on the examples in the official repository and on common scenarios

flink simple tutorials

Last synced: 10 Nov 2024