An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with flume

A curated list of projects in awesome lists tagged with flume .

https://github.com/wangzhiwubigdata/god-of-bigdata

专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...

azkaban bigdata flink flume hadoop hbase hdfs hive kafka spark zookeeper

Last synced: 10 Apr 2025

https://github.com/wangzhiwubigdata/God-Of-BigData

专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...

azkaban bigdata flink flume hadoop hbase hdfs hive kafka spark zookeeper

Last synced: 27 Mar 2025

https://github.com/moran1607/bigdataguide

大数据学习,从零开始学习大数据,包含大数据学习各阶段学习视频、面试资料

bigdata flink flume hadoop hbase hive javase kafka scala spark zookeeper

Last synced: 10 Apr 2025

https://github.com/MoRan1607/BigDataGuide

大数据学习,从零开始学习大数据,包含大数据学习各阶段学习视频、面试资料

bigdata flink flume hadoop hbase hive javase kafka scala spark zookeeper

Last synced: 04 Apr 2025

https://github.com/apache/logging-flume

Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-like data

apache api flume java jvm library

Last synced: 10 Apr 2025

https://github.com/apache/flume

Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-like data

apache api flume java jvm library

Last synced: 25 Mar 2025

https://github.com/chrisjpatty/flume

Extract logic from your apps with a user-friendly node editor powered by React.

flume javascript node react

Last synced: 13 Apr 2025

https://github.com/233zzh/TitanDataOperationSystem

最好的大数据项目。《Titan数据运营系统》,本项目是一个全栈闭环系统,我们有用作数据可视化的web系统,然后用flume-kafaka-flume进行日志的读取,在hive设计数仓,编写spark代码进行数仓表之间的转化以及ads层表到mysql的迁移,使用azkaban进行定时任务的调度,使用技术:Java/Scala语言,Hadoop、Spark、Hive、Kafka、Flume、Azkaban、SpringBoot,Bootstrap, Echart等;

azkaban flume hadoop hive kafka spark

Last synced: 27 Mar 2025

https://github.com/jcustenborder/flume-ng-rabbitmq

Flume plugin for RabbitMQ

flume rabbitmq

Last synced: 16 Mar 2025

https://github.com/v5tech/cloud

云计算之hadoop、hive、hue、oozie、sqoop、hbase、zookeeper环境搭建及配置文件

flume flume-ng hadoop hbase hive hue oozie pig sqoop zookeeper

Last synced: 18 Apr 2025

https://github.com/san089/cloudera_material

Cloudera_Material: Study Material to help people preparing for Cloudera CCA Spark and Hadoop Developer Exam (CCA175). Feel free to collaborate.

big-data bigdata cca cca175 certification cloudera flume hadoop hive hive-metastore pyspark spark sqoop sqoop-export sqoop-import sqoop-session

Last synced: 14 Feb 2025

https://github.com/29dch/real-time-log-analysis-system

:penguin:基于spark streaming+flume+kafka+hbase的实时日志处理分析系统(分为控制台版本和基于springboot、Echarts等的Web UI可视化版本)

echarts flume hbase java kafka scala spark-streaming spring-boot

Last synced: 11 Nov 2024

https://github.com/orangedrk/javanotes

Java后端学习笔记。包括Linux、maven、git、互联网架构、大数据体系等

flume git hadoop hbase hdfs hive javaee javase kafka linux mapreduce maven mybatis mycat rabbitmq redis spring spring-boot springcloud zookeeper

Last synced: 10 Apr 2025

https://github.com/zenoyang/web-click-flow

网站点击流离线日志分析

etl flume hadoop hive mapreduce sqoop

Last synced: 19 Apr 2025

https://github.com/zicat/tributary

tributary

big-data flume java kafka

Last synced: 06 Apr 2025

https://github.com/apache/logging-flume-spring-boot

Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-like data

apache api flume java jvm library

Last synced: 11 Feb 2025

https://github.com/apache/logging-flume-kafka

Apache Flume Kafka contains the Flume components that interact with Kafka.

apache api flume java jvm library

Last synced: 09 Apr 2025

https://github.com/lovnishverma/bigdataecosystem

Complete Big Data Ecosystem on Docker Desktop

bigdata docker flume hadoop hdfs hive mapreduce spark sqoop

Last synced: 11 Apr 2025

https://github.com/mplachter/ansible-role-flume

Ansible role for installing and configuring Apache Flume, always the use of multiple sinks and sources as well as sink groups.

ansible-role flume

Last synced: 27 Nov 2024

https://github.com/cosmo0920/flume-ng-fluentd-sink

Flume NG Sink plugin for Fluentd.

fluentd fluentd-sink flume flume-sink-plugin

Last synced: 12 Mar 2025

https://github.com/apache/logging-flume-hadoop

Apache Flume Hadoop provides various Flume components for the Hadoop ecosystem

apache api flume java jvm library

Last synced: 06 Mar 2025

https://github.com/worst001/note_bigdata

收录了大数据相关各类资料、笔记、手册

bigdata cdh datawarehouse development flink flume guide hadoop hbase hive learning markdown mkdocs note notebook spark

Last synced: 01 Mar 2025

https://github.com/aabouzaid/telegraf-flume-plugin

Telegraf plugin to collect Apache Flume metrics

apache-flume flume telegraf

Last synced: 30 Mar 2025

https://github.com/juanbono/flume-demo

Demostración de Apache Flume, para la asignatura Implementación de Bases de Datos NoSQL

demo flume

Last synced: 26 Mar 2025

https://github.com/hwywl/bigdata

大数据学习代码Spark、Hive、Storm、HBase

big-data flume hbase hdfs hive mr spark storm zook

Last synced: 25 Feb 2025

https://github.com/apache/logging-flume-jdbc

Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of event data

apache api flume java jvm library

Last synced: 06 Mar 2025

https://github.com/whiteccinn/stream-ng

Distributed stream computing service

distributed etcd flume go kafka mysql redis stream

Last synced: 22 Feb 2025

https://github.com/alokjani/bigdata-vagrant-devlab

Hadoop Software Development sandbox

centos flume hadoop hive pig sqoop zeppelin

Last synced: 05 Apr 2025

https://github.com/badruzzaman123/big-data-pipeline

Electrical Consumption Monitoring - Big Data Pipeline using Lambda Architecture in Python

apache-flink cassandra flume hadoop hbase java kafka-consumer kibana machine-learning mongodb scala spark-streaming sqoop tableau

Last synced: 30 Mar 2025

https://github.com/scraly/flume-bigquery-sink

An Apache Flume Sink implementation to publish data to Google BigQuery

bigquery flume sink

Last synced: 19 Apr 2025

https://github.com/apache/logging-flume-twitter

Apache Flume Twitter provides the Twitter Source for Apache Flume

apache api flume java jvm library

Last synced: 06 Mar 2025

https://github.com/shenruisi/zkfupload

upload your config file to zookeeper node, e.g. flume config file

flume flume-ng zookeeper zookeeper-client

Last synced: 15 Mar 2025

https://github.com/apache/logging-flume-search

Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of event data

apache api flume java jvm library

Last synced: 09 Apr 2025

https://github.com/aabouzaid/flume-formula

SaltStack formula for Apache Flume.

apache-flume flume saltstack saltstack-formula

Last synced: 30 Mar 2025

https://github.com/jcustenborder/kafka-connect-flume-avro

Kafka Connect Connector for receiving events from the Flume Avro Source plugin

avro-source flume kafka-connect

Last synced: 11 Mar 2025

https://github.com/apache/logging-flume-morphline

Apache Flume Morphline provides the Morphline Solr Sink for Apache Flume

apache api flume java jvm library

Last synced: 06 Mar 2025

https://github.com/vspiewak/flume-extensions

Flume extentions

flume java

Last synced: 17 Feb 2025

https://github.com/apache/logging-flume-legacy

Apache Flume Legacy provides the legacy Avro and Thrift Source for Apache Flume

apache api flume java jvm library

Last synced: 06 Mar 2025