Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/wangzhiwubigdata/god-of-bigdata

专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...
https://github.com/wangzhiwubigdata/god-of-bigdata

azkaban bigdata flink flume hadoop hbase hdfs hive kafka spark zookeeper

Last synced: 4 days ago
JSON representation

专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...

Awesome Lists containing this project

README

        







- 🔥 关注[**❤️公众号**](https://mmbiz.qpic.cn/mmbiz_png/UdK9ByfMT2P8ylc0r5wg2SEHk9LxuL6xRMiczG82lZNwzVmm6otcpd26yZIDqOvbTIeNb2FdYZYatHvsoH6TbVg/640?wx_fmt=png&wxfrom=5&wx_lazy=1&wx_co=1)

- 🔥 加我 [**👬好友**](https://mmbiz.qpic.cn/mmbiz_png/UdK9ByfMT2NLwWPfib27wK2aK6iaCicBu4VseYibyGQdiagT6zcMSTXbITlV5Wl3lq8T3icMt4ibtjGat0j4dqSnqf43A/640?wx_fmt=png&wxfrom=5&wx_lazy=1&wx_co=1)

- 🔥 关注 [**📺B站**](https://space.bilibili.com/22021870)

## 🚜 **大数据成神之路目录**

### ⛳ 上车前必读

- [❤️❤️全网最全大数据面试通关手册!❤️❤️](https://mp.weixin.qq.com/s/HJMolm5vMDbGWmprrxkzvA)
- 图片打不开,点[这里](https://blog.csdn.net/u013411339/article/details/113097759)
- 👉 如果你不知道自己要从哪开始、要学什么,请看:[**《八千里路云和月|从零到大数据专家学习路径指南》**](https://mp.weixin.qq.com/s/MAwD-UJgvIa_dZjmaykqrQ)
这是一个大数据专家/架构师提升自身技术栈广度和深度的经验之路,也是我的亲身学习经历汇总!
- 👉 如果你对某一个框架的学习路径有疑惑🤔,请看下面:
- [我们在学习Flink的时候,到底在学习什么?](https://mp.weixin.qq.com/s/xh4SEX9t-fRVdoiAl0KKSQ)
- [我们在学习Spark的时候,到底在学习什么?](https://mp.weixin.qq.com/s/pN0AqNJuFnlLjNW2OonGtA)
- [我们在学习Kafka的时候,到底在学习什么?](https://mp.weixin.qq.com/s/AurKLpHB0XWgbBat-M7Urw)

------------

### ⛳ 大数据开发基础篇
| :ski:Java基础| :memo:NIO|:book:并发|:guitar:JVM|:dollar:分布式|:floppy_disk:Zookeeper|:punch:RPC|:art:Netty|:computer:Linux|
| :------:| :------: | :------: |:------: |:------: |:------: |:------: |:------: |:------:|
| [Java基础](#一Java基础) | [NIO](#二NIO基础) | [并发容器](#三Java并发容器) |[JVM](#四JVM深度解析和面试点) |[分布式](#五分布式理论基础和原理) |[zookeeper](#六大数据框架开发基础-zookeeper)|[RPC](#七大数据框架开发基础-RPC)|[Netty](#八大数据框架基石之网路通信-Netty)|[Linux](/Linux基础/Linux基础和命令.md)|

------------
### ⛳ 大数据框架学习篇











Hadoop
Hive
Spark
Flink
HBase
Kafka
Zookeeper



----------------
### ⛳ 大数据开发实战进阶篇

这里的文章主要是我平时发表在公众号,博客等的文章,精心挑选,以飨读者。








Flink实战进阶
Spark实战进阶
Kafka实战进阶
OLAP实战进阶



------------------------------------
### ⛳ 大数据开发面试篇






面试系列合集
大数据算法



------------------------------------
### ⛳ 个人公众号:octocat: 大数据精品文章合集







2020精品文章合集
2021精品文章合集
硬刚系列文章合集



------------------------------------
### ⛳ 高屋建瓴总结篇

- [Hadoop系统性总结(知识星球读者专享)](https://mp.weixin.qq.com/s/HJMolm5vMDbGWmprrxkzvA)
- [Hive系统性总结(知识星球读者专享)](https://mp.weixin.qq.com/s/HJMolm5vMDbGWmprrxkzvA)
- [Spark系统性总结(知识星球读者专享)](https://mp.weixin.qq.com/s/HJMolm5vMDbGWmprrxkzvA)
- [Flink系统性总结(知识星球读者专享)](https://mp.weixin.qq.com/s/HJMolm5vMDbGWmprrxkzvA)
- [Hbase系统性总结(知识星球读者专享)](https://mp.weixin.qq.com/s/HJMolm5vMDbGWmprrxkzvA)
- [Kafka系统性总结(知识星球读者专享)](https://mp.weixin.qq.com/s/HJMolm5vMDbGWmprrxkzvA)

---------------------------------------

## `第一部分: 大数据开发基础篇`

### 一、Java基础

##### 📚 1.1 Java系统性学习

- [《Java基础系统性学习专栏》](https://blog.csdn.net/u013411339/category_11681255.html)

##### 分类导航

* [大数据成神之路-Java高级特性增强(多线程)](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA/%E5%A4%A7%E6%95%B0%E6%8D%AE%E6%88%90%E7%A5%9E%E4%B9%8B%E8%B7%AF-Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA(%E5%A4%9A%E7%BA%BF%E7%A8%8B).md)
* [大数据成神之路-Java高级特性增强(Synchronized关键字)](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA/%E5%A4%A7%E6%95%B0%E6%8D%AE%E6%88%90%E7%A5%9E%E4%B9%8B%E8%B7%AF-Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA(Synchronized%E5%85%B3%E9%94%AE%E5%AD%97).md)
* [大数据成神之路-Java高级特性增强(volatile关键字)](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA/%E5%A4%A7%E6%95%B0%E6%8D%AE%E6%88%90%E7%A5%9E%E4%B9%8B%E8%B7%AF-Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA(volatile%E5%85%B3%E9%94%AE%E5%AD%97).md)
* [大数据成神之路-Java高级特性增强(锁)](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA/%E5%A4%A7%E6%95%B0%E6%8D%AE%E6%88%90%E7%A5%9E%E4%B9%8B%E8%B7%AF-Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA(%E9%94%81).md)
* [大数据成神之路-Java高级特性增强(ArrayList/Vector)](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA/%E5%A4%A7%E6%95%B0%E6%8D%AE%E6%88%90%E7%A5%9E%E4%B9%8B%E8%B7%AF-Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA(%E9%9B%86%E5%90%88%E6%A1%86%E6%9E%B6).md)
* [大数据成神之路-Java高级特性增强(LinkedList)](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA/%E5%A4%A7%E6%95%B0%E6%8D%AE%E6%88%90%E7%A5%9E%E4%B9%8B%E8%B7%AF-Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA.md)
* [大数据成神之路-Java高级特性增强(HashMap)](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA/%E5%A4%A7%E6%95%B0%E6%8D%AE%E6%88%90%E7%A5%9E%E4%B9%8B%E8%B7%AF-Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA(HashMap).md)
* [大数据成神之路-Java高级特性增强(HashSet)](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA/%E5%A4%A7%E6%95%B0%E6%8D%AE%E6%88%90%E7%A5%9E%E4%B9%8B%E8%B7%AF-Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA(HashSet).md)
* [大数据成神之路-Java高级特性增强(LinkedHashMap)](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA/%E5%A4%A7%E6%95%B0%E6%8D%AE%E6%88%90%E7%A5%9E%E4%B9%8B%E8%B7%AF-Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA(LinkedHashMap).md)

### 二、NIO基础

##### 📚 2.1 Netty系统系学习

- [《Netty系统系学习专栏》](https://blog.csdn.net/u013411339/category_11681261.html)

##### 📚 2.2 分类导航

* [大数据成神之路-Java高级特性增强-NIO大纲](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA/%E5%A4%A7%E6%95%B0%E6%8D%AE%E6%88%90%E7%A5%9E%E4%B9%8B%E8%B7%AF-Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA-NIO.md)
* [NIO概览](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA/NIO%E6%A6%82%E8%A7%88.md)
* [Java NIO之Buffer(缓冲区)](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA/Java%20NIO%E4%B9%8BBuffer(%E7%BC%93%E5%86%B2%E5%8C%BA).md)
* [Java NIO之Channel(通道)](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA/Java%20NIO%E4%B9%8BChannel(%E9%80%9A%E9%81%93).md)
* [ava NIO之Selector(选择器)](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA/Java%20NIO%E4%B9%8BSelector(%E9%80%89%E6%8B%A9%E5%99%A8).md)
* [Java NIO之拥抱Path和Files](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA/Java%20NIO%E4%B9%8B%E6%8B%A5%E6%8A%B1Path%E5%92%8CFiles.md)

### 三、Java并发容器

##### 📚 3.1 系统性学习

- [《多线程&并发容器学习专栏》](https://blog.csdn.net/u013411339/category_11681268.html)

##### 3.2 分类导航

* [大数据成神之路-Java高级特性增强(并发容器大纲)](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/%E5%B9%B6%E5%8F%91%E5%AE%B9%E5%99%A8/%E5%A4%A7%E6%95%B0%E6%8D%AE%E6%88%90%E7%A5%9E%E4%B9%8B%E8%B7%AF-Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA(%E5%B9%B6%E5%8F%91%E5%AE%B9%E5%99%A8%E5%A4%A7%E7%BA%B2).md)
* [大数据成神之路-Java高级特性增强(LinkedBlockingQueue)](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/%E5%B9%B6%E5%8F%91%E5%AE%B9%E5%99%A8/%E5%A4%A7%E6%95%B0%E6%8D%AE%E6%88%90%E7%A5%9E%E4%B9%8B%E8%B7%AF-Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA(LinkedBlockingQueue).md)
* [大数据成神之路-Java高级特性增强(LinkedBlockingDeque)](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/%E5%B9%B6%E5%8F%91%E5%AE%B9%E5%99%A8/%E5%A4%A7%E6%95%B0%E6%8D%AE%E6%88%90%E7%A5%9E%E4%B9%8B%E8%B7%AF-Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA(LinkedBlockingDeque).md)
* [大数据成神之路-Java高级特性增强(CopyOnWriteArraySet)](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/%E5%B9%B6%E5%8F%91%E5%AE%B9%E5%99%A8/%E5%A4%A7%E6%95%B0%E6%8D%AE%E6%88%90%E7%A5%9E%E4%B9%8B%E8%B7%AF-Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA(CopyOnWriteArraySet).md)
* [大数据成神之路-Java高级特性增强(CopyOnWriteArrayList)](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/%E5%B9%B6%E5%8F%91%E5%AE%B9%E5%99%A8/%E5%A4%A7%E6%95%B0%E6%8D%AE%E6%88%90%E7%A5%9E%E4%B9%8B%E8%B7%AF-Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA(CopyOnWriteArrayList).md)
* [大数据成神之路-Java高级特性增强(ConcurrentSkipListSet)](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/%E5%B9%B6%E5%8F%91%E5%AE%B9%E5%99%A8/%E5%A4%A7%E6%95%B0%E6%8D%AE%E6%88%90%E7%A5%9E%E4%B9%8B%E8%B7%AF-Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA(ConcurrentSkipListSet).md)
* [大数据成神之路-Java高级特性增强(ConcurrentSkipListMap)](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/%E5%B9%B6%E5%8F%91%E5%AE%B9%E5%99%A8/%E5%A4%A7%E6%95%B0%E6%8D%AE%E6%88%90%E7%A5%9E%E4%B9%8B%E8%B7%AF-Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA(ConcurrentSkipListMap).md)
* [大数据成神之路-Java高级特性增强(ConcurrentLinkedQueue)](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/%E5%B9%B6%E5%8F%91%E5%AE%B9%E5%99%A8/%E5%A4%A7%E6%95%B0%E6%8D%AE%E6%88%90%E7%A5%9E%E4%B9%8B%E8%B7%AF-Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA(ConcurrentLinkedQueue).md)
* [大数据成神之路-Java高级特性增强(ConcurrentHashMap)](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/%E5%B9%B6%E5%8F%91%E5%AE%B9%E5%99%A8/%E5%A4%A7%E6%95%B0%E6%8D%AE%E6%88%90%E7%A5%9E%E4%B9%8B%E8%B7%AF-Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA(ConcurrentHashMap).md)
* [大数据成神之路-Java高级特性增强(ArrayBlockingQueue)](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/%E5%B9%B6%E5%8F%91%E5%AE%B9%E5%99%A8/%E5%A4%A7%E6%95%B0%E6%8D%AE%E6%88%90%E7%A5%9E%E4%B9%8B%E8%B7%AF-Java%E9%AB%98%E7%BA%A7%E7%89%B9%E6%80%A7%E5%A2%9E%E5%BC%BA(ArrayBlockingQueue).md)

### 四、JVM深度解析和面试点

##### 📚 4.1 系统性学习

- [《JVM学习专栏》](https://blog.csdn.net/u013411339/category_11681275.html)

##### 4.2 分类导航

* [JVM内存结构](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/JVM/JVM%E5%86%85%E5%AD%98%E7%BB%93%E6%9E%84.md)
* [HotSpot虚拟机对象探秘](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/JVM/HotSpot%E8%99%9A%E6%8B%9F%E6%9C%BA%E5%AF%B9%E8%B1%A1%E6%8E%A2%E7%A7%98.md)
* [垃圾收集策略与算法](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/JVM/%E5%9E%83%E5%9C%BE%E6%94%B6%E9%9B%86%E7%AD%96%E7%95%A5%E4%B8%8E%E7%AE%97%E6%B3%95.md)
* [HotSpot垃圾收集器](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/JVM/HotSpot%E5%9E%83%E5%9C%BE%E6%94%B6%E9%9B%86%E5%99%A8.md)
* [内存分配与回收策略](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/JVM/%E5%86%85%E5%AD%98%E5%88%86%E9%85%8D%E4%B8%8E%E5%9B%9E%E6%94%B6%E7%AD%96%E7%95%A5.md)
* [JVM性能调优](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/JVM/JVM%20%E6%80%A7%E8%83%BD%E8%B0%83%E4%BC%98.md)
* [类文件结构](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/JVM/%E7%B1%BB%E6%96%87%E4%BB%B6%E7%BB%93%E6%9E%84.md)
* [类加载的时机](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/JVM/%E7%B1%BB%E5%8A%A0%E8%BD%BD%E7%9A%84%E6%97%B6%E6%9C%BA.md)
* [类加载的过程](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/JVM/%E7%B1%BB%E5%8A%A0%E8%BD%BD%E7%9A%84%E8%BF%87%E7%A8%8B.md)
* [类加载器](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/JVM/%E7%B1%BB%E5%8A%A0%E8%BD%BD%E5%99%A8.md)

##### 4.3 JVM 详解

* [java类的加载机制](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/JVM/jvm%E7%B3%BB%E5%88%97(%E4%B8%80)java%E7%B1%BB%E7%9A%84%E5%8A%A0%E8%BD%BD%E6%9C%BA%E5%88%B6.md)
* [JVM内存结构](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/JVM/jvm%E7%B3%BB%E5%88%97(%E4%BA%8C)JVM%E5%86%85%E5%AD%98%E7%BB%93%E6%9E%84.md)
* [GC算法 垃圾收集器](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/JVM/jvm%E7%B3%BB%E5%88%97(%E4%B8%89)GC%E7%AE%97%E6%B3%95%20%E5%9E%83%E5%9C%BE%E6%94%B6%E9%9B%86%E5%99%A8.md)
* [jvm调优-命令大全](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/JVM/jvm%E7%B3%BB%E5%88%97(%E5%9B%9B)jvm%E8%B0%83%E4%BC%98-%E5%91%BD%E4%BB%A4%E5%A4%A7%E5%85%A8%EF%BC%88jps%20jstat%20jmap%20jhat%20jstack%20jinfo%EF%BC%89.md)
* [Java GC 分析](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/JVM/jvm%E7%B3%BB%E5%88%97(%E4%BA%94)Java%20GC%20%E5%88%86%E6%9E%90.md)

### 五、分布式理论基础和原理

* [分布式系统的一些基本概念](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/%E5%88%86%E5%B8%83%E5%BC%8F%E7%90%86%E8%AE%BA/%E5%88%86%E5%B8%83%E5%BC%8F%E7%B3%BB%E7%BB%9F%E7%9A%84%E4%B8%80%E4%BA%9B%E5%9F%BA%E6%9C%AC%E6%A6%82%E5%BF%B5.md)
* [分布式系统理论基础一: 一致性、2PC和3PC](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/%E5%88%86%E5%B8%83%E5%BC%8F%E7%90%86%E8%AE%BA/%E5%88%86%E5%B8%83%E5%BC%8F%E7%B3%BB%E7%BB%9F%E7%90%86%E8%AE%BA%E5%9F%BA%E7%A1%80%E4%B8%80%EF%BC%9A%20%E4%B8%80%E8%87%B4%E6%80%A7%E3%80%812PC%E5%92%8C3PC.md)
* [分布式系统理论基础二-CAP](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/%E5%88%86%E5%B8%83%E5%BC%8F%E7%90%86%E8%AE%BA/%E5%88%86%E5%B8%83%E5%BC%8F%E7%B3%BB%E7%BB%9F%E7%90%86%E8%AE%BA%E5%9F%BA%E7%A1%80%E4%BA%8C-CAP.md)
* [分布式系统理论基础三-时间、时钟和事件顺序](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/%E5%88%86%E5%B8%83%E5%BC%8F%E7%90%86%E8%AE%BA/%E5%88%86%E5%B8%83%E5%BC%8F%E7%B3%BB%E7%BB%9F%E7%90%86%E8%AE%BA%E5%9F%BA%E7%A1%80%E4%B8%89-%E6%97%B6%E9%97%B4%E3%80%81%E6%97%B6%E9%92%9F%E5%92%8C%E4%BA%8B%E4%BB%B6%E9%A1%BA%E5%BA%8F.md)
* [分布式系统理论进阶 - Paxos](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/%E5%88%86%E5%B8%83%E5%BC%8F%E7%90%86%E8%AE%BA/%E5%88%86%E5%B8%83%E5%BC%8F%E7%B3%BB%E7%BB%9F%E7%90%86%E8%AE%BA%E8%BF%9B%E9%98%B6%20-%20Paxos.md)
* [分布式系统理论进阶 - Raft、Zab](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/%E5%88%86%E5%B8%83%E5%BC%8F%E7%90%86%E8%AE%BA/%E5%88%86%E5%B8%83%E5%BC%8F%E7%B3%BB%E7%BB%9F%E7%90%86%E8%AE%BA%E8%BF%9B%E9%98%B6%20-%20Raft%E3%80%81Zab.md)
* [分布式系统理论进阶:选举、多数派和租约](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/%E5%88%86%E5%B8%83%E5%BC%8F%E7%90%86%E8%AE%BA/%E5%88%86%E5%B8%83%E5%BC%8F%E7%B3%BB%E7%BB%9F%E7%90%86%E8%AE%BA%E8%BF%9B%E9%98%B6%EF%BC%9A%E9%80%89%E4%B8%BE%E3%80%81%E5%A4%9A%E6%95%B0%E6%B4%BE%E5%92%8C%E7%A7%9F%E7%BA%A6.md)
* [分布式锁的解决方案](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/%E5%88%86%E5%B8%83%E5%BC%8F%E7%90%86%E8%AE%BA/%E5%88%86%E5%B8%83%E5%BC%8F%E9%94%81%E7%9A%84%E8%A7%A3%E5%86%B3%E6%96%B9%E6%A1%88.md)
* [分布式锁的解决方案(二)](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/%E5%88%86%E5%B8%83%E5%BC%8F%E7%90%86%E8%AE%BA/%E5%88%86%E5%B8%83%E5%BC%8F%E9%94%81%E7%9A%84%E8%A7%A3%E5%86%B3%E6%96%B9%E6%A1%88(%E4%BA%8C).md)
* [分布式事务的解决方案](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/%E5%88%86%E5%B8%83%E5%BC%8F%E7%90%86%E8%AE%BA/%E5%88%86%E5%B8%83%E5%BC%8F%E4%BA%8B%E5%8A%A1%E7%9A%84%E8%A7%A3%E5%86%B3%E6%96%B9%E6%A1%88.md)
* [分布式ID生成器解决方案](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/%E5%88%86%E5%B8%83%E5%BC%8F%E7%90%86%E8%AE%BA/%E5%88%86%E5%B8%83%E5%BC%8FID%E7%94%9F%E6%88%90%E5%99%A8%E8%A7%A3%E5%86%B3%E6%96%B9%E6%A1%88.md)

### 六、大数据框架开发基础-Zookeeper

* [安装和运行](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/zookeeeper/zk%E5%AE%89%E8%A3%85%E5%92%8C%E8%BF%90%E8%A1%8C.md)
* [zookeeper服务](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/zookeeeper/zk%E6%9C%8D%E5%8A%A1.md)
* [zookeeper应用程序](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/zookeeeper/ZooKeeper%E5%BA%94%E7%94%A8%E7%A8%8B%E5%BA%8F.md)
* [zookeeper开发实例](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/zookeeeper/zk%E5%BC%80%E5%8F%91%E5%AE%9E%E4%BE%8B.md)
* [zookeeper集群构建](http://www.importnew.com/23237.html)

### 七、大数据框架开发基础-RPC

* [RPC简单介绍](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/RPC/RPC%E7%AE%80%E5%8D%95%E4%BB%8B%E7%BB%8D.md)
* [RPC的原理和框架](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/RPC/RPC%E7%9A%84%E5%8E%9F%E7%90%86%E5%92%8C%E6%A1%86%E6%9E%B6.md)
* [手把手教你实现一个简单的RPC](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/RPC/%E6%89%8B%E6%8A%8A%E6%89%8B%E6%95%99%E4%BD%A0%E5%AE%9E%E7%8E%B0%E4%B8%80%E4%B8%AA%E7%AE%80%E5%8D%95%E7%9A%84RPC.md)

### 八、大数据框架基石之网路通信-Netty

* [关于Netty我们都需要知道什么](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Netty/%E5%85%B3%E4%BA%8ENetty%E6%88%91%E4%BB%AC%E9%83%BD%E9%9C%80%E8%A6%81%E7%9F%A5%E9%81%93%E4%BB%80%E4%B9%88.md)
* [Netty源码解析-概述篇](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Netty/Netty%E6%BA%90%E7%A0%81%E8%A7%A3%E6%9E%90-%E6%A6%82%E8%BF%B0%E7%AF%87.md)
* [Netty源码解析1-Buffer](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Netty/Netty%E6%BA%90%E7%A0%81%E8%A7%A3%E6%9E%901-Buffer.md)
* [Netty源码解析2-Reactor](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Netty/Netty%E6%BA%90%E7%A0%81%E8%A7%A3%E6%9E%902-Reactor.md)
* [Netty源码解析3-Pipeline](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Netty/Netty%E6%BA%90%E7%A0%81%E8%A7%A3%E6%9E%903-Pipeline.md)
* [Netty源码解析4-Handler综述](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Netty/Netty%E6%BA%90%E7%A0%81%E8%A7%A3%E6%9E%904-Handler%E7%BB%BC%E8%BF%B0.md)
* [Netty源码解析5-ChannelHandler](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Netty/Netty%E6%BA%90%E7%A0%81%E8%A7%A3%E6%9E%905-ChannelHandler.md)
* [Netty源码解析6-ChannelHandler实例之LoggingHandler](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Netty/Netty%E6%BA%90%E7%A0%81%E8%A7%A3%E6%9E%906-ChannelHandler%E5%AE%9E%E4%BE%8B%E4%B9%8BLoggingHandler.md)
* [Netty源码解析7-ChannelHandler实例之TimeoutHandler](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Netty/Netty%E6%BA%90%E7%A0%81%E8%A7%A3%E6%9E%907-ChannelHandler%E5%AE%9E%E4%BE%8B%E4%B9%8BTimeoutHandler.md)
* [Netty源码解析8-ChannelHandler实例之CodecHandler](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Netty/Netty%E6%BA%90%E7%A0%81%E8%A7%A3%E6%9E%908-ChannelHandler%E5%AE%9E%E4%BE%8B%E4%B9%8BCodecHandler.md)
* [Netty源码解析9-ChannelHandler实例之MessageToByteEncoder](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Netty/Netty%E6%BA%90%E7%A0%81%E8%A7%A3%E6%9E%909-ChannelHandler%E5%AE%9E%E4%BE%8B%E4%B9%8BMessageToByteEncoder.md)

## 第二部分:大数据框架学习篇

### 一、Hadoop

##### 📚 1.1 Hadoop系统性总结

- [Hadoop系统性总结(知识星球读者专享)](https://mp.weixin.qq.com/s/HJMolm5vMDbGWmprrxkzvA)

##### 📚 1.2 系统性学习

[Hadoop学习专栏](https://blog.csdn.net/u013411339/category_11681229.html)

##### 1.3 分类导航

1. [分布式文件存储系统 —— HDFS](大数据框架学习/Hadoop-HDFS.md)
2. [分布式计算框架 —— MapReduce](大数据框架学习/Hadoop-MapReduce.md)
3. [集群资源管理器 —— YARN](大数据框架学习/Hadoop-YARN.md)
4. [Hadoop 单机伪集群环境搭建](大数据框架学习/installation/Hadoop单机环境搭建.md)
5. [Hadoop 集群环境搭建](大数据框架学习/installation/Hadoop集群环境搭建.md)
6. [HDFS 常用 Shell 命令](大数据框架学习/HDFS常用Shell命令.md)
7. [HDFS Java API 的使用](大数据框架学习/HDFS-Java-API.md)
8. [基于 Zookeeper 搭建 Hadoop 高可用集群](大数据框架学习/installation/基于Zookeeper搭建Hadoop高可用集群.md)
9. [Hadoop级简入门](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Hadoop/Hadoop%E6%9E%81%E7%AE%80%E5%85%A5%E9%97%A8.md)
10. [MapReduce编程模型和计算框架架构原理](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Hadoop/MapReduce%E7%BC%96%E7%A8%8B%E6%A8%A1%E5%9E%8B%E5%92%8C%E8%AE%A1%E7%AE%97%E6%A1%86%E6%9E%B6%E6%9E%B6%E6%9E%84%E5%8E%9F%E7%90%86.md)

### 二、Hive

##### 📚 2.1 Hive系统性总结

[Hive系统性总结(知识星球读者专享)](https://mp.weixin.qq.com/s/HJMolm5vMDbGWmprrxkzvA)

##### 📚 2.2 系统性学习

[Hive学习专栏](https://blog.csdn.net/u013411339/category_11681281.html)

##### 📚 2.3 分类导航

1. [Hive 简介及核心概念](大数据框架学习/Hive简介及核心概念.md)
2. [Linux 环境下 Hive 的安装部署](大数据框架学习/installation/Linux环境下Hive的安装部署.md)
4. [Hive CLI 和 Beeline 命令行的基本使用](大数据框架学习/HiveCLI和Beeline命令行的基本使用.md)
6. [Hive 常用 DDL 操作](大数据框架学习/Hive常用DDL操作.md)
7. [Hive 分区表和分桶表](大数据框架学习/Hive分区表和分桶表.md)
8. [Hive 视图和索引](大数据框架学习/Hive视图和索引.md)
9. [Hive常用 DML 操作](大数据框架学习/Hive常用DML操作.md)
10. [Hive 数据查询详解](大数据框架学习/Hive数据查询详解.md)

### 三、Spark

##### 📚 3.1 Spark系统性总结

- [Spark系统性总结(知识星球读者专享)](https://mp.weixin.qq.com/s/HJMolm5vMDbGWmprrxkzvA)

#### 📚 3.2 系统性学习

- [Spark学习专栏](https://blog.csdn.net/u013411339/category_11681283.html)

**Spark Core :**

1. [Spark 简介](大数据框架学习/Spark简介.md)
2. [Spark 开发环境搭建](大数据框架学习/installation/Spark开发环境搭建.md)
4. [弹性式数据集 RDD](大数据框架学习/Spark_RDD.md)
5. [RDD 常用算子详解](大数据框架学习/Spark_Transformation和Action算子.md)
5. [Spark 运行模式与作业提交](大数据框架学习/Spark部署模式与作业提交.md)
6. [Spark 累加器与广播变量](大数据框架学习/Spark累加器与广播变量.md)
7. [基于 Zookeeper 搭建 Spark 高可用集群](大数据框架学习/installation/Spark集群环境搭建.md)

**Spark SQL :**

1. [DateFrame 和 DataSet ](大数据框架学习/SparkSQL_Dataset和DataFrame简介.md)
2. [Structured API 的基本使用](大数据框架学习/Spark_Structured_API的基本使用.md)
3. [Spark SQL 外部数据源](大数据框架学习/SparkSQL外部数据源.md)
4. [Spark SQL 常用聚合函数](大数据框架学习/SparkSQL常用聚合函数.md)
5. [Spark SQL JOIN 操作](大数据框架学习/SparkSQL联结操作.md)

**Spark Streaming :**

1. [Spark Streaming 简介](大数据框架学习/Spark_Streaming与流处理.md)
2. [Spark Streaming 基本操作](大数据框架学习/Spark_Streaming基本操作.md)
3. [Spark Streaming 整合 Flume](大数据框架学习/Spark_Streaming整合Flume.md)
4. [Spark Streaming 整合 Kafka](大数据框架学习/Spark_Streaming整合Kafka.md)

## 四、Flink

##### 📚 4.1 Flink系统性总结

- [Flink系统性总结(知识星球读者专享)](https://mp.weixin.qq.com/s/HJMolm5vMDbGWmprrxkzvA)

#### 📚 4.2 系统性学习

- [Flink系统学习专栏](https://blog.csdn.net/u013411339/category_11681289.html)

#### 📚 2.3 分类导航

1. [Flink 核心概念综述](大数据框架学习/Flink核心概念综述.md)
2. [Flink 开发环境搭建](大数据框架学习/Flink开发环境搭建.md)
3. [Flink Data Source](大数据框架学习/Flink_Data_Source.md)
4. [Flink Data Transformation](大数据框架学习/Flink_Data_Transformation.md)
4. [Flink Data Sink](大数据框架学习/Flink_Data_Sink.md)
6. [Flink 窗口模型](大数据框架学习/Flink_Windows.md)
7. [Flink 状态管理与检查点机制](大数据框架学习/Flink状态管理与检查点机制.md)
8. [Flink Standalone 集群部署](大数据框架学习/installation/Flink_Standalone_Cluster.md)

#### Flink当前最火的实时计算引擎-入门篇

* [Flink从入门到放弃(入门篇1)-Flink是什么](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Flink/Flink%E4%BB%8E%E5%85%A5%E9%97%A8%E5%88%B0%E6%94%BE%E5%BC%83(%E5%85%A5%E9%97%A8%E7%AF%871)-Flink%E6%98%AF%E4%BB%80%E4%B9%88%EF%BC%9F.md)
* [Flink从入门到放弃(入门篇2)-本地环境搭建&构建第一个Flink应用](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Flink/Flink%E4%BB%8E%E5%85%A5%E9%97%A8%E5%88%B0%E6%94%BE%E5%BC%83(%E5%85%A5%E9%97%A8%E7%AF%872)-%E6%9C%AC%E5%9C%B0%E7%8E%AF%E5%A2%83%E6%90%AD%E5%BB%BA%26%E6%9E%84%E5%BB%BA%E7%AC%AC%E4%B8%80%E4%B8%AAFlink%E5%BA%94%E7%94%A8.md)
* [Flink从入门到放弃(入门篇3)-DataSetAPI](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Flink/Flink%E4%BB%8E%E5%85%A5%E9%97%A8%E5%88%B0%E6%94%BE%E5%BC%83(%E5%85%A5%E9%97%A8%E7%AF%873)-DataSetAPI.md)
* [Flink从入门到放弃(入门篇4)-DataStreamAPI](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Flink/Flink%E4%BB%8E%E5%85%A5%E9%97%A8%E5%88%B0%E6%94%BE%E5%BC%83(%E5%85%A5%E9%97%A8%E7%AF%874)-DataStreamAPI.md)
* [Flink集群部署](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Flink/Flink%E9%9B%86%E7%BE%A4%E9%83%A8%E7%BD%B2.md)
* [Flink重启策略](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Flink/6-Flink%E9%87%8D%E5%90%AF%E7%AD%96%E7%95%A5.md)
* [Flink的分布式缓存](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Flink/7-Flink%E7%9A%84%E5%88%86%E5%B8%83%E5%BC%8F%E7%BC%93%E5%AD%98.md)
* [Flink中的窗口](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Flink/8-Flink%E4%B8%AD%E7%9A%84%E7%AA%97%E5%8F%A3.md)
* [Flink中的Time](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Flink/9-Flink%E4%B8%AD%E7%9A%84Time.md)
* [Flink集群搭建的HA](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Flink/10-Flink%E9%9B%86%E7%BE%A4%E7%9A%84%E9%AB%98%E5%8F%AF%E7%94%A8(%E6%90%AD%E5%BB%BA%E7%AF%87%E8%A1%A5%E5%85%85).md)
* [Flink中的时间戳和水印](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Flink/11-%E6%97%B6%E9%97%B4%E6%88%B3%E5%92%8C%E6%B0%B4%E5%8D%B0.md)
* [Flink广播变量](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Flink/12-Broadcast%E5%B9%BF%E6%92%AD%E5%8F%98%E9%87%8F.md)
* [Flink-Kafka-Connector](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Flink/13-Flink-Kafka-Connector.md)
* [Flink-Table-&-SQL实战](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Flink/14-Flink-Table-%26-SQL.md)
* [15-Flink实战项目之实时热销排行](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Flink/15-Flink%E5%AE%9E%E6%88%98%E9%A1%B9%E7%9B%AE%E4%B9%8B%E5%AE%9E%E6%97%B6%E7%83%AD%E9%94%80%E6%8E%92%E8%A1%8C.md)
* [16-Flink-Redis-Sink](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Flink/16-Flink-Redis-Sink.md)
* [17-Flink消费Kafka写入Mysql](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Flink/17-Flink%E6%B6%88%E8%B4%B9Kafka%E5%86%99%E5%85%A5Mysql.md)

#### Flink当前最火的实时计算引擎-放弃篇

* [Flink漫谈系列1-概述](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Flink%E6%BC%AB%E8%B0%88%E7%B3%BB%E5%88%97/Apache-Flink%E6%BC%AB%E8%B0%88%E7%B3%BB%E5%88%97(1)-%E6%A6%82%E8%BF%B0.md)
* [Flink漫谈系列2-watermark](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Flink%E6%BC%AB%E8%B0%88%E7%B3%BB%E5%88%97/Apache-Flink-%E6%BC%AB%E8%B0%88%E7%B3%BB%E5%88%97(02)-Watermark.md)
* [Flink漫谈系列3-state](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Flink%E6%BC%AB%E8%B0%88%E7%B3%BB%E5%88%97/Apache-Flink-%E6%BC%AB%E8%B0%88%E7%B3%BB%E5%88%97(03)-State.md)

## 五、HBase

##### 📚 5.1 Hbase系统性总结

- [Hbase系统性总结(知识星球读者专享)](https://mp.weixin.qq.com/s/HJMolm5vMDbGWmprrxkzvA)

##### 📚 5.2 系统性学习

- [Hbase学习专栏](https://blog.csdn.net/u013411339/category_11681293.html)

##### 📚 5.3 分类导航

1. [Hbase 简介](大数据框架学习/Hbase简介.md)
2. [HBase 系统架构及数据结构](大数据框架学习/Hbase系统架构及数据结构.md)
3. [HBase 基本环境搭建 (Standalone /pseudo-distributed mode)](大数据框架学习/installation/HBase单机环境搭建.md)
4. [HBase 集群环境搭建](大数据框架学习/installation/HBase集群环境搭建.md)
5. [HBase 常用 Shell 命令](大数据框架学习/Hbase_Shell.md)
6. [HBase Java API](大数据框架学习/Hbase_Java_API.md)
7. [Hbase 过滤器详解](大数据框架学习/Hbase过滤器详解.md)
8. [HBase 协处理器详解](大数据框架学习/Hbase协处理器详解.md)
9. [HBase 容灾与备份](大数据框架学习/Hbase容灾与备份.md)
10. [HBase的 SQL 中间层 —— Phoenix](大数据框架学习/Hbase的SQL中间层_Phoenix.md)
11. [Spring/Spring Boot 整合 Mybatis + Phoenix](大数据框架学习/Spring+Mybtais+Phoenix整合.md)

## 六、Kafka

##### 📚 6.1 Kafka系统性总结

[Kafka系统性总结](https://t.zsxq.com/eIMrjmE)

##### 📚 6.2 Kafka系统性学习

- [Kafka系统性学习专栏](https://blog.csdn.net/u013411339/category_11681300.html)

**Kafka基本原理 :**

1. [Kafka 简介](大数据框架学习/Kafka简介.md)
2. [基于 Zookeeper 搭建 Kafka 高可用集群](大数据框架学习/installation/基于Zookeeper搭建Kafka高可用集群.md)
3. [Kafka 生产者详解](大数据框架学习/Kafka生产者详解.md)
4. [Kafka 消费者详解](大数据框架学习/Kafka消费者详解.md)
5. [深入理解 Kafka 副本机制](大数据框架学习/Kafka深入理解分区副本机制.md)

**分布式消息队列Kafka原理及与流式计算的集成 :**

1. [Apache-Kafka简介](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Kafka/Apache-Kafka%E7%AE%80%E4%BB%8B.md)
2. [Apache-Kafka核心概念](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Kafka/Apache-Kafka%E6%A0%B8%E5%BF%83%E6%A6%82%E5%BF%B5.md)
3. [Apache-Kafka安装和使用](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Kafka/Apache-Kafka%E5%AE%89%E8%A3%85%E5%92%8C%E4%BD%BF%E7%94%A8.md)
4. [Apache-Kafka编程实战](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Kafka/Apache-Kafka%E7%BC%96%E7%A8%8B%E5%AE%9E%E6%88%98.md)
5. [Apache-Kafka核心组件和流程(副本管理器)](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Kafka/Apache-Kafka%E6%A0%B8%E5%BF%83%E7%BB%84%E4%BB%B6%E5%92%8C%E6%B5%81%E7%A8%8B(%E5%89%AF%E6%9C%AC%E7%AE%A1%E7%90%86%E5%99%A8).md)
6. [Apache-Kafka核心组件和流程-协调器](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Kafka/Apache-Kafka%E6%A0%B8%E5%BF%83%E7%BB%84%E4%BB%B6%E5%92%8C%E6%B5%81%E7%A8%8B-%E5%8D%8F%E8%B0%83%E5%99%A8.md)
7. [Apache-Kafka核心组件和流程-控制器](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Kafka/Apache-Kafka%E6%A0%B8%E5%BF%83%E7%BB%84%E4%BB%B6%E5%92%8C%E6%B5%81%E7%A8%8B-%E6%8E%A7%E5%88%B6%E5%99%A8.md)
8. [Apache-Kafka核心组件和流程-日志管理器](https://github.com/wangzhiwubigdata/God-Of-BigData/blob/master/Kafka/Apache-Kafka%E6%A0%B8%E5%BF%83%E7%BB%84%E4%BB%B6%E5%92%8C%E6%B5%81%E7%A8%8B-%E6%97%A5%E5%BF%97%E7%AE%A1%E7%90%86%E5%99%A8.md)

## 七、Zookeeper

1. [Zookeeper 简介及核心概念](大数据框架学习/Zookeeper简介及核心概念.md)
2. [Zookeeper 单机环境和集群环境搭建](大数据框架学习/installation/Zookeeper单机环境和集群环境搭建.md)
3. [Zookeeper 常用 Shell 命令](大数据框架学习/Zookeeper常用Shell命令.md)
4. [Zookeeper Java 客户端 —— Apache Curator](大数据框架学习/Zookeeper_Java客户端Curator.md)
5. [Zookeeper ACL 权限控制](大数据框架学习/Zookeeper_ACL权限控制.md)

## 八、大数据算法

#### 大数据算法

[大数据算法](https://blog.csdn.net/u013411339/article/details/113429172)

## 第三部分:大数据开发实战进阶篇

### 一、Flink实战进阶文章合集

#### Flink实战合集

[点我查看Flink实战合集](实战系列文章/Flink实战.md)

#### 系统性学习

- [Flink系统学习专栏](https://blog.csdn.net/u013411339/category_11681289.html)
- [Flink高级进阶和企业级应用](https://blog.csdn.net/u013411339/category_9285579.html)

1. [菜鸟供应链实时技术架构演进](https://mp.weixin.qq.com/s/fnx2GnbCWNcaptVPsSp7dw)
2. [趣头条实战-基于Flink+ClickHouse构建实时数据平台](https://mp.weixin.qq.com/s/s6YFOINMw9TKg-QVOkZT9A)
3. [ApacheFlink新场景-OLAP引擎](https://mp.weixin.qq.com/s/NLwYpjzNgkR8O5zRg7RCoQ)
4. [说说Flink DataStream的八种物理分区逻辑](https://mp.weixin.qq.com/s/d_jzHb-b7LEGNz1CN34zMg)
5. [State Processor API:如何读取,写入和修改 Flink 应用程序的状态](https://mp.weixin.qq.com/s/eHPQx3kGKnhXeZpLhUNvng)
6. [Flink滑动窗口原理与细粒度滑动窗口的性能问题](https://mp.weixin.qq.com/s/Q4k0xgPCOUQ-A2DQ-XaJgw)
7. [基于Flink快速开发实时TopN](https://mp.weixin.qq.com/s/Ppz5740WTB7lTTLHNL72Tg)
8. [使用 Apache Flink 开发实时 ETL](https://mp.weixin.qq.com/s/kLjEceslHQxDDi3mRrjR-g)
9. [Flink Source/Sink探究与实践:RocketMQ数据写入HBase](https://mp.weixin.qq.com/s/kZQRQBVjYiXKfMhKM7SSqQ)
10. [Spark/Flink广播实现作业配置动态更新](https://mp.weixin.qq.com/s/GDylIWCDnjpX9_X6T9NmMA)
11. [Flink全链路延迟的测量方式](https://mp.weixin.qq.com/s/A6CIPsGf-aCWXkB7O-toVw)
12. [Flink原理-Flink中的数据抽象及数据交换过程](https://mp.weixin.qq.com/s/5BlCzguYiEP1h48jwkos2w)
13. [Flink SQL Window源码全解析](https://mp.weixin.qq.com/s/UkpkS_JiRGR0ibZKYechbg)
14. [Flink DataStream维度表Join的简单方案](https://mp.weixin.qq.com/s/e-lyViKV4NPmOVwA5Jn6Qw)
15. [Apache Flink的内存管理](https://mp.weixin.qq.com/s/cBMrF814jGtEFdve0Lrr6g)
16. [Flink1.9整合Kafka实战](https://mp.weixin.qq.com/s/e0BQoY5Y79NHhcQ9MqltFQ)
17. [Apache Flink在小米的发展和应用](https://mp.weixin.qq.com/s/KbhmJCW80UmeFwRxM3jerg)
18. [基于Kafka+Flink+Redis的电商大屏实时计算案例](https://mp.weixin.qq.com/s/BPzOBz7oTfn2_yW8tevEEw)
19. [Flink实战-壳找房基于Flink的实时平台建设](https://mp.weixin.qq.com/s/TsU_5N0Csfw-afN9AdAihw)
20. [用Flink取代Spark Streaming!知乎实时数仓架构演进](https://mp.weixin.qq.com/s/0M8XLTgpj6jWNcokNhyxAw)
21. [Flink实时数仓-美团点评实战](https://mp.weixin.qq.com/s/Oom-TaEsT6GKGs95dJil5Q)
22. [来将可留姓名?Flink最强学习资源合集!](https://mp.weixin.qq.com/s/13w43iYT3-riIj757HPGxw)
23. [数据不撒谎,Flink-Kafka性能压测全记录!](https://mp.weixin.qq.com/s/0VXqbzLBj5rZjjf4jAc3UQ)
24. [菜鸟在物流场景中基于Flink的流计算实践](https://mp.weixin.qq.com/s/2_8uOdDJwzYxUP-NLh6VhA)
25. [基于Flink构建实时数据仓库](https://mp.weixin.qq.com/s/Rhgt33y102WzR9-Zq15iVQ)
26. [Flink/Spark 如何实现动态更新作业配置](https://mp.weixin.qq.com/s/sjRV_F9tXEfqKL_00rJc7w)

### 二、Spark实战进阶文章合集

#### Spark实战合集

[点我查看Spark实战合集](实战系列文章/Spark实战.md)

1. [如果你在准备面试,好好看看这130道题](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486986&idx=1&sn=422d1a3c11c72ff97b32cc01142839f4&chksm=fd3d489fca4ac1895242ab94b932b12c65dc57b5f3a16acc7084dc8a189e9026290245a64c4f&token=1999457569&lang=zh_CN#rd)
2. [ORC文件存储格式的深入探究](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486981&idx=1&sn=9c8fc4c127d7e6108ac4e171e750d490&chksm=fd3d4890ca4ac186614f0dda8ffb2d35693a925b03861a01769c898652b53d0d436bca05ea12&token=1999457569&lang=zh_CN#rd)
3. [基于SparkStreaming+Kafka+HBase实时点击流案例](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486938&idx=1&sn=83347b444fc721442d2b4e1a58eca0e8&chksm=fd3d4b4fca4ac2597abd4a39a21dea83220858ae5efd1ae287b471bb9970f165b4854426f4c2&token=1999457569&lang=zh_CN#rd)
4. [HyperLogLog函数在Spark中的高级应用](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486931&idx=1&sn=9c3f3d6a677ed2aa6cc508046ca6da78&chksm=fd3d4b46ca4ac25044d6033458d3b43e7d1f50f447bded4eed8b246991cf620c91919c51e35b&token=1999457569&lang=zh_CN#rd)
5. [我们常说的海量小文件的根源是什么?](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486931&idx=2&sn=1c5987f3bad7a805895484ebfd683e11&chksm=fd3d4b46ca4ac250d31502a76f0cfd02ebea29d1161b9e9faec23ea6aa8947d0dd7d4268427f&token=1999457569&lang=zh_CN#rd)
6. [Structured Streaming | Apache Spark中处理实时数据的声明式API](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486904&idx=1&sn=5f4b673a87497a9c1dc9d7ce253994f3&chksm=fd3d4b2dca4ac23b9850c54d62ebe8be5920cbd4a491a46a0325e1fbbd91f6b1aef9adf2f638&token=1999457569&lang=zh_CN#rd)
7. [Spark面对OOM问题的解决方法及优化总结](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486765&idx=1&sn=516a32e8c1e9842606a7670862ec7e97&chksm=fd3d4bb8ca4ac2ae24e315083cdb195fdf897e1c3cce69d2c94183d5c20c1af1bfbe0e5480fd&token=1999457569&lang=zh_CN#rd)
8. [Spark 动态资源分配(Dynamic Resource Allocation) 解析](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486761&idx=1&sn=959aaa5266307a64181631ba1ae46e86&chksm=fd3d4bbcca4ac2aad8a3958da40c22e565997180c635bb6c32a9b4aa7e82a74a56423320c65a&token=1999457569&lang=zh_CN#rd)
9. [Apache Spark在海致大数据平台中的优化实践](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486668&idx=1&sn=981028fdcf937ba45b3914e2d48b94db&chksm=fd3d4a59ca4ac34f089ff1dce9542a0f481a73439acec9223fa2d776ff3f2eaec498cc543d06&token=1999457569&lang=zh_CN#rd)
10. [Spark/Flink广播实现作业配置动态更新](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486644&idx=1&sn=d2637a1e918c2b1be4c9fe3d74f75a92&chksm=fd3d4a21ca4ac3377cc8836939cc041cf934bb57f73b6b618fd1de608495d86e278c1c7e4cdc&token=1999457569&lang=zh_CN#rd)
11. [Spark SQL读数据库时不支持某些数据类型的问题](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486629&idx=2&sn=54d05858d659756a46b9be36b7b63ee5&chksm=fd3d4a30ca4ac326f69ba9b7cf6a82418afd31193e96aa28346779264a7d8a1df4c7a3a31fba&token=1999457569&lang=zh_CN#rd)
12. [这个面试问题很难么 | 如何处理大数据中的数据倾斜](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486586&idx=1&sn=e35def6429adaada0910b91eed8d7b1f&chksm=fd3d4aefca4ac3f9918d57b947d4bc8f71afd9d7666ffe28e3660e5efab332c714bef0879a90&token=1999457569&lang=zh_CN#rd)
13. [Spark难点 | Join的实现原理](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486582&idx=1&sn=7b6291dedb2e6892342e1ed705bdfb2e&chksm=fd3d4ae3ca4ac3f591f297635c0ff8e63e3deb6d6aec72c8fb18e19f8f6fcdb5cd2abcd37d09&token=1999457569&lang=zh_CN#rd)
14. [面试注意点 | Spark&Flink的区别拾遗](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486577&idx=1&sn=49fd0138ad9837192b6eb78cbdc478e6&chksm=fd3d4ae4ca4ac3f2aec91f40435d1eb55a9e497fe7d8ab9ad52396838f467879ac56cb83c9c2&token=1999457569&lang=zh_CN#rd)
15. [Spark Checkpoint的运行原理和源码实现](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486514&idx=1&sn=a3c3084b8a797a0889d7583fb10d70c9&chksm=fd3d4aa7ca4ac3b12416ba8269041032a515b7a4e71bda1709e359a15a04df4f638738c6e01a&token=1999457569&lang=zh_CN#rd)
16. [阿里云Spark Shuffle的优化](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486505&idx=1&sn=91316d3aa5a99945ccd992c0a98e500d&chksm=fd3d4abcca4ac3aa26f051504244239ff1ab48eb71cf01c14f689eb4767c121b1477d93b60ea&token=1999457569&lang=zh_CN#rd)
17. [使用Kafka+Spark+Cassandra构建实时处理引擎](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486485&idx=1&sn=999b84e0ac87f2aa1be4870921279a21&chksm=fd3d4a80ca4ac39695d9307582ff58938180d0c5b12f6db84ddae933edb345b0cbe46f6284ed&token=1999457569&lang=zh_CN#rd)
18. [基于HBase和Spark构建企业级数据处理平台](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486477&idx=1&sn=21394cca8fe279bbc2032d48f65672f6&chksm=fd3d4a98ca4ac38e3bf9700cfe65131fffadbbf9493ccc658b3e7be97ddb6665ad41a70f67f9&token=1999457569&lang=zh_CN#rd)
19. [SparkSQL在字节跳动的应用实践和优化实战](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486419&idx=1&sn=3bc8af144370a602817ca87415fe525d&chksm=fd3d4d46ca4ac450ec6f1ac84b2f3162071eb85221464bdb1976178ca2363cd2acb3d63d401c&token=1999457569&lang=zh_CN#rd)
20. [SparkRDD转DataSet/DataFrame的一个深坑](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486369&idx=1&sn=9a760114ed2c7509e191ad29370eecce&chksm=fd3d4d34ca4ac4222d310c37fb67bcaafd5425e1a16c43434c4d1fea403756e266de21b83181&token=1999457569&lang=zh_CN#rd)
21. [Spark和Flink的状态管理State的区别和应用](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486332&idx=1&sn=ecbe21981c5c36f6c420755e8d63fb8f&chksm=fd3d4de9ca4ac4ff1b11093432f444c9e4f66c45a6e01155d69cface70456fb418a98163757d&token=1999457569&lang=zh_CN#rd)
22. [Kafka+Spark Streaming管理offset的几种方法](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486328&idx=1&sn=b5d53e0007032114fb277e440e5ce4bf&chksm=fd3d4dedca4ac4fbb5026338af7dbbfe1d845fdc11e7e2279035dddc8ff0b6dd24c32be129ce&token=1999457569&lang=zh_CN#rd)
23. [从 PageRank Example谈Spark应用程序调优](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486322&idx=2&sn=00ddcd16109249e45a70233d5ef959ba&chksm=fd3d4de7ca4ac4f15f85d9a2873c5d1070af3bb929479bd66f7a0dc89ac0777b6960cbce5970&token=1999457569&lang=zh_CN#rd)
24. [Spark调优|SparkSQL参数调优](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247485574&idx=1&sn=723c02562ee1f44e88c389d6ac8a2c87&chksm=fd3d4e13ca4ac705ad892ada2c68792c906946f271bb1e6afd15a93749f8f87d0ea967ba2d37&token=1999457569&lang=zh_CN#rd)
25. [Flink/Spark 如何实现动态更新作业配置](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247485507&idx=1&sn=2cab7a3714ce4e16351b8edcde95e777&chksm=fd3d4ed6ca4ac7c058d820f36b2d03c3ccddc293e1035b6897e99544e55931f6cd6a2f343647&token=1999457569&lang=zh_CN#rd)
26. [Stream SQL的执行原理与Flink的实现](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247485361&idx=1&sn=8348203b6f17662a64fa5d412de97296&chksm=fd3d4124ca4ac8329bac00fae705f37c08236d583e9900f268796d463be9323d82ba4b8b1554&token=1999457569&lang=zh_CN#rd)
27. [Spark将Dataframe数据写入Hive分区表的方案](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247485321&idx=1&sn=13e73673fb29bd134ab79e03a369288c&chksm=fd3d411cca4ac80ac68204d18aee55b95cadde7bf18cc2f8a549fc0b6ff4ec4db570190566c3&token=1999457569&lang=zh_CN#rd)
28. [Spark中几种ShuffleWriter的区别你都知道吗?](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247485316&idx=1&sn=8a7a02023f15324885de7a5c93d4dd94&chksm=fd3d4111ca4ac807f34e1fa03023494d46f3770c035290ddefe1333419453a9258f504584aee&token=1999457569&lang=zh_CN#rd)
29. [SparkSQL的3种Join实现](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247485217&idx=1&sn=3ce9fa8ad179c008754873129e51fbe7&chksm=fd3d41b4ca4ac8a25957fc9541437e6546fc2926df2908bad5adbd30a6cecf9f797fef74f894&token=1999457569&lang=zh_CN#rd)
30. [周期性清除Spark Streaming流状态的方法](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247485138&idx=1&sn=8f71070470c8963e7c973b5f10bf3c03&chksm=fd3d4047ca4ac951981f7f0fa08f9f6a1821270441b5008da28115b67020b01ef2936b5f1e88&token=1999457569&lang=zh_CN#rd)
31. [Structured Streaming之状态存储解析](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247485119&idx=1&sn=fd172b1f9c9ef99eac2ed976a7d4459f&chksm=fd3d402aca4ac93ced929dfb9b3d785fa5e00d82939b4ea7ed31e68096b87f31a0d11c5f7414&token=1999457569&lang=zh_CN#rd)
32. [Spark SQL重点知识总结](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247485105&idx=1&sn=0bea228e6845d04739937b75bd2f8d9a&chksm=fd3d4024ca4ac9322cae55569b7bdc6d8546dc9c4583f25801d3820ab3b30094c50816c7c1c8&token=1999457569&lang=zh_CN#rd)
33. [SparkSQL极简入门](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247485058&idx=1&sn=4d3b5c25ca1fdf1f0fb0cd99959d2371&chksm=fd3d4017ca4ac90140442dfc6032346d5841d6705bd92441ad3d0a0366db346752a6b154b976&token=1999457569&lang=zh_CN#rd)
34. [Spark Shuffle在网易的优化](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247485051&idx=1&sn=a1a70cad450634ceae44d4e14a4fc3ef&chksm=fd3d40eeca4ac9f8901a93271683811da05c62e7a6f1d2825378c32542ef2edf9947273a601a&token=1999457569&lang=zh_CN#rd)
35. [广告点击数实时统计:Spark StructuredStreaming + Redis Streams](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247484999&idx=1&sn=f9cf6eae39bc1d54faaa144357731d2f&chksm=fd3d40d2ca4ac9c4e886bd0208521c45dbab98e9cfd34826e6808d414bcd66f203f0359382e1&token=1999457569&lang=zh_CN#rd)
36. [Spark内存调优](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247484972&idx=1&sn=ff9a2925c31e07b558504be17937872b&chksm=fd3d40b9ca4ac9afe2cce8ff4a6c50724a146b87d2d05d87d6dcdfb34a177f3be4d5ba79b492&token=1999457569&lang=zh_CN#rd)
37. [Structured Streaming 实现思路与实现概述](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247484959&idx=1&sn=2173f71a32e16b510f047fa716549bc2&chksm=fd3d408aca4ac99cef4ec3079d9c7376c14f457e80f814e1494f5324612b553fb5487783a486&token=1999457569&lang=zh_CN#rd)
38. [Spark之数据倾斜调优](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247484956&idx=1&sn=9182a40fcf1fced04acee81aa9261bfe&chksm=fd3d4089ca4ac99f23952f0d627db4600a81808d98b1a635ae40e06c939e4a229a47d666de47&token=1999457569&lang=zh_CN#rd)
39. [你不得不知道的知识-零拷贝](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247484766&idx=1&sn=8d0aeaa1166a9338df9f28bb47959f4a&chksm=fd3d43cbca4acadd0dfc9e753ca4fe1cc2ca49060d886b359bdf3537809015b04f82b4be27ac&token=1999457569&lang=zh_CN#rd)
40. [Spark Streaming消费Kafka数据的两种方案](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247484751&idx=1&sn=11315f599b39eac96c17a78da2fa1258&chksm=fd3d43daca4acaccc624947f5fa84f650e7f61d8638feda67db07cacf51bc985faae74cc94c3&token=1999457569&lang=zh_CN#rd)

### 三、Kafka实战进阶文章合集

#### Kafka实战合集

[点我查看Kafka实战合集](实战系列文章/Kafka实战.md)

### 四、数据仓库实战系列

#### 数据仓库实战合集

[点我查看数据仓库实战合集](实战系列文章/数据仓库.md)

### 五、OLAP实战文章系列OLAP

* [ClickHouse专栏](https://blog.csdn.net/u013411339/category_11681457.html)

### 六、硬刚系列文章合集

#### 硬刚系列文章合集

* [《硬刚Presto|Presto原理&调优&面试&实战全面升级版》](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247503995&idx=1&sn=ead9bbd4ea821c94efc1e18875c1722c&chksm=fd3e96eeca491ff8e0b6a0e1ad3c9ada5365457dd730ce9e19339803504ae10d8108c296fc27&token=935192335&lang=zh_CN&scene=21#wechat_redirect)
* [《硬刚Apache Iceberg | 技术调研&在各大公司的实践应用大总结》](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247503879&idx=2&sn=bd009e298f2bdf9bb8abc9271b515143&chksm=fd3e9692ca491f84722c922000754aafc4d5a271b0ee40d525ce177de3c4442ef83b5ade8e05&scene=21&token=935192335&lang=zh_CN#wechat_redirect)
* [《硬刚ClickHouse | 4万字长文ClickHouse基础&实践&调优全视角解析》](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247503675&idx=1&sn=3ee6af64d0126c78b48cad219308f81e&chksm=fd3e89aeca4900b8b8954e9569ee3c0877881fac8c792bfafc22e7e9d3e8524da8eb860d33d8&scene=21#wechat_redirect)
* [《硬刚数据仓库|SQL Boy的福音之数据仓库体系建模&实施&注意事项小总结》](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247503576&idx=1&sn=f9fc428799e0fcc78e94360e1cec7b95&chksm=fd3e884dca49015b6d38c437f603b4deffeeb0cefafd32d358a891bfe820f734116100395bba&scene=21#wechat_redirect)
* [《硬刚Hive | 4万字基础调优面试小总结》](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247502750&idx=1&sn=bd9a9173d060dc4e4ebd49c8efc6acfe&chksm=fd3e8d0bca49041dea84da93910e5efdc4935e520525c09887c986691377aeb48e5cf7fb5667&scene=21#wechat_redirect)
* [《硬刚用户画像(一) | 标签体系下的用户画像建设小指南》](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247503741&idx=1&sn=e5039be93123f2e337013756a818bfc3&chksm=fd3e89e8ca4900fe603b63c5722a6fb8a32bd63d6ba23e0028851948a71b877eb1f742d95087&scene=21#wechat_redirect)
* [《硬刚用户画像(二) | 基于大数据的用户画像构建小百科全书》](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247489571&idx=1&sn=56a634d66fb689907b4ab51ed2d3707a&chksm=fd3d5eb6ca4ad7a0cc5fa4f895354e58ed7f2cb8558369ed6149560a5e7fca97b8545036fe87&scene=21#wechat_redirect)
* [《我们在学习Spark的时候,到底在学习什么?](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247504460&idx=1&sn=897f9cebaade7861b110fec0a34fc38b&chksm=fd3e94d9ca491dcf85f611c6772703973035ce05e81defdc8d05597f2c478554e063aaa60f00&scene=21#wechat_redirect)》
* [《我们在学习Flink的时候,到底在学习什么?》](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247499604&idx=1&sn=d938dfb30d221774704982d2938b30c1&chksm=fd3eb9c1ca4930d76a391241333de461ca22d2aa27472eab3cffab1564872ae37f1b48fe2d3c&token=935192335&lang=zh_CN&scene=21#wechat_redirect)

### 七、2020精品文章合集

#### 2020精品文章合集

**实时计算篇**

* [Structured Streaming | Apache Spark中处理实时数据的声明式API](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486904&idx=1&sn=5f4b673a87497a9c1dc9d7ce253994f3&chksm=fd3d4b2dca4ac23b9850c54d62ebe8be5920cbd4a491a46a0325e1fbbd91f6b1aef9adf2f638&scene=21#wechat_redirect)
* [HyperLogLog函数在Spark中的高级应用](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486931&idx=1&sn=9c3f3d6a677ed2aa6cc508046ca6da78&chksm=fd3d4b46ca4ac25044d6033458d3b43e7d1f50f447bded4eed8b246991cf620c91919c51e35b&scene=21#wechat_redirect)
* [基于SparkStreaming+Kafka+HBase实时点击流案例](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486938&idx=1&sn=83347b444fc721442d2b4e1a58eca0e8&chksm=fd3d4b4fca4ac2597abd4a39a21dea83220858ae5efd1ae287b471bb9970f165b4854426f4c2&scene=21#wechat_redirect)
* [基于Flink SQL构建实时数据仓库](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486938&idx=1&sn=83347b444fc721442d2b4e1a58eca0e8&chksm=fd3d4b4fca4ac2597abd4a39a21dea83220858ae5efd1ae287b471bb9970f165b4854426f4c2&scene=21#wechat_redirect)
* [Flink异步之矛-锋利的Async I/O](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486945&idx=1&sn=f973d74b58d81e96db2bff431c39b2e2&chksm=fd3d4b74ca4ac2629b9f79232a56d8209627e32ebbfb72ca269bf49c863fdf6f3418f6dee5db&scene=21#wechat_redirect)
* [Spark SQL快速入门系列之Hive](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492891&idx=2&sn=c86bb4c8a3f96d1e5740e4b16d3638d0&chksm=fd3ea38eca492a9841b34a64a10272c83252bb9815dd164d038248fc71bda7119b266afa5c20&scene=21#wechat_redirect)
* [基于SparkStreaming+Kafka+HBase实时点击流案例](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492891&idx=2&sn=c86bb4c8a3f96d1e5740e4b16d3638d0&chksm=fd3ea38eca492a9841b34a64a10272c83252bb9815dd164d038248fc71bda7119b266afa5c20&scene=21#wechat_redirect)
* [三万字长文 | Spark性能优化实战手册](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487074&idx=2&sn=02fc69f890ee9922df1ad6b831f448e5&chksm=fd3d48f7ca4ac1e1377befc32a25e97120c1dde9dcb38fa940834302238eb1ad5ceb8b504e7e&scene=21#wechat_redirect)
* [Flink整合OozieShellAction提交任务带Kerberos认证](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487182&idx=1&sn=e69012117af61a623e30e60c0c356444&chksm=fd3d485bca4ac14d360cff2d1e2e78105d350d71daf9b84d911d95b5cc639c528969a5c2fa71&scene=21#wechat_redirect)
* [Spark源码阅读的正确打开方式](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487182&idx=1&sn=e69012117af61a623e30e60c0c356444&chksm=fd3d485bca4ac14d360cff2d1e2e78105d350d71daf9b84d911d95b5cc639c528969a5c2fa71&scene=21#wechat_redirect)
* [消息队列常见面试问题小集合](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487224&idx=1&sn=340d084b84ae4d49de9f46a7974e8a4c&chksm=fd3d486dca4ac17b31a221ff1cdcd3920035eeb9f480c00955268686623d512b516214efb3a6&scene=21#wechat_redirect)
* [Flink1.10和Hive集成一些需要注意的点](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487242&idx=1&sn=b118be66baa230df95c317d0d14c8a4a&chksm=fd3d499fca4ac089102e0eec2c8248174c43724115a610802e89f5a54d4dfee53aa5d27e8417&scene=21#wechat_redirect)
* [Flink事件时间、水印和迟到数据处理](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487259&idx=1&sn=ab3b02eb7e835831cfceb248ae63880a&chksm=fd3d498eca4ac098eac6864ff1f198575ab2822e26f0cbd18f7b666beb660ddef3e3790ab581&scene=21#wechat_redirect)
* [Flink使用Broadcast State实现流处理配置实时更新](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487268&idx=2&sn=743482009dab625b6b9316e02e793c7f&chksm=fd3d49b1ca4ac0a7d9ffcde9cb9f76d0850ee67ed7861ee24bf5ded980b03bd3ca4d0c86442e&scene=21#wechat_redirect)
* [实战 | MySQL Binlog通过Canal同步HDFS](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487268&idx=1&sn=d2417bd649586c4d6dd73e91ec75538e&chksm=fd3d49b1ca4ac0a7bba4431b4206a84bb13e0dcea01d293a46ae7f69e5fe829e9b37cc46716b&scene=21#wechat_redirect)
* [Flink最难知识点再解析 | 时间/窗口/水印/迟到数据处理](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487283&idx=2&sn=193b0544edb293fe0795dcfe7caf36ef&chksm=fd3d49a6ca4ac0b01c6f3fb7b93151936bef494ed06e236131a5bd4f86735a35e9c530c1229f&scene=21#wechat_redirect)
* [Hive on Spark参数调优姿势小结](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487330&idx=1&sn=fd95cc865937188ee19944d8b643e6ab&chksm=fd3d49f7ca4ac0e17207d2e543b8803444b34b92a777acdec10cf4f63efef87480fb6e5e5543&scene=21#wechat_redirect)
* [Flink Logback日志与邮件报警配置](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487346&idx=3&sn=4426fc62ba4d0895bf66858fcf6b3873&chksm=fd3d49e7ca4ac0f112b05522f1897c97bdfa9a0278041e085768fd9faa8cfbd0e3e1b840f861&scene=21#wechat_redirect)
* [Kafka设计-恰好一次和事务消息](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487346&idx=2&sn=971ffca7e79325e6cd3105eda31af4e1&chksm=fd3d49e7ca4ac0f191f516079b5e1e9451bdd924b4b8fbb8d690c1661e58b7e2dd353c657cdb&scene=21#wechat_redirect)
* [基于Canal和Kafka实现MySQL的Binlog近实时同步](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487357&idx=1&sn=55e0bf4b45ac7f7cae6aadae69e39ace&chksm=fd3d49e8ca4ac0fe6780ac5cf753809e7fa0f0cb73cc70bb8246b1901765199f968bd84fc7b2&scene=21#wechat_redirect)
* [一个基于RabbitMQ的可复用的事务消息方案](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487364&idx=2&sn=63567772a41545834791a7ab2696e4d4&chksm=fd3d4911ca4ac00729a17a37f70819d861ec9dde65df2f91406fbde7cae188594e386bff82ce&scene=21#wechat_redirect)
* [Spark性能优化总结](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487387&idx=1&sn=8b769ead6d7407ac490af0fb1b147598&chksm=fd3d490eca4ac0183f13910819092bb22b454f2390f44fc40dc8bf15e91a844b18ec587dd97e&scene=21#wechat_redirect)
* [Flink常见异常和错误信息小结](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487410&idx=1&sn=24dc08c17ac3d3ac7aaedef9359417d1&chksm=fd3d4927ca4ac0313bdd8999e4760dd6ad0a0471d1fce91493920a08480428d78972ff6f6bdd&scene=21#wechat_redirect)
* [Spark SQL快速入门系列之Hive](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487410&idx=1&sn=24dc08c17ac3d3ac7aaedef9359417d1&chksm=fd3d4927ca4ac0313bdd8999e4760dd6ad0a0471d1fce91493920a08480428d78972ff6f6bdd&scene=21#wechat_redirect)
* [实时计算双星-Flink VS Spark 部署模式对比](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247489363&idx=1&sn=677472699acb15539b4183e463cd00d2&chksm=fd3d51c6ca4ad8d0c01c90a87ab4fc4f172c82b04f27fe12cee19d502ae88e7fc3182ee7a27b&scene=21#wechat_redirect)
* [PID算法和Spark实现反压的原理](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487468&idx=1&sn=1fc593c4f8160b248a008889e098d4cb&chksm=fd3d4979ca4ac06f42518f5fe94d72646395beca75bf1f06dc347b7628baaf56582605705eaa&scene=21#wechat_redirect)
* [关于SparkSQL的开窗函数,你应该知道这些!](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487480&idx=2&sn=a233125a9d7f665ed4b69951deb6105a&chksm=fd3d496dca4ac07b283fa8ca95eee2c638d62b894e563ff40c86e69a82386c355ad7ae2ac223&scene=21#wechat_redirect)
* [Spark SQL是如何选择join策略的?](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487480&idx=1&sn=ec2325cb27dca653269b68f7d6101a07&chksm=fd3d496dca4ac07bbf25fea5a6dc4416be0a19d44ea5aeedc956c640e4acc74066898c195325&scene=21#wechat_redirect)
* [Spark on Hive & Hive on Spark,傻傻分不清楚](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487491&idx=1&sn=4a78fdda3f40f04383f0001522e27c63&chksm=fd3d5696ca4adf804713d6a7651a5a7d0fa35e0e80f453778e6fe84c40d29162bf3874691b18&scene=21#wechat_redirect)
* [来看看一个大二学生的Spark练习题](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487510&idx=1&sn=9d8170be7736ce6cae6e4ce983e2f65b&chksm=fd3d5683ca4adf95fcb1341b6df89dacd647ae890f7db5afad1d3e5251b9ba322b812bbcf767&scene=21#wechat_redirect)
* [Flink 自定义触发器实现带超时时间的 CountWindow](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491813&idx=1&sn=b0b76b7a41ea2a61705949496f07d03d&chksm=fd3ea670ca492f66da69213c0e74798687b4551102f035da0e973dcb8e92ffdb4d9f75e40c71&scene=21#wechat_redirect)
* [Spark Kafka 基于Direct自己管理offset](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491836&idx=2&sn=3eacfa2e3f136f0314012bd4976ee883&chksm=fd3ea669ca492f7fefeb1a8b47af5f53f080512178cfacd9978218b11cbb9c75b078e45f12dd&scene=21#wechat_redirect)
* [Apache Kylin | 麒麟出没,必有祥瑞](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491871&idx=1&sn=207b4283b8c75e3883d3af21e36d4ef5&chksm=fd3ea78aca492e9c442c68377a12d2cc7e4c967f30a24c55b1cc0191b916f7f3f9697bc187f1&scene=21#wechat_redirect)
* [Flink 参数配置和常见参数调优](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491884&idx=2&sn=a13485ac6594e464fad3f1c64771b4e2&chksm=fd3ea7b9ca492eaf0b71a2a841168451c3ce9080f66d5e08838ea922299797d60e5d477265c5&scene=21#wechat_redirect)
* [利用InfluxDB+Grafana搭建Flink on YARN作业监控大屏](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491911&idx=1&sn=ba027a574fd1a531cb3bec943c949363&chksm=fd3ea7d2ca492ec49f636ab5162b7d9f09730cfa9e4b9ff03589ca9eb1cf564c899aaef79ab3&scene=21#wechat_redirect)
* [网站日志实时分析之Flink处理实时热门和PVUV统计](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491992&idx=2&sn=c7942a1ccc921ef5c3a1403752cc81fd&chksm=fd3ea70dca492e1be3e51372fe6414f7f740f0f822d99f1444530da8b1e1dadbf18fe957d8b8&scene=21#wechat_redirect)
* [大数据量下的集合过滤—Bloom Filter](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491992&idx=1&sn=08e4afd7c2a21d6948b7fd7c2351eff0&chksm=fd3ea70dca492e1b51347fb31567a205e5c501d08ed130788e43ab8750316c99b14e7d4bd049&scene=21#wechat_redirect)
* [实时数仓链路分享:kafka =>SparkStreaming=>kudu集成kerberos](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492025&idx=2&sn=91cf0d75fcf8ce47ea4df45e012e23bb&chksm=fd3ea72cca492e3a206d5ea5efec625b7c9eb63991fa1dc4ccbae21d6f3742eb8e0382bf09fc&scene=21#wechat_redirect)
* [Flink CEP 原理和案例详解](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492025&idx=2&sn=91cf0d75fcf8ce47ea4df45e012e23bb&chksm=fd3ea72cca492e3a206d5ea5efec625b7c9eb63991fa1dc4ccbae21d6f3742eb8e0382bf09fc&scene=21#wechat_redirect)
* [ProcessFunction:Flink最底层API使用踩坑记录](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487531&idx=1&sn=183d114f36a697eb7df595fa24fec31c&chksm=fd3d56beca4adfa830cdbd99cf0adde30c7622c3541252cdf172ca0b66e60ae475e7796fdc96&scene=21#wechat_redirect)
* [Flink 1.10之改进的TaskManager内存模型与配置](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487665&idx=2&sn=b6efbb6d5efb6e0b0b978fe8a1515f56&chksm=fd3d5624ca4adf32609d3950c0265420e6b75989f8d0e89bf68ceda2495d0e1d1bc4db2f37ec&scene=21#wechat_redirect)
* [打通实时流处理log4j-flume-kafka-structured-streaming](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487716&idx=2&sn=d9a765695a803b124857db4da4834118&chksm=fd3d5671ca4adf67a5a0b1880548ea9273336638ee073866f672d4edc85c3ff39e42de69e350&scene=21#wechat_redirect)
* [如何设计实时数据平台(设计篇)](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487761&idx=2&sn=9bad34ba9f0c59200f9b10e09b3e7dce&chksm=fd3d5784ca4ade9214b723ed29d6662efd690a52b7e7f667a09690df1e9be2ed6302fb6d7cc1&scene=21#wechat_redirect)
* [如何设计实时数据平台(技术篇)](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487795&idx=2&sn=464480d4e075688ab389502e32ef0611&chksm=fd3d57a6ca4adeb0d31080c92b4f98f3187db6201eebb2be5c161b46c8da1806a275543cbbb7&scene=21#wechat_redirect)
* [SparkSQL内核解析-执行全过程概述](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247488182&idx=1&sn=5bbdf75fbad79d01cdb9de65d32f8d39&chksm=fd3d5423ca4add357c23ead5c0fd9818ca3406e341bd91ab33b1a50c6cd27af86dc3b8d90594&scene=21#wechat_redirect)
* [SparkSQL内核解析之逻辑计划](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247488207&idx=1&sn=2148863c2624465675f8bf6650b5843b&chksm=fd3d545aca4add4c39d828872f206cf198b13fe4f7026ad2725b62068f92e0f7853d3f28dd73&scene=21#wechat_redirect)
* [Flink-1.10中的StreamingFileSink相关特性](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247488580&idx=2&sn=590724dcc514a3f16e0221265858ddde&chksm=fd3d52d1ca4adbc71de9a818b902e9e3e0d1ac2aef374f7f2eecbce3965179066d8e2e041f94&scene=21#wechat_redirect)
* [Kafka下的生产消费者模式与订阅发布模式](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247488638&idx=1&sn=8a122c4b3d455292e13e5da1778de6e6&chksm=fd3d52ebca4adbfd4fe4860a90db3d3cfb0fa0048a896342a6ef6eb919c7dae6e3ade0d4f94e&scene=21#wechat_redirect)
* [Kafka+Spark Streaming如何保证exactly once语义](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247489143&idx=1&sn=3bc211306e107113789f617d39454ee9&chksm=fd3d50e2ca4ad9f45d6302299f35ab3d0423fd170f5959b7acd1826a289c7f4ee2db7cd52460&scene=21#wechat_redirect)
* [Flink之实时统计热门商品的TopN](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247489166&idx=1&sn=7bd32ccc732d9c7e5392b5f3e9ab14e2&chksm=fd3d501bca4ad90dba7326060b08bb09b1523b27ca1bd7254fcd1e86d7ae3b4c8ae18c637e3a&scene=21#wechat_redirect)
* [SparkSQL的自适应执行-Adaptive Execution](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247490000&idx=1&sn=984a48afb840b57343d00dfe19335b5c&chksm=fd3d5f45ca4ad6539ba1e979de8f60d29ee003414dfcbb8b34c07e86aa3f15a353303215b076&scene=21#wechat_redirect)
* [Kafka KSQL实战](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247490024&idx=2&sn=e4822af63cc067d3f5486acd5a668ad8&chksm=fd3d5f7dca4ad66ba406333b2ef2802fda513a0c6852a315f5e6760b321825ab5737eb52d87a&scene=21#wechat_redirect)
* [ELK+FileBeat+Kafka分布式系统搭建图文教程](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492241&idx=1&sn=8899979a87a17d9d4ed5ddf901e1a615&chksm=fd3ea404ca492d124e3404fe0d019c6642b90ebc146f2c0f500b82109f4056f7abd36413a38b&scene=21#wechat_redirect)
* [HDFS应用场景、原理、基本架构及使用方法](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492293&idx=2&sn=f9354cc4d1ce1541c2aaae99a758437c&chksm=fd3ea450ca492d464331a96746135e9f4ef0205ee968abbcf7a6c4c837b5e1632ef107739f54&scene=21#wechat_redirect)
* [数据模型⽆法复⽤,归根结底还是设计问题](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492293&idx=1&sn=c14bde7878c5b28cd3698f5a8df735ba&chksm=fd3ea450ca492d46fd9f1bdf3e6f45bed2c8839cc314e1239e73bb5f7daff3a067e4dc5ab914&scene=21#wechat_redirect)
* [Hadoop支持Lzo压缩配置及案例](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492923&idx=1&sn=6b5a0594031c40ff0e543aa6d9740cc6&chksm=fd3ea3aeca492ab887f8b95a711d3c1bbbd4c2438d941efe42329d713569bb42482342dd92ba&scene=21#wechat_redirect)
* [快看 | Java连接集成Kerberos的HA HDFS方案](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492168&idx=3&sn=fac9d36291779f58afdf42035d7202a3&chksm=fd3ea4ddca492dcb252d2173f3b247f78552d068eac429f8f10c45e9fff0fbf2d87e0a33d671&scene=21#wechat_redirect)
* [Kafka消费者分区分配策略及自定义分配策略](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491557&idx=2&sn=ba688e1f3b70198975cc57fd845fa337&chksm=fd3d5970ca4ad0668cacb9c1623423fecccdc685e02de5dbf95249f8daf191c2dc732fa9bfc3&scene=21#wechat_redirect)
* [Spark Streaming整合log4j、Flume与Kafka的案例](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492168&idx=2&sn=e460dc527658b327af8437074cc6ac99&chksm=fd3ea4ddca492dcb50a0196d9b65b46b5e90f605f09c4b8307d377955aa1bfbe4a38df33ffb5&scene=21#wechat_redirect)
* [面试必知的 Spark SQL 几种 Join 实现](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491813&idx=2&sn=557cc24073bef28f4f3589b63f71c967&chksm=fd3ea670ca492f660d96498041568c59c60a6390d75931a032d05c53af017451b60a1b9fba49&scene=21#wechat_redirect)
* [Flink在大规模状态数据集下的checkpoint调优](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247490732&idx=3&sn=e1ed5667cd2e2f4f2f49c86ca20f40b8&chksm=fd3d5a39ca4ad32f239ed2002503b01bce90c091e636764d35371a1971c46d8a2d24662ff146&scene=21#wechat_redirect)
* [Write-Ahead Log(WAL预写日志)的工作原理](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247490732&idx=2&sn=e2771a1b14403478b801b02c28eedd03&chksm=fd3d5a39ca4ad32f031fc13032168907af08507f8e448d9a9bc786ac8602ba84c400a0f65722&scene=21#wechat_redirect)
* [Kafka常见的导致重复消费原因和解决方案](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247490749&idx=1&sn=c9ef0b3010bd1196efcef9df546985d6&chksm=fd3d5a28ca4ad33e2f1d9f20629fb5b1f7049760ded0d6eb9f935751bd78c7b41a188ff6bc42&scene=21#wechat_redirect)
* [Spark-submit 参数调优完整攻略](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247490768&idx=2&sn=7f78bd89f41d189aa4eee7d9de47359d&chksm=fd3d5a45ca4ad353f036260db7be1d5b91075d2de415745350b5d26c05162c53b092f1cd7de6&scene=21#wechat_redirect)
* [Kafka数据可靠性保证三板斧-ACK/ISR/HW](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247490798&idx=1&sn=2c0d6cceb9f1028ac1f2eb8192711c02&chksm=fd3d5a7bca4ad36d442069b4403a7a201c67a1cc72799de3664b63d360b559e58daacb86f396&scene=21#wechat_redirect)
* [Spark常见错误问题汇总](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247490873&idx=1&sn=2da5f475f178bda5ee5a74dc0306b25e&chksm=fd3d5bacca4ad2bab06dc1d526e28b71e714d499929a85675f2d40f5ec0d72200f54d42e038d&scene=21#wechat_redirect)
* [HBase操作组件:Hive、Phoenix、Lealone](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247490905&idx=2&sn=18196db069e5740503ee6be0907fafc2&chksm=fd3d5bccca4ad2da28a3ef31d17181f91bdedf4079ffea830328e726e0959011161fc41c8b86&scene=21#wechat_redirect)
* [Redis系列 | 缓存穿透、击穿、雪崩、预热、更新、降级](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247490966&idx=2&sn=f8f2ee62b1aa8353e0f3bba18a8c2f5e&chksm=fd3d5b03ca4ad2151a4c5898683a7eee080088760c4eca5d46df8eb619f01e8d44628cdead9e&scene=21#wechat_redirect)
* [Kafka工作流程及文件存储机制](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247490966&idx=1&sn=e81972c99e2792f449085d03715304b1&chksm=fd3d5b03ca4ad215cea217195cce7b29fd3150250e4d134c37ff57dabec52b8d849f62e0ca36&scene=21#wechat_redirect)
* [Redis6.0主从、哨兵、集群搭建和原理](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491086&idx=2&sn=68f9365aff2a66846143b20943c4b138&chksm=fd3d589bca4ad18d971f9ef6709da0217f63ec12615f0ef54c6327ca7568dccfac1a79628cbc&scene=21#wechat_redirect)
* [Spark Streaming官方编程指南](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491086&idx=2&sn=68f9365aff2a66846143b20943c4b138&chksm=fd3d589bca4ad18d971f9ef6709da0217f63ec12615f0ef54c6327ca7568dccfac1a79628cbc&scene=21#wechat_redirect)
* [【从0开始の全记录】Flume+Kafka+Spark+Spring Boot 统计网页访问量项目](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492401&idx=1&sn=88a3634793c0a3b86e48c5cfe21f6ba3&chksm=fd3ea5a4ca492cb2f298af0a3605fcfe80928057ab24c26e697c11e0c82654636f2ce5190776&scene=21#wechat_redirect)
* [Spark+Kudu的广告业务项目实战笔记(一)](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492426&idx=1&sn=5cc1326f8cdec7f54eedea656acb89c5&chksm=fd3ea5dfca492cc98f89e32b2291d5439896d119fec077cafa65b36fbf46a875d9f3c0996fd4&scene=21#wechat_redirect)
* [大数据入门:Spark+Kudu的广告业务项目实战笔记(二)](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492558&idx=1&sn=82382cfce0d052380d1c4096951402a2&chksm=fd3ea55bca492c4dea78c0e039e2c1993cdf0bd22f56fb97f9445645033140d34cf520aab697&scene=21#wechat_redirect)
* [大数据入门:Spark+Kudu的广告业务项目实战笔记(三)](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492558&idx=2&sn=995b7887879a439dd0dac0d801d82e4d&chksm=fd3ea55bca492c4dd856e6084a727a15359cac7f5278ac0bbbdacd57aec55296d3979bee5f09&scene=21#wechat_redirect)
* [大数据入门:Spark+Kudu的广告业务项目实战笔记(四)](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492558&idx=3&sn=d2273c79be1d99270d41270a2b12d872&chksm=fd3ea55bca492c4dbbc8ccaafc5375fbff8c01101cfca029f5f36d29985201001b83856204d6&scene=21#wechat_redirect)
* [大数据入门:Spark+Kudu的广告业务项目实战笔记(五)](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492558&idx=4&sn=e5f0d6e8168c2c51eafe4952dc41ba19&chksm=fd3ea55bca492c4d6261bfe591627118f2c203bb70ed86a43d03ef2b4d7cc952eef2b93a3235&scene=21#wechat_redirect)
* [大数据入门:Spark+Kudu的广告业务项目实战笔记(六)](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492558&idx=5&sn=9fd6dcc935f6d73a4c57166bc94212be&chksm=fd3ea55bca492c4d1b0d477f02a7f2f40b3944068720db7f7452a165719f6979fee0b1d48eb0&scene=21#wechat_redirect)
* [Flink 1.11新特性之SQL Hive Streaming简单示例](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492648&idx=2&sn=eaef19686a06d04efaf3d6f17a5ca0ba&chksm=fd3ea2bdca492bab691ce6e47d81387ba0a0cdc2350265ef5eb58e38b466237c549267022eb3&scene=21#wechat_redirect)
* [SparkSQL 整体运行架构和底层实现](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492699&idx=3&sn=fb58aad4cc9cd6aad71ab581816f0d0b&chksm=fd3ea2ceca492bd8c7bb490611e1d930b5833ede83d198e61555bc3fdd5687c2a513e700a800&scene=21#wechat_redirect)

**离线计算篇**

* [ORC文件存储格式的深入探究](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486981&idx=1&sn=9c8fc4c127d7e6108ac4e171e750d490&chksm=fd3d4890ca4ac186614f0dda8ffb2d35693a925b03861a01769c898652b53d0d436bca05ea12&scene=21#wechat_redirect)
* [Hadoop支持Lzo压缩配置及案例](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486981&idx=1&sn=9c8fc4c127d7e6108ac4e171e750d490&chksm=fd3d4890ca4ac186614f0dda8ffb2d35693a925b03861a01769c898652b53d0d436bca05ea12&scene=21#wechat_redirect)
* [神策数据分享 | 标签体系应用与建设(文末附下载链接)](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487175&idx=1&sn=c00064434ee03bef0b6aa07d2b9033a4&chksm=fd3d4852ca4ac144a310dc7d901bdf262e3c929ef7ab414e4de03602ed83b3901c098a00145b&scene=21#wechat_redirect)
* [环形缓冲区-Hadoop Shuffle过程中的利器](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487204&idx=1&sn=f344eb2641175828815075cd6e57520e&chksm=fd3d4871ca4ac167e7cd49e7938936e861ca5c14f048730487a7b225561e1794cd5d9e69905d&scene=21#wechat_redirect)
* [eBay | 实践Hadoop任务的性能翻倍之路](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487233&idx=1&sn=dae402bafe49cf98f1b112bca384d7e1&chksm=fd3d4994ca4ac08275d0a3c57efbfcaca370edeec62e7b5cc982f7299c2937103b7dea1af782&scene=21#wechat_redirect)
* [PDFT/Paxos/Raft-分布式一致性协议解析](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487287&idx=1&sn=e632bbf074c0d2c64a892a481872c6e7&chksm=fd3d49a2ca4ac0b4aa36391a8f36f24e49ebcf645a0da96042b76bb582f9d218aa9a57edfb7f&scene=21#wechat_redirect)
* [谈谈经典限流方法—漏桶、令牌桶与Guava RateLimiter的实现](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487294&idx=2&sn=03c969be724fe60b0b14f921cbcec8f6&chksm=fd3d49abca4ac0bdda91ceab31deca10f2059e2e37e34e60e17f54f358ab25c7c7bb93d4b840&scene=21#wechat_redirect)
* [轻量级异步屏障快照(ABS)算法解析](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487294&idx=1&sn=8b808225b1b792e892ab737485492ac6&chksm=fd3d49abca4ac0bdc9ecc3eb45bc89c2cb8f4af7f3a66c0e9e6290d374279fd0569049996143&scene=21#wechat_redirect)
* [Hadoop小文件利器Ozone](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487297&idx=2&sn=c5ef9a22eb4c3e3da9e5fc32fe90c949&chksm=fd3d49d4ca4ac0c29d739867fae2fe796ee9a2fc1aa29b85c30d34efbb167ac885e134af2053&scene=21#wechat_redirect)
* [数据指标体系建设](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487297&idx=2&sn=c5ef9a22eb4c3e3da9e5fc32fe90c949&chksm=fd3d49d4ca4ac0c29d739867fae2fe796ee9a2fc1aa29b85c30d34efbb167ac885e134af2053&scene=21#wechat_redirect)
* [Hbase FAQ热门问答小集合](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487338&idx=1&sn=00e93834aef4c73cd3cd2a487e3e6cc9&chksm=fd3d49ffca4ac0e948bb40ba97c5da9e7fa3ffb78afb6001f9d06603e6ac1938d7f06b5ee38d&scene=21#wechat_redirect)
* [设计HBase RowKey需要注意的二三事](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487369&idx=1&sn=6ad49150d939209cc59965be922ae5ed&chksm=fd3d491cca4ac00a82671803f60c1848932bc45aa649dcfe4f0f765fe24cc364a1bab81ec550&scene=21#wechat_redirect)
* [HBase优化笔记](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487378&idx=2&sn=f9d4400a0677a70a8df94cc0530831e3&chksm=fd3d4907ca4ac011d278ad227be1c902f1ab9408e227f27c6082cfc2580e74fe1788365f97b1&scene=21#wechat_redirect)
* [HBase生产环境优化不完全指南](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487391&idx=2&sn=420d44df8c68349212df31fa4471576d&chksm=fd3d490aca4ac01c7536e0f628cd82be97d5d9dba1b9619a0c637795731813cc36acb8c52079&scene=21#wechat_redirect)
* [Hive SQL50道练习题](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487395&idx=1&sn=7fdb8625da8cccd43ceb07ea6554fe92&chksm=fd3d4936ca4ac020bdc48bdb7cfb55564e8d2cd99c39e785ce0d7eb86cb75b347bcf5914d0c9&scene=21#wechat_redirect)
* [Hive on Spark参数调优小结](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487410&idx=2&sn=c6be05a74ac472b962583a2228996316&chksm=fd3d4927ca4ac031a2b1976393d12c6a78ea81817aded17ee2fd57dcc6f22cd1c4272d3f0e1b&scene=21#wechat_redirect)
* [Hadoop(CDH)分布式环境搭建(简单易懂,绝对有效)](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487523&idx=1&sn=6719f902998dd24a0904444d49f72ec7&chksm=fd3d56b6ca4adfa0a3279696b65467093e4035e94df802e7d44c64853634c1a387c3fc99e989&scene=21#wechat_redirect)
* [ConcurrentHashMap锁机制进化的考量](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487527&idx=1&sn=e9dee7c31da53fd9a27acc49e41e4753&chksm=fd3d56b2ca4adfa49fdc66d49c746359632b85714a1edd4986c3028fbc7a0228650fa4c41ee8&scene=21#wechat_redirect)
* [HBASE列族不能太多的真相](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492803&idx=2&sn=b7a718dea53b5912149fe61f9f396fe1&chksm=fd3ea256ca492b40cf52f812cef0c437610350777d17022edf264e2b7ee6f7b30164e9f84039&scene=21#wechat_redirect)
* [基于ClickHouse的用户行为分析实践](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492803&idx=1&sn=04283aa0913674e158a8519120fc14cd&chksm=fd3ea256ca492b40f60ac90eee4d23f9c2b8156a9c83439297317efcb826b32779b762f554e4&scene=21#wechat_redirect)
* [HBase的系统架构全视角解读](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492822&idx=1&sn=175f6febc758a56c7aa680f5b953de08&chksm=fd3ea243ca492b55e723f64a81f8b334b97cbe3c80cd72da0e5695380f33f25c74b9b7996978&scene=21#wechat_redirect)
* [Kylin Cube构建原理+调优](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492864&idx=1&sn=597c5e992305482d2d49f801b10e8efd&chksm=fd3ea395ca492a839c9c53c6fdd5056d9cec123ebc0c7acc578569480140c3cd6bb7762ebf43&scene=21#wechat_redirect)
* [Hadoop支持Lzo压缩配置及案例](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492923&idx=1&sn=6b5a0594031c40ff0e543aa6d9740cc6&chksm=fd3ea3aeca492ab887f8b95a711d3c1bbbd4c2438d941efe42329d713569bb42482342dd92ba&scene=21#wechat_redirect)
* [Apache Hudi 架构设计和基本概念](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492943&idx=1&sn=fcd6d6e67b7ff0792976b063374306ac&chksm=fd3ea3daca492acc631a0db5d99c82603d7935888e13ea086a5825651132396f3d48db20c5b5&scene=21#wechat_redirect)
* [HiveSQL常用优化方法全面总结](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492943&idx=1&sn=fcd6d6e67b7ff0792976b063374306ac&chksm=fd3ea3daca492acc631a0db5d99c82603d7935888e13ea086a5825651132396f3d48db20c5b5&scene=21#wechat_redirect)
* [MapReduce性能优化大纲](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247488182&idx=2&sn=b2b8e50829f5706295fa5b5201743621&chksm=fd3d5423ca4add3516c141897124468ce51bfa86f599dd499e728bfcf18d8928fafaabac4d4f&scene=21#wechat_redirect)
* [从NoSQL运动谈分布式系统的CAP、BASE理论](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487582&idx=1&sn=6882c98d94b6e3a52132b30ff4e404ef&chksm=fd3d56cbca4adfddcbaf2ded2f540a2b32cf4e0cace93a4a4f6a2b00a0482f80deaa55f6e8f1&scene=21#wechat_redirect)
* [HDFS读写数据过程原理分析](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487609&idx=2&sn=5b5fc64b4c2628f77cfd902228ec78cf&chksm=fd3d56ecca4adffa3d66802fcb091468f96a51730b4a095b6de32842d7325165f797da8b1bf7&scene=21#wechat_redirect)
* [数据中台建设五步法](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487665&idx=1&sn=5a0a007f58a7c9a6872a4b7c65683f08&chksm=fd3d5624ca4adf32159835c42768938c2e455c8354d1ca1bef731ab320a378eb8af002e3cd96&scene=21#wechat_redirect)
* [Step by Step 实现基于 Cloudera 5.8.2 的企业级安全大数据平台 - Kerberos的整合](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247488083&idx=2&sn=30a40939a9ba29a2c55a1c973433e8f4&chksm=fd3d54c6ca4addd0224cd352085966896d45d02e110db5c90142947053f3a7af21b7c1b6673e&scene=21#wechat_redirect)
* [一篇文章全面了解监控知识体系](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247488492&idx=1&sn=6059d4f245afad8230ee71a96e8a3d4e&chksm=fd3d5579ca4adc6f9b66549757ea8e3ac6bacdcdd17185ac87073f6fbbc88827d0b789606aa3&scene=21#wechat_redirect)
* [Sqoop 使用shell命令的各种参数的配置及使用方法](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491557&idx=1&sn=3641e76946290e82f393f2660c07c8dd&chksm=fd3d5970ca4ad0666a3b2449f3d0444177aec2afdc7fd15998a8f006c99d5bb41e28f916fab7&scene=21#wechat_redirect)
* [Hive小知识之分桶抽样](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491603&idx=2&sn=339b7498e6c632cc5663625ef13c5dc1&chksm=fd3ea686ca492f90e7085a444431dea40c1e7f30884b5df283b695f30d4f591a9dfaec32c7b5&scene=21#wechat_redirect)
* [数据仓库和数据集市建模体系化总结](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491647&idx=1&sn=51dd15b000803c1ab1d4cea67fc02e81&chksm=fd3ea6aaca492fbc6fc2fe34f39cb31a585b3c53b02ca0304e7edf63680521ba7036a0bbfdce&scene=21#wechat_redirect)
* [Phoenix(云HBase SQL)核心功能原理及应用场景介绍](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491759&idx=1&sn=2d0b25585442176b0ad74235fe799611&chksm=fd3ea63aca492f2cf8f482a0f5bc88ed3034124d90d0234991a67a9ab1dd4e0086c60e0a83e8&scene=21#wechat_redirect)
* [基于实际业务场景下的Flume部署](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492053&idx=1&sn=594dd94d1bf6d692680d77107d83794a&chksm=fd3ea740ca492e56c2a838e8f6656dc7c9dea787add1c801b6ed379554162c696fcc5e625c1a&scene=21#wechat_redirect)
* [斗转星移 | 三万字总结Kafka各个版本差异](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492958&idx=1&sn=1ef59283fddfcffa14425d8d51af6f81&chksm=fd3ea3cbca492add3660dc37a37b458694cd285ae061884bb8aecbba2725fab3baee89608eba&scene=21#wechat_redirect)
* [Spark SQL自定义函数UDF、UDAF聚合函数以及开窗函数的使用](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247490378&idx=1&sn=f45be2b9cb97e896afb1ded5d6c37fcc&chksm=fd3d5ddfca4ad4c90ae2a6c75eee49d5c894d44c3603a94bf117807091467162f2b843e0c43e&scene=21#wechat_redirect)
* [SparkSQL用UDAF实现Bitmap函数](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247490581&idx=2&sn=d819fc83a5f553687b5c79bd64cc9843&chksm=fd3d5a80ca4ad39607611d9dc07613b1a5da27570f7b8331e17400a6bf6a4285fbe9ce942a42&scene=21#wechat_redirect)
* [一文了解Kafka核心概念和角色](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247490065&idx=1&sn=e8e9539eecc25ef7a5f8703fac277be3&chksm=fd3d5c84ca4ad5925f407f84546146a4e8f75d97b58adb98f9dca32ab8aa90b14e9d2211e0c9&scene=21#wechat_redirect)
* [Apache Spark 内存管理详解](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247490160&idx=1&sn=a0bf8702286a348d6e486be62d07751e&chksm=fd3d5ce5ca4ad5f34f643f46662ef37067c1199c28ca4b15ec958bb51b68edc992d3e99ff922&scene=21#wechat_redirect)
* [经典限流方法——漏桶、令牌桶与Guava RateLimiter的实现](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247490460&idx=3&sn=b2d23ba7b20ea8cf2d155999ec879b75&chksm=fd3d5d09ca4ad41f1f7328d91bac921168207e5a5e1f4af78b9a392f626ab26c72998cd5457e&scene=21#wechat_redirect)
* [ZooKeeper在HBase集群中的作用](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247490460&idx=2&sn=8491948f284d3bc53a4c3be7f285aae7&chksm=fd3d5d09ca4ad41f797ca0b584b83668c78cf5f449207b42dcdb49e90dcd558b759b249be498&scene=21#wechat_redirect)
* [从B+树到LSM树,及LSM树在HBase中的应用](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247490581&idx=3&sn=cbf752e85e3cf865e5a673965f020bbd&chksm=fd3d5a80ca4ad39616be9a8d0c6daddfe72f69993a43d322fcdd1966f566bae38b836753aa95&scene=21#wechat_redirect)
* [Hadoop Namenode元数据持久化机制与SecondaryNamenode的作用详解](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247490581&idx=1&sn=57492e477ea3360e53eac23a20c3b0ad&chksm=fd3d5a80ca4ad396b7b11cf00ca2a87ff1097692c527208ad6ada9151da88edd6542161c4bc6&scene=21#wechat_redirect)
* [干掉ELK | 使用Prometheus+Grafana搭建监控平台](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247490610&idx=1&sn=f435639d0392156082a997411bf42f6c&chksm=fd3d5aa7ca4ad3b1a239b8ac29f156ce7b6198cee7ec613e44dc27579b8bcf27fede954a2883&scene=21#wechat_redirect)
* [盘点:SQL on Hadoop中用到的主要技术](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247488854&idx=1&sn=5b1419b1813e4695c0750445078012d4&chksm=fd3d53c3ca4adad5e0852f1d44e9fb0592bf82e7c635a2900dff196bb5172443e9ce5dccd07b&scene=21#wechat_redirect)
* [用HiveSQL计算连续天数问题的方法](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247489143&idx=2&sn=c25af83fc4bdeb42fb7d4236669489e4&chksm=fd3d50e2ca4ad9f474758e8e5b004a531fded129a5ed5bc7bc549e50f3e72872c4048628d288&scene=21#wechat_redirect)
* [浅谈Linux cgroup机制与YARN的CPU资源隔离](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247489342&idx=2&sn=5354a5fcb497f9a2294b61c707838411&chksm=fd3d51abca4ad8bde70e5980531cf3e63ba314ba000df4df7e10420a093c3f1f80063094c854&scene=21#wechat_redirect)
* [京东JDHBase异地多活实践](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247489415&idx=2&sn=303860edceafc625b97fdeae6e8e7acf&chksm=fd3d5112ca4ad804637f73ccd9d1177d5bb3521f0cda68cbabccd7fe2ab7047f4fd05563e6c2&scene=21#wechat_redirect)
* [Kafka的分区数是不是越多越好?](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247489433&idx=1&sn=cfe0e855bf0addfd06599cc4c4a6f61c&chksm=fd3d510cca4ad81a87c0a2f81e74eabbf55928e38ec605e8870ea96dc0bc5926302de51200ae&scene=21#wechat_redirect)
* [一文俯瞰Elasticsearch核心原理](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247490187&idx=2&sn=c8cb0f332019d1bf73ac6d92ed913dbc&chksm=fd3d5c1eca4ad5089cb8949fd1ee6e51c7d8622cf691edfc759836591435f9246a0b729a6b2b&scene=21#wechat_redirect)
* [不可不说的Java"锁"事](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247490378&idx=2&sn=e3f62731d911f230463581dae96f7c49&chksm=fd3d5ddfca4ad4c9847a8effc42e9a63e019e2e25712706bc0c3816237bb9bd76fec411bdf2e&scene=21#wechat_redirect)

**数据仓库篇**

* [一文了解数据库和数据仓库](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486890&idx=1&sn=004a86c329439558a4f234fa21744173&chksm=fd3d4b3fca4ac229c6af919153949d4bc60c603635fc432d5695c4ee14c61bd6408a6188ef22&scene=21#wechat_redirect)
* [数据仓库系统的实现与使用(含OLAP重点讲解)](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486892&idx=2&sn=8e9a2c30a596cde9713601a22875af63&chksm=fd3d4b39ca4ac22f082760e7d4c8e0c4c7979e7431f9063215a8669cc5724adcc843df01b1e0&scene=21#wechat_redirect)
* [Data Lake 三剑客—Delta、Hudi、Iceberg 对比分析](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487196&idx=1&sn=313ceebfbf288e309f8e5da00451af5b&chksm=fd3d4849ca4ac15f800f6eebc95e597e47cafad3bfe1e0413f83db88ea7530448ad55b5d7680&scene=21#wechat_redirect)
* [数据也有温度?Elasticsearch 5.x 版本中的冷热数据架构](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247489275&idx=2&sn=1c9783692bc3f9cceb3f403702057744&chksm=fd3d506eca4ad978f6d28a3f933b68367d80566e93a391617996f42c60d1398912c0a73908a7&scene=21#wechat_redirect)
* [数据冷热分离技术](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247489301&idx=1&sn=aa613e6311ae45487286c24f2134aa8d&chksm=fd3d5180ca4ad89606221360a1e7e35f5259d5a2eea4c1b4cf3e0fd0cee2d74b6579ea043546&scene=21#wechat_redirect)
* [冷热数据分离 | Alluxio元数据管理策略](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247490610&idx=2&sn=c1751ded1965049fabd61df82484c4b3&chksm=fd3d5aa7ca4ad3b140d5ce00fa06b75e3577cb5bf9e3a944f25ca49c74e03072a03562d635cc&scene=21#wechat_redirect)
* [数据之眼 | 数据探查服务的设计](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247490691&idx=2&sn=2c8059004ba6086edb34e901efa5fdc0&chksm=fd3d5a16ca4ad30038888daa1ebf8f65be386cd3ba7ae2ba4fadc1f1eed24a9edc863ef10dc1&scene=21#wechat_redirect)
* [元数据存储系统管理演变升级](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247490691&idx=1&sn=e77e2a1058cf3b9e7015a6451331e406&chksm=fd3d5a16ca4ad300b30467cf550015b6c2474ff18f5093dc1f2a954f1115dc4ee3fc33aeda99&scene=21#wechat_redirect)
* [数据湖 | 一文读懂Data Lake的概念、特征、架构与案例](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492241&idx=2&sn=a1a1c35ac0a71d648ac773b700c0478b&chksm=fd3ea404ca492d128493132600f9cebb2b34a3dc395fe4eb6db02d9a018ace5adcd171f6941a&scene=21#wechat_redirect)
* [用户行为数据采集系统](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247488529&idx=1&sn=bc81e0fb68c13626c9202f1942fb3a80&chksm=fd3d5284ca4adb922308718382820ac48e4dcdf819ef8d2b672ee919cf6b67c4aa7a70053539&scene=21#wechat_redirect)
* [创业公司数据仓库的建设](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247488580&idx=3&sn=6152e6acfe7b600c8d8e4a2138c37aa9&chksm=fd3d52d1ca4adbc72ea737f35c9512fa8a3ef9e9733c14c57c1b269a6798e5a04dd285912d77&scene=21#wechat_redirect)
* [Kylin使用Spark构建Cube](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487697&idx=1&sn=b085af6dd063fa66d017ae8f96bc94af&chksm=fd3d5644ca4adf52f0bc616917ff1a817328ed1bb39a01a04fc839164a2705cf92e0ddcad6dd&scene=21#wechat_redirect)
* [实时统计分析系统-Apache Druid](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247488311&idx=1&sn=f6a8c052d49c6abb0fc7956a2066f4c2&chksm=fd3d55a2ca4adcb4d5f1f5250013fb38aae2e75510f95364cbb58d4ea4f985b0dac7869b5fba&scene=21#wechat_redirect)
* [Elasticsearch索引和检索优化与压测监控总结](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247489054&idx=2&sn=2ec1061def576779947d31eee1c5ce74&chksm=fd3d508bca4ad99d7ea476c9ea70da6df33e3099a55faace760233e6833e8086ac26036420a9&scene=21#wechat_redirect)

**面试题篇**

* [我们常说的海量小文件的根源是什么?](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486931&idx=2&sn=1c5987f3bad7a805895484ebfd683e11&chksm=fd3d4b46ca4ac250d31502a76f0cfd02ebea29d1161b9e9faec23ea6aa8947d0dd7d4268427f&scene=21#wechat_redirect)
* [如果你在准备面试,好好看看这130道题](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486986&idx=1&sn=422d1a3c11c72ff97b32cc01142839f4&chksm=fd3d489fca4ac1895242ab94b932b12c65dc57b5f3a16acc7084dc8a189e9026290245a64c4f&scene=21#wechat_redirect)
* [你可能需要知道的Kafka面试题与部分答案整理](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486992&idx=1&sn=8950101b408b0e02b7957f6d74c9cd74&chksm=fd3d4885ca4ac193a28d3f10f9d25c39ec57c294d51961cac69fdf8bacbcac46a989342d2136&scene=21#wechat_redirect)
* [28道关于ZooKeeper的面试题](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487021&idx=2&sn=bbfa3fb95e4fa3d0b6ae90a8933119ed&chksm=fd3d48b8ca4ac1ae0542948f3abaa00e69ca7b20806699c7a8dec7a13ddfb79a981ca3b45c90&scene=21#wechat_redirect)
* [【数据白皮书重磅分享】推荐|埋点|用研|标签](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487177&idx=2&sn=dea2d55951494a5bf4438ec5a888bdd7&chksm=fd3d485cca4ac14ac7ae87e6a8bf11d988e068c3548762cab91f11f45855431e8f759e131a76&scene=21#wechat_redirect)
* [一份优秀的简历该长成什么样](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487297&idx=1&sn=9b007050efaf50f17e3f46915c70f4ea&chksm=fd3d49d4ca4ac0c2b0e25aae9e4aba71c9cc18e6f388d1dd6615791a7d2432445c32d0039eaa&scene=21#wechat_redirect)
* [1万2千字长文助力春招 | Netty面试篇](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487309&idx=2&sn=a33cc56116cb7891145394f9c5353be3&chksm=fd3d49d8ca4ac0ce28cd0a543109d60db47e8be60d63c6951fa30e4ab4c812f9025286fa5c37&scene=21#wechat_redirect)
* [消息队列面面观](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487314&idx=1&sn=f832be362c7c2ea877c48d47923434c5&chksm=fd3d49c7ca4ac0d1444e8352b35ea3a706cdf496add814cf0fe3aa4844f4c25dc2f4d6c269f1&scene=21#wechat_redirect)
* [关于技术面试的一点点体会](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487326&idx=1&sn=1ba5867700bdfb5ad7f808a245ae3b1f&chksm=fd3d49cbca4ac0dd7d8401e1226ff054441709af6f0cacc3ceeab7354ac6f7452592239efb06&scene=21#wechat_redirect)
* [早点建立自己的知识体系](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487334&idx=1&sn=ce3453c1b63d6932438c8412fc5454d3&chksm=fd3d49f3ca4ac0e5ab43ecb1cc9be9320232bb0234b3c43b0903eca9e225c338b5c6990923cc&scene=21#wechat_redirect)
* [Filter(过滤)|Project(映射)|Pushdowns(谓词下推)](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487414&idx=2&sn=2afda2b854a4f875159a43903bb7b337&chksm=fd3d4923ca4ac035b7627c9a32b7fa003f817176d959aae98770416cf3a55384efdde060b03f&scene=21#wechat_redirect)
* [阅读源码|Spark 与 Flink 的 RPC 实现](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487449&idx=3&sn=5b3d8564d1c84b14f9d941c96d8f6a27&chksm=fd3d494cca4ac05a83b67a04ac5497bb13ecf79a3a5044083d0bf9e93d8bdacd357fa1ffce1e&scene=21#wechat_redirect)
* [三万六千字通关MySQL面试](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487449&idx=1&sn=2cc97bf6669416267c25a7a46192b706&chksm=fd3d494cca4ac05a25e3e91a52d105bba187896cdf322ebbf82b7ce88db8f8a3ed25070cad77&scene=21#wechat_redirect)
* [深入理解CAP理论和适用场景](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487458&idx=1&sn=11e0b029998ecdeccda8e18c1e43f980&chksm=fd3d4977ca4ac0613b5a32d6bed036231239f40021afa15a0b45cab6426a9afdcca7e9d7844c&scene=21#wechat_redirect)
* [HDFS的SecondaryNameNode作用,你别答错了](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487464&idx=1&sn=c7358495832da5e4bf4807691be6ef29&chksm=fd3d497dca4ac06b1caa9b6a0746d01dd4dd44203d57e929193fa79053232c13b1668615f8e7&scene=21#wechat_redirect)
* [Kafka三种可视化监控管理工具Monitor/Manager/Eagle](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491086&idx=1&sn=f2f9bd1a55064ec7abf63666bf54be61&chksm=fd3d589bca4ad18d6e3f353c1d63cee77650d176b582c3f84b7ead45170e0c5523a0a472b1eb&scene=21#wechat_redirect)
* [Kafka体系架构详细分解](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491138&idx=1&sn=f42cac6584ca4c3b74f232e897b33206&chksm=fd3d58d7ca4ad1c19e851a02b88e6e8350e29e8a57f71acc5def38bc14ed5a0f5110122f9927&scene=21#wechat_redirect)
* [Kafka笔记—可靠性、幂等性和事务](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491138&idx=2&sn=a67e3044fd368e61a0290679e2765661&chksm=fd3d58d7ca4ad1c1cf2c6e79747b77005dd811e6d385c7b67a3970e6de74fbd1f307f06a658a&scene=21#wechat_redirect)
* [Kafka体系架构详细分解](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491138&idx=1&sn=f42cac6584ca4c3b74f232e897b33206&chksm=fd3d58d7ca4ad1c19e851a02b88e6e8350e29e8a57f71acc5def38bc14ed5a0f5110122f9927&scene=21#wechat_redirect)
* [Kafka面试题系列(进阶篇)](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491176&idx=1&sn=81b44db6db3cb014dc9002ba840ad283&chksm=fd3d58fdca4ad1ebdb2aa17b6ffce1794daef5def22cdc84a7f280a107cba966658293cb14b6&scene=21#wechat_redirect)
* [Kafka面试题系列(基础篇)](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491156&idx=2&sn=e18e2ae54f08ee34749fdb314652c2fe&chksm=fd3d58c1ca4ad1d777b950bf01f33f36dff4e8bc55779f3a1a50019136fd6d345148188e41db&scene=21#wechat_redirect)
* [面试知识点 | Kafka的数据存储与索引设计](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492699&idx=2&sn=776b93fb683a004d58d432d9b6b551b4&chksm=fd3ea2ceca492bd8f72e397ec63b331aa129f74f803be9f922633df01fdb552adb75371d3dad&scene=21#wechat_redirect)
* [面试必考点:HBase Compaction机制](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247489094&idx=1&sn=76958401eb4fab0a69447b37bda2b4c9&chksm=fd3d50d3ca4ad9c5330d003ee5c8265e8a49f04bbccd9a9513cf2ec2f983b6d45842dae41c30&scene=21#wechat_redirect)
* [ZooKeeper需要关注的点](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247488393&idx=1&sn=f820923ae3e782b509911f787a455a91&chksm=fd3d551cca4adc0a1caa74c53e628b5cf0a5ebe5a0d5584f899c550068eab70b483e5b796f6d&scene=21#wechat_redirect)
* [MySQL中InnoDB及索引深入剖析](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491256&idx=2&sn=34535df6790951e7de0e08bdfceed108&chksm=fd3d582dca4ad13bc76b32b252995a979cd9ae888d510e2201e6c1a2ac169c1cccbf30009ab4&scene=21#wechat_redirect)
* [Kafka面试题系列(进阶篇2)](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491256&idx=1&sn=0bd327392de5e2241b550a12eda43f29&chksm=fd3d582dca4ad13bf53916a5e693a93e2c526b63be165f06732f04940e1d8517803c1e81a322&scene=21#wechat_redirect)
* [MySql的Binlog日志工具分析:Canal、Maxwell、Databus、DTS](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491276&idx=1&sn=133e0d6bcf897ae288a83f1ac6e7456c&chksm=fd3d5859ca4ad14f7d5e9898a335e78e1a9cf163303818e9a61eeb5785b4a02a12c5b86e5f1c&scene=21#wechat_redirect)
* [Redis中的管道Pipeline操作](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491365&idx=3&sn=a1efd7804ceeeb5b5980ade4a7c75d90&chksm=fd3d59b0ca4ad0a6eedfb7333b30bf66f2f3da7a3adab8242a5c7ea949803d101b4f386b7b95&scene=21#wechat_redirect)
* [查看YARN任务日志的几种方式](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491365&idx=2&sn=fdb72a0d6ac9b5f4723f8362e42acd9b&chksm=fd3d59b0ca4ad0a6ccf7e8ebd1ce5d931962575adba97281db5d6f5becc8f98781c543a49d4d&scene=21#wechat_redirect)
* [Yarn 使用 Cgroup 实现任务资源限制](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491453&idx=2&sn=df9b9bf0127e3c4648eec90ef3499306&chksm=fd3d59e8ca4ad0fe12fa54cf22baea1223bfc944436e98ec89b7b61b75d1a0ed05dbf99f31ac&scene=21#wechat_redirect)
* [分析和定位线上作业 OOM 问题利器-MAT](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491453&idx=1&sn=49e762ec986faf408c03b4bc52634872&chksm=fd3d59e8ca4ad0fe38d0827899e82865b9c913524b3c71913ffc596aad0f9292239a29f704b2&scene=21#wechat_redirect)
* [浅谈ZooKeeper中Kafka相关信息的存储](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487838&idx=1&sn=a82d191e12dc2590fc376258965e34f6&chksm=fd3d57cbca4adedd518eb3944835a2629e9e9cb77ec5fdaf127d87b6a7bb847780b700115020&scene=21#wechat_redirect)
* [JVM架构体系与GC命令小总结](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247488882&idx=1&sn=62cc37f20fdce1193f2400507b7d478a&chksm=fd3d53e7ca4adaf181d221711806599b49e165d3b73b1913fb703ff24aa23e95526413549897&scene=21#wechat_redirect)

**其他**

* [腾讯如何用Elasticsearch挖掘万亿数据价值?](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487021&idx=1&sn=b9c9feaceae4d6315b801c021964c9cf&chksm=fd3d48b8ca4ac1aea20937d29058ab1ebabd480bf15e37a41dec52d4c7d712342be7f963f33e&scene=21#wechat_redirect)
* [Apache Beam 大数据处理一站式分析](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487555&idx=1&sn=4f46c221e6830c25371ed239bb7bb12c&chksm=fd3d56d6ca4adfc08dc936586a9d82d0b44e8b584ec21dd83f081d5c7a40a1ef2d4c52733384&scene=21#wechat_redirect)
* [Apache Hudi 架构设计和基本概念](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492943&idx=1&sn=fcd6d6e67b7ff0792976b063374306ac&chksm=fd3ea3daca492acc631a0db5d99c82603d7935888e13ea086a5825651132396f3d48db20c5b5&scene=21#wechat_redirect)
* [Apache Hudi 架构设计和基本概念](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492943&idx=1&sn=fcd6d6e67b7ff0792976b063374306ac&chksm=fd3ea3daca492acc631a0db5d99c82603d7935888e13ea086a5825651132396f3d48db20c5b5&scene=21#wechat_redirect)
* [MySQL8.0发布,你熟悉又陌生的Hash Join?](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486990&idx=1&sn=9c8a238ca63dc52af251bbf03aafc75c&chksm=fd3d489bca4ac18d5fa9130df79ae4f7ccc9ad34ab99b91cc8fb80b9a2542d1987f6447d3e39&scene=21#wechat_redirect)
* [转载一个看不懂的文章:F1 Query](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487002&idx=1&sn=97bd47e0f4d3ee666c1a624872a358f0&chksm=fd3d488fca4ac199fbf0352b1df513d968d000a1ce4c38b3c16b480776da2827857f4a750215&scene=21#wechat_redirect)
* [Apache Hudi | 统一批和近实时分析的增量处理框架](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487043&idx=1&sn=91fcb6e573404cac6a89a60603cd138d&chksm=fd3d48d6ca4ac1c0d325a63bb25eba3df146e423d6ff484869cb76dff970c8dd21cc1f047a6f&scene=21#wechat_redirect)
* [寻找5亿次访问中,访问次数最多的人](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487056&idx=1&sn=27f648d927c248204f449f33c8fbc8bc&chksm=fd3d48c5ca4ac1d36785402c263924006ea6c4232565dca1dce490763bae5acc455aec8d537e&scene=21#wechat_redirect)
* [聊聊阿里巴巴的全链路压测](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487066&idx=1&sn=49a9a99927bb4223bb40080106cf4a91&chksm=fd3d48cfca4ac1d99976d5d7cc8ba6426b02bedb61a29657d262830cb334f02aed3834d74e49&scene=21#wechat_redirect)
* [年轻人你渴望力量吗 | 我读过的一些书推荐](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487080&idx=1&sn=71145f8a66f3d16d1e29526f81eeeb5f&chksm=fd3d48fdca4ac1eb7e00744bac1b03ac3e0e7a342675c3dc7234f182327cd2a4904a3e33d7e4&scene=21#wechat_redirect)
* [数据算法之反转排序 | 寻找相邻单词的数量](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487116&idx=1&sn=7ea20fdf1134eaceab3e36f03cb061c8&chksm=fd3d4819ca4ac10fc438c8efff61f744b2e1ebfb91c85c365d5f0b43e029ae5068d319ace90d&scene=21#wechat_redirect)
* [MySQL Binlog同步HDFS的方案](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487212&idx=1&sn=a42d9dba3c7a34df7134292a26793c3f&chksm=fd3d4879ca4ac16f9777aceb3ee3d09e271dab155a7ba4d27e5a07af4278b406bd10fceefc44&scene=21#wechat_redirect)
* [循环查询数据的性能问题及优化](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247488638&idx=2&sn=8850f47e776a4bd9e6945c142dd4f74d&chksm=fd3d52ebca4adbfde14a5d24300338d1db4b231203f778ed242855cbbe95d17b5ffcfea735ee&scene=21#wechat_redirect)
* [推荐系统 embedding 技术实践总结](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247488827&idx=1&sn=b8186e482bdf34ce24cc161df46f290c&chksm=fd3d53aeca4adab8810b5f96f2c08b996ca524ca7978ada3d529f5e617c73d89a21c7bcbc544&scene=21#wechat_redirect)
* [Prometheus+Clickhouse实现业务告警](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247489470&idx=1&sn=734886246f2413f72d605cad726b59f2&chksm=fd3d512bca4ad83d516e5d6f0dd4c21c83867de758238277a162fecad0ec21a9eff9a3a59c23&scene=21#wechat_redirect)
* [亿级用户的分布式数据存储解决方案](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247489519&idx=2&sn=5c1b1a4738e2de958920d6f6adc99e95&chksm=fd3d517aca4ad86cc64260dcd41cd939cde52b6c3ebf0fa5322023ef6396ecf8cb24f0c392b4&scene=21#wechat_redirect)
* [基于大数据的用户画像构建小百科全书](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247489571&idx=1&sn=56a634d66fb689907b4ab51ed2d3707a&chksm=fd3d5eb6ca4ad7a0cc5fa4f895354e58ed7f2cb8558369ed6149560a5e7fca97b8545036fe87&scene=21#wechat_redirect)
* [魅族持续交付平台建设实践](http://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247489915&idx=1&sn=4c926ceac7fe3f0fd55eafa07ce0b223&chksm=fd3d5feeca4ad6f8f3583a541c306575a80d1f5e846e8b8e9561cb938d86e0c3bda460eeb7c9&scene=21#wechat_redirect)

### 八、2021精品文章合集

#### 2021精品文章合集

**汇总部分**

* [八千里路云和月|从零到大数据专家学习路径指南](https://mp.weixin.qq.com/s/MAwD-UJgvIa_dZjmaykqrQ)
* [我们在学习Flink的时候,到底在学习什么?](https://mp.weixin.qq.com/s/xh4SEX9t-fRVdoiAl0KKSQ)
* [我们在学习Spark的时候,到底在学习什么?](https://mp.weixin.qq.com/s/pN0AqNJuFnlLjNW2OonGtA)
* [一线互联网公司面试进阶全攻略](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247499539&idx=1&sn=26b8569ae4e3d5c6c1a1b8a98807f625&scene=21#wechat_redirect)
* [【大数据成神之路】第一版更新完毕](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247499250&idx=1&sn=ca900c226738048c28b56ae5c7b8f723&chksm=fd3ebb67ca4932712f602302ddc72aa616ff462a9c11605d6becbeaf347b5e7b1b2be2172cea&token=1917475764&lang=zh_CN&scene=21#wechat_redirect)
* [关于技术面试的一点点体会](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487326&idx=1&sn=1ba5867700bdfb5ad7f808a245ae3b1f&scene=21#wechat_redirect)
* [早点建立自己的知识体系](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487334&idx=1&sn=ce3453c1b63d6932438c8412fc5454d3&scene=21#wechat_redirect)

**专题部分**

**Hadoop系列**

* [最新Hive/Hadoop高频面试点小集合](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247501553&idx=1&sn=487e44e8fb362e97a0d418dff62de8bb&scene=21#wechat_redirect)
* [Hadoop所支持的几种压缩格式](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247484567&idx=1&sn=36e47d34cbdd7b5f0bf93f3bb5338c90&scene=21#wechat_redirect)
* [【大数据面试之对线面试官】MapReduce/HDFS/YARN面试题70连击](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247497849&idx=1&sn=a82f9787765436b5514fe77aa6aa020f&scene=21#wechat_redirect)
* [HDFS的SecondaryNameNode作用,你别答错了](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487464&idx=1&sn=c7358495832da5e4bf4807691be6ef29&scene=21#wechat_redirect)
* [Yarn 使用 Cgroup 实现任务资源限制](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491453&idx=2&sn=df9b9bf0127e3c4648eec90ef3499306&scene=21#wechat_redirect)
* [查看YARN任务日志的几种方式](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491365&idx=2&sn=fdb72a0d6ac9b5f4723f8362e42acd9b&scene=21#wechat_redirect)
* [大数据哔哔集20210106 - Hadoop3.0有哪些新特性](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247498514&idx=2&sn=76808f8610e57360aced9c08b6e93c11&chksm=fd3ebd87ca49349155ddc062a4d812980f4b0075c65d448200136acdc903d86f2fc078972d72&token=1917475764&lang=zh_CN&scene=21#wechat_redirect)

**Hive系列**

* [Hive性能调优 | 数据倾斜](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247493923&idx=1&sn=bdb745dc04af43e1dc9e476cd1d67a86&scene=21#wechat_redirect)
* [面试必备技能-HiveSQL优化](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247484617&idx=1&sn=cd734b7b04e0278f4445a09204d6a2bf&scene=21#wechat_redirect)
* [Hive常用参数调优十二板斧](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247494838&idx=1&sn=5f2aeba4509603bf50742987aff54469&chksm=fd3eaa23ca492335ae6e436806fd8de3da919766c80d661bb468c17a61559d034a2a16b2ec93&token=1917475764&lang=zh_CN&scene=21#wechat_redirect)

**HBase**

* [面试必考点:HBase Compaction机制](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247489094&idx=1&sn=76958401eb4fab0a69447b37bda2b4c9&scene=21#wechat_redirect)
* [Hbase性能优化百科全书](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247497379&idx=1&sn=f712373bc5d4826f255f8b47c4e7ff71&chksm=fd3eb036ca493920840f1c7808124b07e838c17b33b7f0188fa78fbb9877fdd8cade45563ec8&token=1917475764&lang=zh_CN&scene=21#wechat_redirect)

**ES等**

* [触类旁通Elasticearch之吊打同行系列:原理篇](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247494711&idx=1&sn=4846c2665cce5eb20e4a98af0b1b6ac6&chksm=fd3eaaa2ca4923b4619a40d63a89b0818b22df8f98e68d4382ac77a56f921a93f14ef1aeeaf2&token=1917475764&lang=zh_CN&scene=21#wechat_redirect)
* [触类旁通ElasticSearch之吊打同行系列:操作篇](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247494765&idx=1&sn=600f78c4cdf5875ed9524e737c55a8fb&chksm=fd3eaaf8ca4923ee3aa3907538f3b7fe06eca67b49003a6002f5d65e80800645914497d9159b&token=1917475764&lang=zh_CN&scene=21#wechat_redirect)
* [触类旁通Elasticearch之吊打同行系列:搜索篇](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247494808&idx=1&sn=67aa00f4fff4486825bea7a5bd3956c5&chksm=fd3eaa0dca49231b88c5a1bbb083bd9ea89aa20b1d1d4edb0f409ad4a192fcd6d87eaafba346&token=1917475764&lang=zh_CN&scene=21#wechat_redirect)

**Kafka/消息队列**

* [面试知识点 | Kafka的数据存储与索引设计](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492699&idx=2&sn=776b93fb683a004d58d432d9b6b551b4&scene=21#wechat_redirect)
* [Kafka面试题系列(基础篇)](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491156&idx=2&sn=e18e2ae54f08ee34749fdb314652c2fe&scene=21#wechat_redirect)
* [Kafka面试题系列(进阶篇)](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491176&idx=1&sn=81b44db6db3cb014dc9002ba840ad283&scene=21#wechat_redirect)
* [Kafka面试题系列(进阶篇2)](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491256&idx=1&sn=0bd327392de5e2241b550a12eda43f29&scene=21#wechat_redirect)
* [关于MQ面试的几件小事 | 消息队列的用途、优缺点、技术选型](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247485312&idx=2&sn=97c7d499760154a98cd1bfa416697a42&scene=21#wechat_redirect)
* [关于MQ面试的几件小事 | 如何保证消息不丢失](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247485321&idx=3&sn=2e6769937df43c3555213bb17d8605fe&scene=21#wechat_redirect)
* [关于MQ面试的几件小事 | 如何保证消息按顺序执行](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247485321&idx=2&sn=73e56ed1f46fd3337e13ee359631adc9&scene=21#wechat_redirect)
* [一道真实的阿里面试题 | 如何保证消息队列的高可用](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247485184&idx=1&sn=19680883eb2b9739f834d743e5304179&scene=21#wechat_redirect)
* [你可能需要知道的Kafka面试题与部分答案整理](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486992&idx=1&sn=8950101b408b0e02b7957f6d74c9cd74&scene=21#wechat_redirect)
* [消息队列面面观](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487314&idx=1&sn=f832be362c7c2ea877c48d47923434c5&scene=21#wechat_redirect)
* [Kafka三种可视化监控管理工具Monitor/Manager/Eagle](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491086&idx=1&sn=f2f9bd1a55064ec7abf63666bf54be61&scene=21#wechat_redirect)
* [Kafka体系架构详细分解](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491138&idx=1&sn=f42cac6584ca4c3b74f232e897b33206&scene=21#wechat_redirect)
* [Kafka笔记—可靠性、幂等性和事务](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491138&idx=2&sn=a67e3044fd368e61a0290679e2765661&scene=21#wechat_redirect)
* [Kafka体系架构详细分解](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491138&idx=1&sn=f42cac6584ca4c3b74f232e897b33206&scene=21#wechat_redirect)
* [Kafka面试题系列(进阶篇)](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491176&idx=1&sn=81b44db6db3cb014dc9002ba840ad283&scene=21#wechat_redirect)
* [Kafka面试题系列(基础篇)](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491156&idx=2&sn=e18e2ae54f08ee34749fdb314652c2fe&scene=21#wechat_redirect)
* [面试知识点 | Kafka的数据存储与索引设计](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492699&idx=2&sn=776b93fb683a004d58d432d9b6b551b4&scene=21#wechat_redirect)
* [斗转星移 | 三万字总结Kafka各个版本差异](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492958&idx=1&sn=1ef59283fddfcffa14425d8d51af6f81&chksm=fd3ea3cbca492add3660dc37a37b458694cd285ae061884bb8aecbba2725fab3baee89608eba&token=1917475764&lang=zh_CN&scene=21#wechat_redirect)

**Spark**

* [面试必知的 Spark SQL 几种 Join 实现](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491813&idx=2&sn=557cc24073bef28f4f3589b63f71c967&scene=21#wechat_redirect)
* [面试注意点 | Spark&Flink的区别拾遗](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486577&idx=1&sn=49fd0138ad9837192b6eb78cbdc478e6&scene=21#wechat_redirect)
* [Spark如何协调来完成整个Job的运行详解](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247501609&idx=2&sn=d92b8fee97a0832ae6aa8dab82a7a9c3&chksm=fd3e81bcca4908aa29f2d1bf4145f3163933a74a5f70b6b13f33811488c57517cb660fb92f84&token=1917475764&lang=zh_CN&scene=21#wechat_redirect)
* [独孤九剑-Spark面试80连击(上)](https://blog.csdn.net/u013411339/article/details/100179471)
* [独孤九剑-Spark面试80连击(下)](https://blog.csdn.net/u013411339/article/details/100179469)
* [Spark的Cache和Checkpoint区别和联系拾遗](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247501217&idx=1&sn=75ab0ec8b505b87aac9ee37e2328db9b&chksm=fd3e8334ca490a226fbe5e779de27c43fd806f6c6d64e66d9efd8429a7f4b3acc9f4447a9157&scene=21#wechat_redirect)
* [Spark Job 逻辑执行图和数据依赖解析](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247501259&idx=1&sn=8716f0fcb9b85d36bdfe69039983c518&chksm=fd3e835eca490a48cb02d75e7d5183961461546510c0c4b9f7edee01954c92f9bf2de4432a53&scene=21#wechat_redirect)
* [Spark Job 物理执行图详解](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247501309&idx=1&sn=2a3edaed6845031acac7cc5c377bdd53&chksm=fd3e8368ca490a7e2b580983aba4aff7c6acadfba5f4863a776ed6cc3f63839353dbc935344e&scene=21#wechat_redirect)
* [Spark Shuffle过程详解](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247501347&idx=1&sn=1809c8d80c4af0772b7eef999e2b84d7&chksm=fd3e80b6ca4909a028b29e014fb1e7a1de10203277baa57ded90507668fd429d8ae6690c31aa&scene=21#wechat_redirect)

**Flink**

* [【大数据面试题】Flink企业级面试题60连击](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247497541&idx=1&sn=521d3f59d7c5bfe56607ccc0867c7575&scene=21#wechat_redirect)
* [全网第一|Flink学习面试灵魂40问答案](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486150&idx=1&sn=691f8cab1ba9d72984c8e0250ab24b79&chksm=fd3d4c53ca4ac545eb01a6781c4317bc3a3f5e5c2129bd2d57e5e2ea53edcbeeae2acd2a4ed6&token=1917475764&lang=zh_CN&scene=21#wechat_redirect)
* [面试别人说他熟悉Flink,我问了他Flink如何实现exactly-once语义](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247485260&idx=1&sn=7af0f27520b6fd74adf424d4d64f032d&scene=21#wechat_redirect)
* [阅读源码|Spark与Flink的RPC实现](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487449&idx=3&sn=5b3d8564d1c84b14f9d941c96d8f6a27&scene=21#wechat_redirect)
* [Flink性能调优小小总结](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247501028&idx=1&sn=40e36b5e0c58d909e7933622de0e53d0&chksm=fd3e8271ca490b6753b9c34671c4cf07c88ca953ddbe723abf5e28324f5248044f372f48abd4&token=1917475764&lang=zh_CN&scene=21#wechat_redirect)
* [生产上的坑才是真的坑 | 盘一盘Flink那些经典线上问题](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247500470&idx=1&sn=a9ddfabd545a52e463dbc728fdaa872d&chksm=fd3e8423ca490d3578b802859924a5de96038e5f114788443f10763b90cea4955ff352982e5a&token=1917475764&lang=zh_CN&scene=21#wechat_redirect)
* [我们在学习Flink的时候,到底在学习什么?](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247499604&idx=1&sn=d938dfb30d221774704982d2938b30c1&chksm=fd3eb9c1ca4930d76a391241333de461ca22d2aa27472eab3cffab1564872ae37f1b48fe2d3c&token=1917475764&lang=zh_CN&scene=21#wechat_redirect)

**数据仓库/数据湖**

* [数据湖存储架构选型](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247501955&idx=2&sn=0732c077a76a99b67bafa57e3dedbf3e&chksm=fd3e8e16ca4907003d62f768b629f7f2a99c14f073cdd4a117559ec755d5876eec814a3fd9f7&token=1917475764&lang=zh_CN&scene=21#wechat_redirect)
* [数据湖架构、战略和分析的8大错误认知](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247501798&idx=1&sn=ee75c1c067833cf0cb285e99e71a91e4&chksm=fd3e8173ca4908655af05c56447dd728d14f0cdde511b4675c4ba12bbbb293e9060a7911972a&token=1917475764&lang=zh_CN&scene=21#wechat_redirect)
* [数据湖在大数据典型场景下应用调研个人笔记](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247500604&idx=1&sn=75e2492e954fbaac868afacc9397328d&chksm=fd3e85a9ca490cbfd5d10b6feb743ad172f09bc87c357bdc88b5be8b8fc39a7ee3581c3a0f81&token=1917475764&lang=zh_CN&scene=21#wechat_redirect)
* [数据湖VS数据仓库?湖仓一体了解一下](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247496181&idx=1&sn=90753808e8dd92a6d9fb95d29a9925a2&chksm=fd3eb760ca493e769275c92e26812154314222849bc96dde2ed6ca2b568ceaef493f277864c5&token=1917475764&lang=zh_CN&scene=21#wechat_redirect)

**后端相关**

* [三万六千字通关MySQL面试](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487449&idx=1&sn=2cc97bf6669416267c25a7a46192b706&scene=21#wechat_redirect)
* [分析和定位线上作业 OOM 问题利器-MAT](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491453&idx=1&sn=49e762ec986faf408c03b4bc52634872&scene=21#wechat_redirect)
* [浅谈ZooKeeper中Kafka相关信息的存储](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487838&idx=1&sn=a82d191e12dc2590fc376258965e34f6&scene=21#wechat_redirect)
* [JVM架构体系与GC命令小总结](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247488882&idx=1&sn=62cc37f20fdce1193f2400507b7d478a&scene=21#wechat_redirect)
* [深入理解CAP理论和适用场景](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487458&idx=1&sn=11e0b029998ecdeccda8e18c1e43f980&scene=21#wechat_redirect)
* [ZooKeeper需要关注的点](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247488393&idx=1&sn=f820923ae3e782b509911f787a455a91&scene=21#wechat_redirect)
* [MySQL中InnoDB及索引深入剖析](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491256&idx=2&sn=34535df6790951e7de0e08bdfceed108&scene=21#wechat_redirect)
* [MySql的Binlog日志工具分析:Canal、Maxwell、Databus、DTS](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247491276&idx=1&sn=133e0d6bcf897ae288a83f1ac6e7456c&scene=21#wechat_redirect)
* [一致性协议算法-2PC、3PC、Paxos、Raft、ZAB、NWR超详细解析](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247495156&idx=1&sn=3502f333eaaf19eccef2a721ab1fead1&chksm=fd3eab61ca4922771033e47326464bda019b6d99d3d6f8a81b4cc5ca6bcddbc9307a0ca91eaa&token=1917475764&lang=zh_CN&scene=21#wechat_redirect)
* [MySQL数据库性能优化史诗级大总结](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247495088&idx=1&sn=f3e8460921135780a1e38fdfd163f909&chksm=fd3eab25ca492233ea3dae78b07db1f601b70155cdfd371df51beb4cacb2191fd9d383212999&token=1917475764&lang=zh_CN&scene=21#wechat_redirect)

**不便分类的其他**

* [十道海量数据处理面试题](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247495750&idx=1&sn=125e5a1800beeeb9faddd5c614136fec&scene=21#wechat_redirect)
* [这个面试问题很难么 | 如何处理大数据中的数据倾斜](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486586&idx=1&sn=e35def6429adaada0910b91eed8d7b1f&scene=21#wechat_redirect)
* [面试系列:十个海量数据处理方法大总](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247485151&idx=1&sn=665e5e284934aecd2fcf3d69e26e3b52&scene=21#wechat_redirect)
* [我们常说的海量小文件的根源是什么?](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486931&idx=2&sn=1c5987f3bad7a805895484ebfd683e11&scene=21#wechat_redirect)
* [28道关于ZooKeeper的面试题](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487021&idx=2&sn=bbfa3fb95e4fa3d0b6ae90a8933119ed&scene=21#wechat_redirect)
* [【数据白皮书重磅分享】推荐|埋点|用研|标签](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487177&idx=2&sn=dea2d55951494a5bf4438ec5a888bdd7&scene=21#wechat_redirect)
* [1万2千字长文助力春招 | Netty面试篇](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487309&idx=2&sn=a33cc56116cb7891145394f9c5353be3&scene=21#wechat_redirect)
* [Filter(过滤)|Project(映射)|Pushdowns(谓词下推)](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487414&idx=2&sn=2afda2b854a4f875159a43903bb7b337&scene=21#wechat_redirect)
* [代达罗斯之殇-大数据领域小文件问题解决攻略](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247495807&idx=1&sn=d23efef601586506e00435889a8e1657&chksm=fd3eb6eaca493ffcc18ccfc6c1112124be13486d8778427099e37b4ed536be1ffe1348f55131&token=1917475764&lang=zh_CN&scene=21#wechat_redirect)

**面试综合系列**

* [【对线面试官】阿里面试经历,有些人走一步看一步就挂了](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247497893&idx=1&sn=50837fcfceaf6df459936c2b4af36d3d&scene=21#wechat_redirect)
* [关于技术面试的一点点体会](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487326&idx=1&sn=1ba5867700bdfb5ad7f808a245ae3b1f&scene=21#wechat_redirect)
* [助力秋招-独孤九剑破剑式 | 10家企业面试真题](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486054&idx=1&sn=a9385d955df56d5834f6575748bd5277&scene=21#wechat_redirect)
* [如果你在准备面试,好好看看这130道题](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247486986&idx=1&sn=422d1a3c11c72ff97b32cc01142839f4&scene=21#wechat_redirect)

**简历系列**

* [一份优秀的简历该长成什么样](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247487297&idx=1&sn=9b007050efaf50f17e3f46915c70f4ea&scene=21#wechat_redirect)
* [你过来,我给你看个宝贝](https://mp.weixin.qq.com/s?__biz=MzU3MzgwNTU2Mg==&mid=2247492974&idx=1&sn=2d2a3ff8e805072df331dea7a94a7541&chksm=fd3ea3fbca492aed6d43990896edbe806a47a16e15c1541378528792e87f8df84c9b43e5a6d9&token=1917475764&lang=zh_CN&scene=21#wechat_redirect)

#### 面试系列合集
---
#### 一、Hadoop

1. [Hadoop面试题总结(一)](面试系列/Hadoop面试题总结/Hadoop%E9%9D%A2%E8%AF%95%E9%A2%98%E6%80%BB%E7%BB%93%EF%BC%88%E4%B8%80%EF%BC%89.md)
2. [Hadoop面试题总结(二)——HDFS](面试系列/Hadoop面试题总结/Hadoop%E9%9D%A2%E8%AF%95%E9%A2%98%E6%80%BB%E7%BB%93%EF%BC%88%E4%BA%8C%EF%BC%89%E2%80%94%E2%80%94HDFS.md)
3. [Hadoop面试题总结(三)——MapReduce](面试系列/Hadoop面试题总结/Hadoop%E9%9D%A2%E8%AF%95%E9%A2%98%E6%80%BB%E7%BB%93%EF%BC%88%E4%B8%89%EF%BC%89%E2%80%94%E2%80%94MapReduce.md)
4. [Hadoop面试题总结(四)——YARN](面试系列/Hadoop面试题总结/Hadoop%E9%9D%A2%E8%AF%95%E9%A2%98%E6%80%BB%E7%BB%93%EF%BC%88%E5%9B%9B%EF%BC%89%E2%80%94%E2%80%94YARN.md)
5. [Hadoop面试题总结(五)——优化问题](面试系列/Hadoop面试题总结/Hadoop%E9%9D%A2%E8%AF%95%E9%A2%98%E6%80%BB%E7%BB%93%EF%BC%88%E4%BA%94%EF%BC%89%E2%80%94%E2%80%94%E4%BC%98%E5%8C%96%E9%97%AE%E9%A2%98.md)

#### 二、Zookeeper

1. [Zookeeper面试题总结(一)](面试系列/Zookeeper面试题总结/Zookeeper.md)

#### 三、Hive

1. [Hive面试题总结(一)](面试系列/Hive面试题总结/Hive%EF%BC%88%E4%B8%80%EF%BC%89.md)
2. [Hive面试题总结(二)](面试系列/Hive面试题总结/Hive%EF%BC%88%E4%BA%8C%EF%BC%89.md)

#### 四、HBase

1. [HBase面试题总结(一)](面试系列/HBase面试题整理/HBase.md)

#### 五、Flume

1. [Flume面试题总结(一)](面试系列/Flume面试题整理/Flume.md)

#### 六、Kafka

1. [Kafka面试题总结(一)](面试系列/Kafka面试题整理/Kafka%EF%BC%88%E4%B8%80%EF%BC%89.md)
2. [Kafka面试题总结(二)](面试系列/Kafka面试题整理/Kafka%EF%BC%88%E4%BA%8C%EF%BC%89.md)

#### 七、Spark
1. [Spark面试题总结(一)](面试系列/Spark面试题整理/Spark%EF%BC%88%E4%B8%80%EF%BC%89.md)
2. [Spark面试题总结(二)](面试系列/Spark面试题整理/Spark%EF%BC%88%E4%BA%8C%EF%BC%89.md)
3. [Spark面试题总结(三)](面试系列/Spark面试题整理/Spark%EF%BC%88%E4%B8%89%EF%BC%89.md)
4. [Spark面试题总结(四)](面试系列/Spark面试题整理/Spark%EF%BC%88%E5%9B%9B%EF%BC%89.md)

**Spark性能优化:**

5. [Spark面试题总结(五)——几种常见的数据倾斜情况及调优方式](面试系列/Spark面试题整理/Spark调优/%E6%95%B0%E6%8D%AE%E5%80%BE%E6%96%9C.md)
6. [Spark面试题总结(六)——Shuffle配置调优](面试系列/Spark面试题整理/Spark调优/Shuffle%E9%85%8D%E7%BD%AE%E8%B0%83%E4%BC%98.md)
7. [Spark面试题总结(七)——程序开发调优](面试系列/Spark面试题整理/Spark调优/%E7%A8%8B%E5%BA%8F%E5%BC%80%E5%8F%91%E8%B0%83%E4%BC%98.md)
8. [Spark面试题总结(八)——运行资源调优](面试系列/Spark面试题整理/Spark调优/%E8%B5%84%E6%BA%90%E8%B0%83%E4%BC%98.md)

本系列的大纲会根据实际情况进行调整,欢迎大家关注~

----------------------

## 声明

文档中参考引用了网络上的博客和文章,大部分给出了出处,有些没写,如果造成了侵权行为,请您联系我,立即删除~

## 转载分享

建立本开源项目的初衷是基于个人学习与工作中对Java和大数据相关技术栈的总结记录,在这里也希望能帮助一些在学习Java和大数据过程中遇到问题的小伙伴,如果您需要转载本仓库的一些文章到自己的博客,请按照以下格式注明出处,谢谢合作。

```
作者:王知无
链接:https://mp.weixin.qq.com/s/0N4XSMFPuD7U_paGsBsblw
来源:《大数据技术与架构》
```

## 参与贡献

1. 如果您对本项目有任何建议或发现文中内容有误的,欢迎提交 issues 进行指正。
2. 对于文中我没有涉及到知识点,欢迎提交 PR。

## 扫我关注公众号

关注公众号:每天定时推送Hadoop/Spark/Flink等最新的**大数据领域最新动态和精品技术文章**!

**扫我加我好友,打造高质量、高知识密度朋友圈!**

## 扫我加群

备注**来自GitHub加群**,小助手会拉你进大数据讨论组,一起学习交流,期待你的到来~

## 如果对你有用,欢迎请我喝杯咖啡

备注Github,感谢您~

## 言而总之

**大数据成神之路** 该系列文章将为希望从事大数据开发或者由后端转型为大数据开发的工程师们指出需要学习的知识点和路径,本系列文章同时致敬我曾经在网络上看到无数个Java和大数据系列文章,深受启发同时也收货很多。