https://github.com/myxof/sparknotes
Spark 2.0学习笔记
https://github.com/myxof/sparknotes
distributed-computing spark spark-sql
Last synced: 10 months ago
JSON representation
Spark 2.0学习笔记
- Host: GitHub
- URL: https://github.com/myxof/sparknotes
- Owner: MyXOF
- License: apache-2.0
- Created: 2017-03-19T07:47:34.000Z (almost 9 years ago)
- Default Branch: master
- Last Pushed: 2018-11-03T17:35:44.000Z (over 7 years ago)
- Last Synced: 2025-04-15T03:13:44.357Z (10 months ago)
- Topics: distributed-computing, spark, spark-sql
- Size: 1.59 MB
- Stars: 5
- Watchers: 3
- Forks: 1
- Open Issues: 11
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# SparkNotes
[](https://travis-ci.com/MyXOF/SparkNotes)
[](https://www.apache.org/licenses/LICENSE-2.0.html)
Spark 2.0学习笔记
主要结合Spark 2.3.2源码和《图解Spark 核心技术与案例实战》一书,记录对Spark系统的一些思考。
在阅读的过程中发现《图解Spark 核心技术与案例实战》一书中许多地方的描述和源码不符合,这里以实现源码为准。
作图工具推荐一下[ProcessOn](https://www.processon.com/)网站,非常不错
话不多说,请从[这里](https://github.com/MyXOF/SparkNotes/blob/master/doc/markdown/README.md)开始吧。
## 参考资料
[1]. Apache Spark. http://spark.apache.org/
[2]. 《图解Spark 核心技术与案例实战》. 郭景瞻著
[3]. Zaharia M, Chowdhury M, Das T, et al. Resilient distributed datasets: a fault-tolerant abstraction for in-memory cluster computing[C]// Usenix Conference on Networked Systems Design and Implementation. USENIX Association, 2012:2-2.