官术网_书友最值得收藏!

Chapter 3. Deep Dive into Apache Spark

Apache Spark is growing at a fast pace in terms of technology, community, and user base. Two new APIs were introduced in 2015: the DataFrame API and DataSet API. These two APIs are built on top of the core API, which is based on RDDs. It is essential to understand the deeper concepts of RDDs including runtime architecture and behavior on various resource managers of Spark.

This chapter is divided into the following sub topics:

  • Starting Spark daemons
  • Spark core concepts
  • Pairing RDDs
  • The lifecycle of a Spark program
  • Spark applications
  • Persistence and caching
  • Spark resource managers—Standalone, Yarn, and Mesos
主站蜘蛛池模板: 遂昌县| 台州市| 汝州市| 石屏县| 闽侯县| 武邑县| 襄城县| 荥经县| 修水县| 临城县| 会理县| 泽普县| 铜陵市| 马鞍山市| 阳春市| 瑞丽市| 灵台县| 江口县| 儋州市| 广水市| 齐河县| 竹山县| 襄汾县| 沧州市| 天峻县| 精河县| 孟村| 陆良县| 靖宇县| 丰台区| 甘孜| 凤台县| 泾阳县| 五华县| 岫岩| 彭阳县| 梅河口市| 博兴县| 化德县| 日喀则市| 常宁市|