- Big Data Analytics
- Venkat Ankam
- 111字
- 2021-08-20 10:32:24
Chapter 3. Deep Dive into Apache Spark
Apache Spark is growing at a fast pace in terms of technology, community, and user base. Two new APIs were introduced in 2015: the DataFrame API and DataSet API. These two APIs are built on top of the core API, which is based on RDDs. It is essential to understand the deeper concepts of RDDs including runtime architecture and behavior on various resource managers of Spark.
This chapter is divided into the following sub topics:
- Starting Spark daemons
- Spark core concepts
- Pairing RDDs
- The lifecycle of a Spark program
- Spark applications
- Persistence and caching
- Spark resource managers—Standalone, Yarn, and Mesos
推薦閱讀
- Learn Type:Driven Development
- 零基礎學Scratch少兒編程:小學課本中的Scratch創意編程
- Mastering Articulate Storyline
- Windows Server 2012 Unified Remote Access Planning and Deployment
- INSTANT Yii 1.1 Application Development Starter
- 21天學通C++(第5版)
- Spring MVC+MyBatis開發從入門到項目實踐(超值版)
- Unity 2018 Augmented Reality Projects
- 移動增值應用開發技術導論
- PrimeFaces Blueprints
- 從零開始學Android開發
- 愛上C語言:C KISS
- Delphi開發典型模塊大全(修訂版)
- UML軟件建模
- Visual C++開發寶典