- Big Data Analytics
- Venkat Ankam
- 111字
- 2021-08-20 10:32:24
Chapter 3. Deep Dive into Apache Spark
Apache Spark is growing at a fast pace in terms of technology, community, and user base. Two new APIs were introduced in 2015: the DataFrame API and DataSet API. These two APIs are built on top of the core API, which is based on RDDs. It is essential to understand the deeper concepts of RDDs including runtime architecture and behavior on various resource managers of Spark.
This chapter is divided into the following sub topics:
- Starting Spark daemons
- Spark core concepts
- Pairing RDDs
- The lifecycle of a Spark program
- Spark applications
- Persistence and caching
- Spark resource managers—Standalone, Yarn, and Mesos
推薦閱讀
- Vue.js 2 and Bootstrap 4 Web Development
- C語言程序設計基礎與實驗指導
- 算法精粹:經典計算機科學問題的Java實現
- CentOS 7 Linux Server Cookbook(Second Edition)
- Django:Web Development with Python
- 機器人Python青少年編程開發實例
- Groovy for Domain:specific Languages(Second Edition)
- 微信小程序開發解析
- 飛槳PaddlePaddle深度學習實戰
- Python算法從菜鳥到達人
- Unity UI Cookbook
- Qt5 C++ GUI Programming Cookbook
- 零基礎學Python編程(少兒趣味版)
- Building Business Websites with Squarespace 7(Second Edition)
- 30天學通C#項目案例開發