官术网_书友最值得收藏!

Summary

In this chapter, we introduced Apache Spark and its architecture. We discussed the concept of driver program and executors, which are the core components of Spark.

We then briefly discussed the different programming APIs for Spark, and its major components including Spark Core, Spark SQL, Spark Streaming, and Spark GraphX. 

Finally, we discussed some major differences between Spark and Hadoop and how they complement each other. In the next chapter, we will install Spark on an AWS EC2 instance and go through different clients to interact with Spark. 

主站蜘蛛池模板: 北宁市| 广东省| 田东县| 宁化县| 壤塘县| 雷州市| 隆德县| 怀安县| 三穗县| 闻喜县| 河池市| 永吉县| 怀化市| 临猗县| 抚宁县| 宁都县| 于田县| 图木舒克市| 射洪县| 慈溪市| 丽江市| 且末县| 洮南市| 涞源县| 白银市| 高雄县| 邢台县| 麦盖提县| 五台县| 北安市| 当雄县| 安岳县| 浦东新区| 高淳县| 阿拉善左旗| 永清县| 五指山市| 邹平县| 建德市| 阿图什市| 平罗县|