官术网_书友最值得收藏!

Summary

This chapter has helped you install Java, Scala, and Spark on a Linux machine that has been procured from an AWS EC2 instance. You can now use the same setup to execute the queries/examples that are provided in other chapters of this book. 

In the next chapter, you will learn about the basic unit of Spark, which is RDD. RDD refers to an immutable Resilient Distributed Dataset, on which we can apply further actions and transformations.

主站蜘蛛池模板: 墨脱县| 宁远县| 双城市| 涟水县| 馆陶县| 云浮市| 肇州县| 甘泉县| 松滋市| 克拉玛依市| 陆良县| 淮北市| 安康市| 泉州市| 江津市| 都安| 福州市| 富川| 讷河市| 沅陵县| 云龙县| 奈曼旗| 喀喇沁旗| 太湖县| 余姚市| 新沂市| 长宁县| 扶风县| 吉木乃县| 普兰店市| 固始县| 县级市| 乌什县| 广平县| 离岛区| 南充市| 桃江县| 泾川县| 澄迈县| 赣州市| 阜康市|