- Learning Apache Spark 2
- Muhammad Asif Abbasi
- 150 words
- 2021-07-09 18:46:00
Summary
In this chapter, we went from creating an RDD to manipulating the data within it. We looked at the transformations and actions available on an RDD and walked through various code examples to explain the differences between them. Finally, we moved on to the more advanced topic of pair RDDs, where we demonstrated how to create a pair RDD along with some advanced transformations on it.
We are now ready to explain the ETL process and the types of external storage systems that Spark can read data from and write data to, including external filesystems, Apache Hadoop HDFS, Apache Hive, Amazon S3, and so on. We'll also look at connectors to some of the most popular databases, and at how to load data from storage systems optimally and store it back.
However, before moving on to the next chapter, take a break; you definitely deserve it!