- Hadoop Beginner's Guide
- Garry Turkington
- 2021-07-29 16:51:41
Summary
We have covered a lot of ground in this chapter, and we now have the foundation to explore MapReduce in more detail. Specifically, we learned that key/value pairs are a broadly applicable data model that is well suited to MapReduce processing. We also learned how to write mapper and reducer implementations using version 0.20 and later of the Java API.
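As a reminder of how the key/value model fits together, here is a minimal sketch in plain Java. It deliberately does not use the Hadoop API: the class `KeyValueSketch` and its `map`, `reduce`, and `run` methods are hypothetical names chosen for illustration, and `run` is only a single-process stand-in for the grouping that the framework performs between the two phases.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Plain-Java illustration of the key/value model; this is not the Hadoop API.
class KeyValueSketch {

    // "Map" step: emit one (word, 1) pair for every word in the input line.
    static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> out = new ArrayList<>();
        for (String word : line.trim().split("\\s+")) {
            if (!word.isEmpty()) {
                out.add(Map.entry(word, 1));
            }
        }
        return out;
    }

    // "Reduce" step: collapse all values seen for one key into a single total.
    static int reduce(String key, List<Integer> values) {
        int sum = 0;
        for (int v : values) {
            sum += v;
        }
        return sum;
    }

    // Stand-in for the framework: group map output by key, then reduce each group.
    static Map<String, Integer> run(List<String> lines) {
        Map<String, List<Integer>> grouped = new TreeMap<>();
        for (String line : lines) {
            for (Map.Entry<String, Integer> kv : map(line)) {
                grouped.computeIfAbsent(kv.getKey(), k -> new ArrayList<>())
                       .add(kv.getValue());
            }
        }
        Map<String, Integer> result = new TreeMap<>();
        for (Map.Entry<String, List<Integer>> e : grouped.entrySet()) {
            result.put(e.getKey(), reduce(e.getKey(), e.getValue()));
        }
        return result;
    }

    public static void main(String[] args) {
        // prints {cat=1, dog=1, sat=2, the=2}
        System.out.println(run(List.of("the cat sat", "the dog sat")));
    }
}
```

In a real job the map and reduce logic would live in classes extending the API's `Mapper` and `Reducer`, and the grouping in `run` would be done by the framework across many machines.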
We then moved on and saw how a MapReduce job is processed and how the map and reduce methods are tied together by significant coordination and task-scheduling machinery. We also saw how certain MapReduce jobs require specialization in the form of a custom partitioner or combiner.
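To make the partitioner's role concrete, the following plain-Java sketch shows the decision it makes: given a key and the number of reducers, return the index of the reducer that receives the key. The first method uses the same arithmetic as Hadoop's default `HashPartitioner`. The second is a hypothetical custom rule, assuming keys of the form `prefix:rest`, that sends every key with the same prefix to the same reducer. The class and method names are illustrative only, and the sketch does not cover combiners.

```java
// Plain-Java illustration of partitioning logic; this is not the Hadoop API.
class PartitionSketch {

    // Default-style rule: clear the sign bit of the hash, then take the
    // remainder, so the result always lies in [0, numReducers).
    static int hashPartition(String key, int numReducers) {
        return (key.hashCode() & Integer.MAX_VALUE) % numReducers;
    }

    // Hypothetical custom rule: partition on the text before the first ':'
    // so that related keys are processed by the same reducer.
    static int prefixPartition(String key, int numReducers) {
        int idx = key.indexOf(':');
        String prefix = idx >= 0 ? key.substring(0, idx) : key;
        return hashPartition(prefix, numReducers);
    }

    public static void main(String[] args) {
        int reducers = 4;
        for (String key : new String[] {"us:alice", "us:bob", "uk:carol"}) {
            System.out.println(key + " -> reducer " + prefixPartition(key, reducers));
        }
    }
}
```

With the custom rule, `us:alice` and `us:bob` always land on the same reducer, which the default rule does not guarantee.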
We also learned how Hadoop reads data from and writes data to the filesystem. It uses the concepts of InputFormat and OutputFormat to handle the file as a whole, and RecordReader and RecordWriter to translate the file format to and from key/value pairs.
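The translation between file contents and key/value pairs can be sketched in plain Java as follows. The reader side mirrors the convention of Hadoop's text input, where each record's key is the byte offset of the line and the value is the line itself. The writer side mirrors the default text output of one tab-separated key and value per line. `RecordSketch` and its methods are hypothetical names, and the sketch assumes UTF-8 text with `\n` line endings held entirely in memory.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Plain-Java illustration of record reading and writing; not the Hadoop API.
class RecordSketch {

    // Reader side: turn raw text into (byte offset, line) pairs.
    static List<Map.Entry<Long, String>> readRecords(String data) {
        List<Map.Entry<Long, String>> records = new ArrayList<>();
        long offset = 0;
        for (String line : data.split("\n")) {
            records.add(Map.entry(offset, line));
            // Advance past the line's bytes plus the '\n' terminator.
            offset += line.getBytes(StandardCharsets.UTF_8).length + 1;
        }
        return records;
    }

    // Writer side: emit one "key<TAB>value" line per pair.
    static String writeRecords(List<Map.Entry<String, Integer>> pairs) {
        StringBuilder sb = new StringBuilder();
        for (Map.Entry<String, Integer> kv : pairs) {
            sb.append(kv.getKey()).append('\t').append(kv.getValue()).append('\n');
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        for (Map.Entry<Long, String> r : readRecords("hello\nworld\n")) {
            System.out.println(r.getKey() + " => " + r.getValue());
        }
        System.out.print(writeRecords(List.of(Map.entry("hello", 1))));
    }
}
```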
With this knowledge, we will now move on to a case study in the next chapter, which demonstrates the ongoing development and enhancement of a MapReduce application that processes a large data set.