- Hadoop Beginner's Guide
- Garry Turkington
- 240字
- 2021-07-29 16:51:41
Chapter 4. Developing MapReduce Programs
Now that we have explored the technology of MapReduce, we will spend this chapter looking at how to put it to use. In particular, we will take a more substantial dataset and look at ways to approach its analysis by using the tools provided by MapReduce.
In this chapter we will cover the following topics:
- Hadoop Streaming and its uses
- The UFO sighting dataset
- Using Streaming as a development/debugging tool
- Using multiple mappers in a single job
- Efficiently sharing utility files and data across the cluster
- Reporting job and task status and log information useful for debugging
Throughout this chapter, the goal is to introduce both concrete tools and ideas about how to approach the analysis of a new data set. We shall start by looking at how to use scripting programming languages to aid MapReduce prototyping and initial analysis. Though it may seem strange to learn the Java API in the previous chapter and immediately move to different languages, our goal here is to provide you with an awareness of different ways to approach the problems you face. Just as many jobs make little sense being implemented in anything but the Java API, there are other situations where using another approach is best suited. Consider these techniques as new additions to your tool belt and with that experience you will know more easily which is the best fit for a given scenario.
- 工業機器人虛擬仿真實例教程:KUKA.Sim Pro(全彩版)
- 大數據項目管理:從規劃到實現
- MCSA Windows Server 2016 Certification Guide:Exam 70-741
- VMware Performance and Capacity Management(Second Edition)
- 視覺檢測技術及智能計算
- 水晶石精粹:3ds max & ZBrush三維數字靜幀藝術
- 計算機網絡原理與技術
- 大學C/C++語言程序設計基礎
- PostgreSQL 10 Administration Cookbook
- Visual C++項目開發案例精粹
- 網絡服務器搭建與管理
- 菜鳥起飛電腦組裝·維護與故障排查
- 玩轉PowerPoint
- 單片機C51應用技術
- Learning iOS 8 for Enterprise