- Mastering Hadoop
- Sandeep Karanth
- 250字
- 2021-08-06 19:52:59
Summary
In this chapter, we saw the evolution of Hadoop and some of its milestones and releases. We went into depth on Hadoop 2.X and the changes it brings into Hadoop. The key takeaways from this chapter are:
- MapReduce was born out of the necessity to gather, process, and index data at web scale. Apache Hadoop is an open source distribution of the MapReduce computational model.
- In over 6 years of its existence, Hadoop has become the number one choice as a framework for massively parallel and distributed computing. The community has been shaping Hadoop to gear up for enterprise use. In 1.X releases, HDFS append and security, were the key features that made Hadoop enterprise-friendly.
- MapReduce supports a limited set of use cases. Onboarding other paradigms into Hadoop enables support for a wider range of analytics and can also increase cluster resource utilization. In Hadoop 2.X, the JobTracker functions are separated and YARN handles cluster resource management and scheduling. MapReduce is one of the applications that can run on YARN.
- Hadoop's storage layer was enhanced in 2.X to separate the filesystem from the block storage service. This enables features such as supporting multiple namespaces and integration with other filesystems. 2.X shows improvements in Hadoop storage availability and snapshotting.
- Distributions of Hadoop provide enterprise-grade management software, tools, support, training, and services. Most distributions shadow Apache Hadoop in their capabilities.
MapReduce is still an integral part of Hadoop's DNA. In the next chapter, we will explore MapReduce optimizations and best practices.
推薦閱讀
- 電氣自動化專業英語(第3版)
- 構建高質量的C#代碼
- 自動控制工程設計入門
- R Machine Learning By Example
- 自動控制原理
- 數控銑削(加工中心)編程與加工
- ServiceNow Cookbook
- 快學Flash動畫百例
- 網絡綜合布線設計與施工技術
- Implementing AWS:Design,Build,and Manage your Infrastructure
- Machine Learning with Apache Spark Quick Start Guide
- AVR單片機工程師是怎樣煉成的
- 計算智能算法及其生產調度應用
- 計算機應用基礎實訓(職業模塊)
- Machine Learning with Spark(Second Edition)