- Mastering Hadoop
- Sandeep Karanth
- 219字
- 2021-08-06 19:52:58
Chapter 1. Hadoop 2.X
"There's nothing that cannot be found through some search engine or on the Internet somewhere." |
||
--Eric Schmidt, Executive Chairman, Google |
Hadoop is the de facto open source framework used in the industry for large scale, massively parallel, and distributed data processing. It provides a computation layer for parallel and distributed computation processing. Closely associated with the computation layer is a highly fault-tolerant data storage layer, the Hadoop Distributed File System (HDFS). Both the computation and data layers run on commodity hardware, which is inexpensive, easily available, and compatible with other similar hardware.
In this chapter, we will look at the journey of Hadoop, with a focus on the features that make it enterprise-ready. Hadoop, with 6 years of development and deployment under its belt, has moved from a framework that supports the MapReduce paradigm exclusively to a more generic cluster-computing framework. This chapter covers the following topics:
- An outline of Hadoop's code evolution, with major milestones highlighted
- An introduction to the changes that Hadoop has undergone as it has moved from 1.X releases to 2.X releases, and how it is evolving into a generic cluster-computing framework
- An introduction to the options available for enterprise-grade Hadoop, and the parameters for their evaluation
- An overview of a few popular enterprise-ready Hadoop distributions
- Unreal Engine:Game Development from A to Z
- AutoCAD繪圖實用速查通典
- 影視后期制作(Avid Media Composer 5.0)
- Blender Compositing and Post Processing
- C語言寶典
- 嵌入式操作系統
- 水下無線傳感器網絡的通信與決策技術
- Learning Azure Cosmos DB
- Windows Server 2008 R2活動目錄內幕
- Salesforce for Beginners
- 筆記本電腦維修90個精選實例
- 工業自動化技術實訓指導
- Visual Studio 2010 (C#) Windows數據庫項目開發
- Photoshop CS5圖像處理入門、進階與提高
- 青少年VEX IQ機器人實訓課程(初級)