- Hadoop Cluster Deployment
- Danil Zburivsky
- 310字
- 2021-07-21 18:16:39
Preface
In the last couple of years, Hadoop has become a standard solution for building data integration platforms. Introducing any new technology into a company's data infrastructure stack requires system engineers and database administrators to quickly learn all the aspects of the new component. Hadoop doesn't make this task any easier because it is not a single software product, but it is rather a collection of multiple separate open source projects. These projects need to be properly installed and configured in order to make the Hadoop platform robust and reliable.
Many existing Hadoop distributions provide a simplified way to install Hadoop using some kind of graphical interface. This approach dramatically reduces the amount of time required to go from zero to the fully functional Hadoop cluster. It also simplifies managing the cluster configuration. The problem with an automated setup and configuration is that it actually hides a lot of important aspects about Hadoop components that work together, such as why some components require other components, and which configuration parameters are the most important, and so on.
This book provides a guide to installing and configuring all the main Hadoop components manually. Setting up at least one fully operational cluster by yourself will provide very useful insights into how Hadoop operates under the hood and will make it much easier for you to debug any issues that may arise. You can also use this book as a quick reference to the main Hadoop components and configuration options gathered in one place and in a succinct format. While writing this book, I found myself constantly referring to it when working on real production Hadoop clusters, to look up a specific variable or refresh a best practice when it comes to OS configuration. This habit reassured me that such a guide might be useful to other aspiring and experienced Hadoop administrators and developers.
- 一本書學(xué)內(nèi)部審計:新手內(nèi)部審計從入門到精通
- Managing IaaS and DBaaS Clouds with Oracle Enterprise Manager Cloud Control 12c
- 新中國審計制度變遷
- Magento 2 Cookbook
- 審計綜合模擬實訓(xùn)
- 項目管理(第二版)
- Business Intelligence with MicroStrategy Cookbook
- 國家治理能力視角的國家審計功能理論研究
- Microsoft System Center Data Protection Manager 2012 SP1
- 財務(wù)建模與綜合估值:數(shù)據(jù)研磨、模型校準(zhǔn)、動態(tài)估值
- 《企業(yè)內(nèi)部控制基本規(guī)范》合規(guī)實務(wù)指南
- 風(fēng)險導(dǎo)向?qū)徲嫓?zhǔn)則實施效果研究
- Business Intelligence Cookbook:A Project Lifecycle Approach Using Oracle Technology
- 計量經(jīng)濟(jì)學(xué)
- Amazon EC2 Cookbook