- Hadoop Beginner's Guide
- Garry Turkington
- 300字
- 2021-07-29 16:51:37
Comparison of local versus EMR Hadoop
After our first experience of both a local Hadoop cluster and its equivalent in EMR, this is a good point at which we can consider the differences of the two approaches.
As may be apparent, the key differences are not really about capability; if all we want is an environment to run MapReduce jobs, either approach is completely suited. Instead, the distinguishing characteristics revolve around a topic we touched on in Chapter 1, What It's All About, that being whether you prefer a cost model that involves upfront infrastructure costs and ongoing maintenance effort over one with a pay-as-you-go model with a lower maintenance burden along with rapid and conceptually infinite scalability. Other than the cost decisions, there are a few things to keep in mind:
- EMR supports specific versions of Hadoop and has a policy of upgrading over time. If you have a need for a specific version, in particular if you need the latest and greatest versions immediately after release, then the lag before these are live on EMR may be unacceptable.
- You can start up a persistent EMR job flow and treat it much as you would a local Hadoop cluster, logging into the hosting nodes and tweaking their configuration. If you find yourself doing this, its worth asking if that level of control is really needed and, if so, is it stopping you getting all the cost model benefits of a move to EMR?
- If it does come down to a cost consideration, remember to factor in all the hidden costs of a local cluster that are often forgotten. Think about the costs of power, space, cooling, and facilities. Not to mention the administration overhead, which can be nontrivial if things start breaking in the early hours of the morning.
- Word 2000、Excel 2000、PowerPoint 2000上機指導與練習
- PowerShell 3.0 Advanced Administration Handbook
- 協作機器人技術及應用
- Visual C# 2008開發技術實例詳解
- 計算機網絡應用基礎
- 物聯網與云計算
- Python Algorithmic Trading Cookbook
- Embedded Programming with Modern C++ Cookbook
- Salesforce for Beginners
- R Data Analysis Projects
- 空間機器人
- Python文本分析
- Creating ELearning Games with Unity
- Photoshop CS4圖像處理考前12小時
- INSTANT R Starter