- Hadoop Beginner's Guide
- Garry Turkington
- 300字
- 2021-07-29 16:51:37
Comparison of local versus EMR Hadoop
After our first experience of both a local Hadoop cluster and its equivalent in EMR, this is a good point at which we can consider the differences of the two approaches.
As may be apparent, the key differences are not really about capability; if all we want is an environment to run MapReduce jobs, either approach is completely suited. Instead, the distinguishing characteristics revolve around a topic we touched on in Chapter 1, What It's All About, that being whether you prefer a cost model that involves upfront infrastructure costs and ongoing maintenance effort over one with a pay-as-you-go model with a lower maintenance burden along with rapid and conceptually infinite scalability. Other than the cost decisions, there are a few things to keep in mind:
- EMR supports specific versions of Hadoop and has a policy of upgrading over time. If you have a need for a specific version, in particular if you need the latest and greatest versions immediately after release, then the lag before these are live on EMR may be unacceptable.
- You can start up a persistent EMR job flow and treat it much as you would a local Hadoop cluster, logging into the hosting nodes and tweaking their configuration. If you find yourself doing this, its worth asking if that level of control is really needed and, if so, is it stopping you getting all the cost model benefits of a move to EMR?
- If it does come down to a cost consideration, remember to factor in all the hidden costs of a local cluster that are often forgotten. Think about the costs of power, space, cooling, and facilities. Not to mention the administration overhead, which can be nontrivial if things start breaking in the early hours of the morning.
- 火格局的時空變異及其在電網(wǎng)防火中的應用
- 返璞歸真:UNIX技術(shù)內(nèi)幕
- PostgreSQL Administration Essentials
- 大數(shù)據(jù)安全與隱私保護
- 水晶石精粹:3ds max & ZBrush三維數(shù)字靜幀藝術(shù)
- 中國戰(zhàn)略性新興產(chǎn)業(yè)研究與發(fā)展·智能制造
- 我也能做CTO之程序員職業(yè)規(guī)劃
- 網(wǎng)絡安全技術(shù)及應用
- 基于RPA技術(shù)財務機器人的應用與研究
- Windows 7故障與技巧200例
- Moodle 2.0 Course Conversion(Second Edition)
- Learning iOS 8 for Enterprise
- 這樣用Word!
- 仿龜機器人的設計與制作
- Deep Learning with PyTorch Quick Start Guide