官术网_书友最值得收藏!

The fundamental premise of Hadoop

The fundamental premise of Hadoop is that instead of attempting to perform a task on a single large machine, the task can be subpided into smaller segments that can then be delegated to multiple smaller machines. These so-called smaller machines would then perform the task on their own portion of the data. Once the smaller machines have completed their tasks to produce the results on the tasks they were allocated, the inpidual units of results would then be aggregated to produce the final result.

Although, in theory, this may appear relatively simple, there are various technical considerations to bear in mind. For example:

  • Is the network fast enough to collect the results from each inpidual server?
  • Can each inpidual server read data fast enough from the disk?
  • If one or more of the servers fail, do we have to start all over?
  • If there are multiple large tasks, how should they be prioritized?

There are many more such considerations that must be considered when working with a distributed architecture of this nature.

主站蜘蛛池模板: 海安县| 晋江市| 岗巴县| 宁河县| 柘城县| 文成县| 长顺县| 社会| 应用必备| 开平市| 伊金霍洛旗| 三亚市| 堆龙德庆县| 九江县| 屯门区| 历史| 杭锦旗| 嵊泗县| 孟津县| 大田县| 扶余县| 浑源县| 拜城县| 柳河县| 化隆| 噶尔县| 祁东县| 永福县| 天气| 营口市| 景德镇市| 眉山市| 枝江市| 荥经县| 徐闻县| 霍山县| 额济纳旗| 梅河口市| 遵义县| 万载县| 鹿邑县|