
Preparing hardware for Hadoop

One important aspect of Hadoop setup is defining the hardware requirements and sizing the cluster before the project starts. Although Apache Hadoop can run on commodity hardware, most implementations use server-class hardware for their Hadoop clusters. (For more information, see the Powered by Hadoop list or the Facebook data warehouse research paper from SIGMOD 2010.)

There is no fixed rule for the minimum hardware needed to set up Hadoop, but we recommend the following configuration to ensure reasonable performance:

  • CPU: at least 2 cores at 2.5 GHz or higher
  • Memory: 8 GB of RAM
  • Storage: 100 GB of free space for running programs and processing data
  • Network: a good internet connection
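
As a quick sanity check, the recommendations above can be compared against the machine you plan to use. The following is a minimal sketch in Python using only the standard library. It is not part of Hadoop; the function names, the constants, and the 10% memory tolerance are assumptions made for illustration. It checks core count, physical memory, and free disk space only; clock frequency and network quality are not checked.

```python
import os
import shutil

# Thresholds taken from the recommendations above (treated as minimums).
# All names here are hypothetical, chosen for this sketch.
MIN_CORES = 2
MIN_MEMORY_GB = 8
MIN_FREE_DISK_GB = 100

# Assumption: the OS usually reports slightly less memory than the nominal
# size of the installed RAM, so allow a 10% shortfall before complaining.
MEMORY_TOLERANCE = 0.9

GIB = 1024 ** 3


def total_memory_gb():
    """Return physical memory in GiB, or None if it cannot be determined."""
    try:
        # Available on Linux and most Unix-like systems; not on Windows.
        return os.sysconf("SC_PAGE_SIZE") * os.sysconf("SC_PHYS_PAGES") / GIB
    except (AttributeError, ValueError, OSError):
        return None


def check_requirements(cores, memory_gb, free_disk_gb):
    """Compare measured values with the thresholds; return a list of problems."""
    problems = []
    if cores is None or cores < MIN_CORES:
        problems.append(f"CPU: need at least {MIN_CORES} cores, found {cores}")
    if memory_gb is None:
        problems.append("Memory: could not be determined on this platform")
    elif memory_gb < MIN_MEMORY_GB * MEMORY_TOLERANCE:
        problems.append(
            f"Memory: need about {MIN_MEMORY_GB} GB, found {memory_gb:.1f} GB"
        )
    if free_disk_gb < MIN_FREE_DISK_GB:
        problems.append(
            f"Storage: need {MIN_FREE_DISK_GB} GB free, found {free_disk_gb:.1f} GB"
        )
    return problems


def check_this_machine(path="."):
    """Measure the local machine and check it against the thresholds."""
    free_disk_gb = shutil.disk_usage(path).free / GIB
    return check_requirements(os.cpu_count(), total_memory_gb(), free_disk_gb)


if __name__ == "__main__":
    for line in check_this_machine() or ["All checked requirements are met."]:
        print(line)
```

Pass the directory where you intend to keep Hadoop data to `check_this_machine` so that free space is measured on the right disk. An empty list means none of the checked thresholds were missed.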

The official Cloudera blog covers cluster sizing in more detail. If you are setting up a virtual machine, you can opt for dynamically sized disks that can be enlarged as your needs grow. We will look at how to size the cluster in the upcoming Hadoop cluster section.
