官术网_书友最值得收藏!

Planning and sizing clusters

Once you start working on problems and implementing Hadoop clusters, you'll have to deal with the issue of sizing. It's not just the sizing aspect of clusters that needs to be considered, but the SLAs associated with Hadoop runtime as well. A cluster can be categorized based on workloads as follows:

  • Lightweight: This category is intended for low computation and fewer storage requirements, and is more useful for defined datasets with no growth
  • Balanced: A balanced cluster can have storage and computation requirements that grow over time
  • Storage-centric: This category is more focused towards storing data, and less towards computation; it is mostly used for archival purposes, as well as minimal processing
  • Computational-centric: This cluster is intended for high computation which requires CPU or GPU-intensive work, such as analytics, prediction, and data mining

Before we get on to solve the sizing problem of a Hadoop cluster, however, we have to understand the following topics.

主站蜘蛛池模板: 柘城县| 皋兰县| 株洲县| 南溪县| 左云县| 定陶县| 西乌| 修水县| 尖扎县| 睢宁县| 惠东县| 奇台县| 和田市| 铜陵市| 常熟市| 建阳市| 上犹县| 孟州市| 沂源县| 西平县| 平武县| 綦江县| 嵩明县| 托克逊县| 甘南县| 奈曼旗| 乌兰浩特市| 广饶县| 永福县| 贡山| 宽甸| 麟游县| 桐乡市| 文化| 水城县| 武穴市| 离岛区| 泸溪县| 东光县| 平阴县| 冀州市|