
The origin of HBase

Given the limitations of GFS and MR, Google developed another solution that not only uses GFS for data storage but also processes smaller data files very efficiently. They called this new solution BigTable.

Note

BigTable is a distributed storage system for managing structured data that is designed to scale to a very large size: petabytes of data across thousands of commodity servers.

Welcome to the world of HBase, which combines the storage design of column-oriented databases with the data access design of key-value store databases, where a specific cell of data is accessed by its key.

Note

In column-oriented databases, data is grouped by column, and the values of each column are stored contiguously on disk. Such a design is highly I/O efficient for analytical queries over very large data sets, where not all the columns are needed.
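
The difference between the two layouts can be sketched in a few lines of Python. This is only an illustration of the idea, not how HBase lays out its files on disk; the table, its column names, and the helper functions `to_row_layout` and `to_column_layout` are all hypothetical.

```python
# Hypothetical three-row table; column names and values are illustrative only.
rows = [
    {"id": 1, "name": "alice", "age": 30},
    {"id": 2, "name": "bob", "age": 25},
    {"id": 3, "name": "carol", "age": 41},
]


def to_row_layout(rows):
    """Row-oriented: all values of one row sit next to each other."""
    return [value for row in rows for value in row.values()]


def to_column_layout(rows):
    """Column-oriented: all values of one column sit next to each other."""
    columns = list(rows[0].keys())
    return {col: [row[col] for row in rows] for col in columns}


row_layout = to_row_layout(rows)
column_layout = to_column_layout(rows)

# An analytical query over one column reads only that column's contiguous run
# and never touches the "id" or "name" values.
average_age = sum(column_layout["age"]) / len(column_layout["age"])
```

In the row layout, computing the same average would mean stepping over every `id` and `name` value along the way, which is the extra I/O that a column-oriented design avoids.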

HBase can be defined as a sparse, distributed, persistent, multidimensional sorted map, which is indexed by a row key, column key, and timestamp. HBase is designed to run on a cluster of commodity hardware and stores both structured and semi-structured data. HBase has the ability to scale horizontally as you add more machines to the cluster.
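
To make the "sparse, multidimensional sorted map" definition more concrete, here is a minimal in-memory sketch in Python. It is a toy model under stated assumptions, not the HBase client API: the class `SortedCellMap`, its methods, and the sample row and column names are hypothetical, and it models only a single process with no distribution or persistence. It assumes that newer versions of a cell sort before older ones.

```python
import bisect


class SortedCellMap:
    """Toy model of a sparse map indexed by (row key, column key, timestamp)."""

    def __init__(self):
        self._keys = []    # sorted list of (row, column, -timestamp)
        self._values = {}  # key tuple -> stored value

    def put(self, row, column, timestamp, value):
        # Negate the timestamp so that newer versions sort first (assumption).
        key = (row, column, -timestamp)
        if key not in self._values:
            bisect.insort(self._keys, key)
        self._values[key] = value

    def get(self, row, column):
        """Return the newest value of a cell, or None if the cell is absent."""
        i = bisect.bisect_left(self._keys, (row, column, float("-inf")))
        if i < len(self._keys) and self._keys[i][:2] == (row, column):
            return self._values[self._keys[i]]
        return None

    def scan(self, start_row, stop_row):
        """Yield (row, column, timestamp, value) for start_row <= row < stop_row."""
        i = bisect.bisect_left(self._keys, (start_row,))
        while i < len(self._keys) and self._keys[i][0] < stop_row:
            row, column, neg_ts = self._keys[i]
            yield row, column, -neg_ts, self._values[self._keys[i]]
            i += 1


table = SortedCellMap()
table.put("row2", "info:name", 1, "bob")
table.put("row1", "info:name", 1, "alice")
table.put("row1", "info:name", 2, "alicia")  # newer version of the same cell
table.put("row1", "info:age", 1, "30")

latest_name = table.get("row1", "info:name")
missing_cell = table.get("row2", "info:age")  # never written, so nothing is stored
row1_cells = list(table.scan("row1", "row2"))
```

Because only the cells that were actually written occupy space, the map is sparse, and because the keys are kept in sorted order, a range scan over adjacent row keys is a single sequential walk.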
