- HBase Essentials
- Nishant Garg
- 199字
- 2021-08-05 17:24:19
The origin of HBase
Looking at the limitations of GFS and MR, Google approached another solution, which not only uses GFS for data storage but it is also used for processing the smaller data files very efficiently. They called this new solution BigTable.
Note
BigTable is a distributed storage system for managing structured data that is designed to scale to a very large size: petabytes of data across thousands of commodity servers.
Welcome to the world of HBase, databases and the data access design from the keyvalue store databases where a key-based access to a specific cell of data is provided.
Note
In column-oriented databases, data grouped by columns and column values is stored contiguously on a disk. Such a design is highly I/O effective when dealing with very large data sets used for analytical queries where not all the columns are needed.
HBase can be defined as a sparse, distributed, persistent, multidimensional sorted map, which is indexed by a row key, column key, and timestamp. HBase is designed to run on a cluster of commodity hardware and stores both structured and semi-structured data. HBase has the ability to scale horizontally as you add more machines to the cluster.
- Hands-On Intelligent Agents with OpenAI Gym
- Big Data Analytics with Hadoop 3
- Deep Learning Quick Reference
- 21天學通PHP
- ETL with Azure Cookbook
- 商戰數據挖掘:你需要了解的數據科學與分析思維
- 走入IBM小型機世界
- 來吧!帶你玩轉Excel VBA
- 高維聚類知識發現關鍵技術研究及應用
- Silverlight 2完美征程
- INSTANT VMware vCloud Starter
- Spark大數據商業實戰三部曲:內核解密|商業案例|性能調優
- Linux Shell Scripting Cookbook(Third Edition)
- 計算機組裝與維修實訓
- PowerPoint 2010幻燈片制作高手速成