- Hands-On Big Data Modeling
- James Lee Tao Wei Suresh Kumar Mukhiya
- 148字
- 2021-06-10 18:58:48
Data storage
The primary goal of a storage infrastructure is to store data. There are two issues that needed to be considered when dealing with data storage, as follows:
- Capacity: The capacity refers to how much storage one should allocate (or what size the memory should be) in order to store data.
- Scalability: The attached storage devices should be scalable, as the volume of data will grow over time. Also, scalability deals with the ability to connect to the network in order to get extra storage over time.
In a big data system, we have the choice of architecting a storage infrastructure by choosing how much of each type of storage we need to have. Using SSDs for storing a large amount of data speeds up lookup operations in the data by at least a factor of ten over hard drives; however, it also increases the cost.
推薦閱讀
- 數據展現的藝術
- 網絡服務器架設(Windows Server+Linux Server)
- Ansible Quick Start Guide
- Java實用組件集
- Mastering VMware vSphere 6.5
- Getting Started with Containerization
- Dreamweaver CS3網頁設計與網站建設詳解
- 機器學習與大數據技術
- Nginx高性能Web服務器詳解
- 悟透AutoCAD 2009案例自學手冊
- Statistics for Data Science
- 從零開始學SQL Server
- Mastering OpenStack(Second Edition)
- Learn Microsoft Azure
- Windows 7故障與技巧200例