- Hands-On Big Data Modeling
- James Lee Tao Wei Suresh Kumar Mukhiya
- 291字
- 2021-06-10 18:58:50
File-based storage
File-based storage is the most common storage model, and has benefits over other storage models, in terms of ease of use and simplicity. It extends the traditional filesystem architecture, maintaining data in a hierarchical structure as files. Accordingly, this type of storage has hierarchically arranged file paths, which are used as entries for accessing data in the physical storage. Big data commonly uses distributed file systems (DFS) as the basic storage system. Users require namespaces and path knowledge to access saved files in file-based storage. For cross-system file sharing, there are three main parts to the path or namespace of a file: the protocol, the domain name, and the path of the file. NFS-Family and HDFS are two common file-based storage models, and are defined as follows:
- NFS-Family: This is a distributed filesystem protocol, originally produced by Sun Microsystems. In this system, a network file system (NFS) enables remote hosts to mount filesystems over a network, and interact with those filesystems as though they are mounted locally. NFS has been widely used in Unix and Linux-based operating systems, and also encouraged the development of modern distributed filesystems.
- HDFS: Hadoop Distributed File System (HDFS) is a distributed filesystem that was created and optimized for processing large data volumes, and for higher availability. It spreads across the local storage of a cluster consisting of many server nodes. HDFS [14] is an open source distributed filesystem, written in Java. It is the open-source implementation of Google File System (GFS), and works as the core storage for Hadoop ecosystems and the majority of the existing big data platforms. HDFS was designed for fault detection and recovery, and has the ability to handle huge datasets.
- 后稀缺:自動化與未來工作
- 21小時學(xué)通AutoCAD
- 計算機圖形學(xué)
- R Machine Learning By Example
- 網(wǎng)上生活必備
- 影視后期制作(Avid Media Composer 5.0)
- STM32G4入門與電機控制實戰(zhàn):基于X-CUBE-MCSDK的無刷直流電機與永磁同步電機控制實現(xiàn)
- 深度學(xué)習(xí)中的圖像分類與對抗技術(shù)
- Apache Spark Deep Learning Cookbook
- RPA(機器人流程自動化)快速入門:基于Blue Prism
- Nginx高性能Web服務(wù)器詳解
- Learn CloudFormation
- Machine Learning with the Elastic Stack
- 計算機與信息技術(shù)基礎(chǔ)上機指導(dǎo)
- FPGA/CPLD應(yīng)用技術(shù)(Verilog語言版)