官术网_书友最值得收藏!

Defining HDFS

The Hadoop Distributed File System (HDFS), as the name suggests, is a distributed filesystem based on the lines of the Google File System written in Java. In practice, HDFS resembles closely any other UNIX filesystem with support for common file operations such as ls, cp, rm, du, cat, and so on. However what makes HDFS stand out, despite its simplicity, is its mechanism to handle node failure in the Hadoop cluster without effectively changing the search time for accessing stored files. The HDFS cluster consists of two major components: DataNodes and NameNode.

HDFS has a unique way of storing data on HDFS clusters (cheap commodity networked commodity computers). It splits the regular file in smaller chunks called blocks and then makes an exact number of copies of such chunks depending on the replication factor for that file. After that, it copies such chunks to different DataNodes of the cluster.

主站蜘蛛池模板: 四川省| 木兰县| 陆河县| 通榆县| 青川县| 手机| 莲花县| 喜德县| 黎城县| 花垣县| 兰坪| 南宫市| 太和县| 剑川县| 务川| 扶余县| 阜阳市| 桐城市| 抚松县| 潞城市| 扎赉特旗| 鄄城县| 广昌县| 肃宁县| 克拉玛依市| 石城县| 高淳县| 新巴尔虎右旗| 蒙城县| 油尖旺区| 灵丘县| 都江堰市| 青冈县| 九龙坡区| 宜黄县| 五家渠市| 黄梅县| 苍南县| 镇宁| 偃师市| 黄龙县|