官术网_书友最值得收藏!

Defining HDFS

The Hadoop Distributed File System (HDFS), as the name suggests, is a distributed filesystem based on the lines of the Google File System written in Java. In practice, HDFS resembles closely any other UNIX filesystem with support for common file operations such as ls, cp, rm, du, cat, and so on. However what makes HDFS stand out, despite its simplicity, is its mechanism to handle node failure in the Hadoop cluster without effectively changing the search time for accessing stored files. The HDFS cluster consists of two major components: DataNodes and NameNode.

HDFS has a unique way of storing data on HDFS clusters (cheap commodity networked commodity computers). It splits the regular file in smaller chunks called blocks and then makes an exact number of copies of such chunks depending on the replication factor for that file. After that, it copies such chunks to different DataNodes of the cluster.

主站蜘蛛池模板: 普兰县| 衡阳市| 哈尔滨市| 宁波市| 无棣县| 渝中区| 贵南县| 万全县| 长宁区| 桐城市| 梅河口市| 宣化县| 射阳县| 同仁县| 长泰县| 双城市| 万山特区| 武隆县| 通城县| 濉溪县| 绥芬河市| 临沧市| 扬州市| 亚东县| 宝坻区| 民勤县| 嘉鱼县| 滦南县| 新津县| 乌兰县| 克山县| 汉寿县| 武强县| 海兴县| 定州市| 昔阳县| 瑞昌市| 四会市| 铜川市| 陕西省| 龙里县|