
NameNode

NameNode is the gatekeeper for all HDFS-related queries. It serves as the single point of coordination for HDFS data, which is distributed across multiple nodes. The NameNode works as a registry that tracks which data blocks are stored on which DataNodes in the cluster. In addition to tracking the data blocks, the NameNode also monitors the health of each DataNode through the heartbeat mechanism. In any given Hadoop cluster, there can be only one active NameNode at a time.

The secondary NameNode periodically takes a checkpoint of the active NameNode's metadata by merging the edit log into the filesystem image (every hour by default, configurable through dfs.namenode.checkpoint.period). Despite its name, it is not a hot standby and does not take over when the active NameNode goes down; in a high-availability setup, that responsibility falls to a standby NameNode.

The filesystem namespace in HDFS is modeled on Unix-like filesystem data structures. In a high-availability setup, any request to create, edit, or delete HDFS files is first recorded in the edit log kept on the journal nodes; the standby NameNode reads these edits from the journal nodes to keep its own copy of the namespace in sync. Once a majority of the journal nodes have flushed the change, a response is sent back to the calling API. If writing to one journal node fails, the NameNode carries on with the remaining journal nodes, as long as a majority of them still acknowledges the change.
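The registry and heartbeat roles described above can be sketched as a small in-memory model. This is only an illustration under stated assumptions, not the HDFS implementation: the class ToyNameNode, its method names, the 30-second timeout, and the block and DataNode identifiers are all hypothetical and chosen for readability.

```python
import time
from collections import defaultdict


class ToyNameNode:
    """Illustrative model of block tracking and heartbeats (not real HDFS code)."""

    def __init__(self, heartbeat_timeout_s=30.0, clock=time.monotonic):
        self.heartbeat_timeout_s = heartbeat_timeout_s  # assumed value for this sketch
        self._clock = clock                             # injectable so the demo is deterministic
        self._last_heartbeat = {}                       # DataNode id -> time of last heartbeat
        self._block_locations = defaultdict(set)        # block id -> DataNode ids holding a replica

    def heartbeat(self, datanode_id):
        """Record that a DataNode has reported in."""
        self._last_heartbeat[datanode_id] = self._clock()

    def block_report(self, datanode_id, block_ids):
        """Register the blocks that a DataNode says it stores."""
        for block_id in block_ids:
            self._block_locations[block_id].add(datanode_id)

    def live_datanodes(self):
        """Return the DataNodes whose last heartbeat is within the timeout."""
        now = self._clock()
        return {
            dn for dn, seen in self._last_heartbeat.items()
            if now - seen <= self.heartbeat_timeout_s
        }

    def locate(self, block_id):
        """Return the live DataNodes that hold a replica of the block."""
        holders = self._block_locations.get(block_id, set())
        return sorted(holders & self.live_datanodes())


# Demo with a hand-driven clock instead of real time.
now = [0.0]
nn = ToyNameNode(heartbeat_timeout_s=30.0, clock=lambda: now[0])
nn.heartbeat("dn1")
nn.heartbeat("dn2")
nn.block_report("dn1", ["blk_1", "blk_2"])
nn.block_report("dn2", ["blk_1"])

now[0] = 20.0
nn.heartbeat("dn1")          # dn1 keeps reporting, dn2 goes silent

now[0] = 40.0
print(nn.locate("blk_1"))    # ['dn1'] because dn2 has missed its heartbeat window
```

The point of the sketch is that block locations alone are not enough: a lookup has to be filtered by liveness, which is why the NameNode keeps both pieces of state.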

The NameNode used to be a single point of failure in Hadoop 1.X. Hadoop 2.X introduced high availability, in which a standby NameNode takes over when the active NameNode fails. In Hadoop 3.X, more than one standby NameNode is supported. The same has been depicted in the overall architecture diagram.
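To make the failover path more concrete, here is a minimal sketch of an active NameNode logging edits to a majority of journal nodes and a standby NameNode (the hot-standby role in a high-availability setup) replaying them before taking over. The classes JournalNode and HaNameNode, their methods, and the file paths are hypothetical; the recovery protocol of a real cluster is considerably more involved than this toy version.

```python
class JournalNode:
    """Toy journal node that stores an ordered list of edits."""

    def __init__(self, name):
        self.name = name
        self.available = True
        self.edits = []                      # list of (txid, operation) pairs

    def append(self, txid, op):
        if not self.available:
            raise ConnectionError(f"{self.name} is unreachable")
        self.edits.append((txid, op))


class HaNameNode:
    """Toy NameNode that can act as active (log_edit) or standby (catch_up)."""

    def __init__(self, name, journals):
        self.name = name
        self.journals = journals
        self.namespace = set()               # file paths known to this NameNode
        self.last_txid = 0

    def _apply(self, op):
        action, path = op
        if action == "create":
            self.namespace.add(path)
        elif action == "delete":
            self.namespace.discard(path)

    def log_edit(self, op):
        """Active role: persist the edit on a majority of journals, then apply it."""
        txid = self.last_txid + 1
        acks = 0
        for jn in self.journals:
            try:
                jn.append(txid, op)
                acks += 1
            except ConnectionError:
                pass                         # a minority of failed journals is tolerated
        if acks <= len(self.journals) // 2:
            raise RuntimeError("edit not acknowledged by a majority of journals")
        self.last_txid = txid
        self._apply(op)

    def catch_up(self):
        """Standby role: replay edits from reachable journals (simplified, no recovery)."""
        merged = {}
        for jn in self.journals:
            if jn.available:
                merged.update(dict(jn.edits))
        for txid in sorted(merged):
            if txid > self.last_txid:
                self._apply(merged[txid])
                self.last_txid = txid


journals = [JournalNode("jn1"), JournalNode("jn2"), JournalNode("jn3")]
active = HaNameNode("nn1", journals)
standby = HaNameNode("nn2", journals)

active.log_edit(("create", "/data/a.txt"))
journals[2].available = False                # one journal node fails
active.log_edit(("create", "/data/b.txt"))   # still succeeds with 2 of 3 acknowledgements
active.log_edit(("delete", "/data/a.txt"))

standby.catch_up()                           # standby is now ready to take over
print(sorted(standby.namespace))             # ['/data/b.txt']
```

With three journal nodes the sketch tolerates one failure; losing a second one makes log_edit raise, which mirrors the idea that an edit counts as durable only once a majority has recorded it.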