- Apache Spark 2.x for Java Developers
- Sourav Gulati, Sumit Kumar
- 2021-07-02 19:01:52
NameNode
The NameNode manages the metadata of the HDFS cluster: the list of files and directories that exist in the cluster, the number of blocks each file is split into, and how those blocks are replicated and stored across the DataNodes. It also maintains the namespace and the file permissions of every file in the cluster. Apart from bookkeeping, the NameNode has a supervisory role: it watches the replication factor of every file and, if a block goes missing, issues commands to re-replicate it. It also generates reports used to assess cluster health. Note that all communication for these supervisory tasks is initiated by the DataNode, not the NameNode: a DataNode sends reports (block reports) to the NameNode, and only then does the NameNode respond with whatever commands or instructions are needed.
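The report-then-respond flow described above can be illustrated with a small toy model. This is only a sketch and not Hadoop's actual NameNode code: the class `ToyNameNode`, its methods, and the `REPLICATE` command string are all hypothetical names chosen for illustration, and real HDFS chooses replication sources and targets in a far more involved way.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.TreeSet;

/** Toy model only: not Hadoop's NameNode. All names here are hypothetical. */
class ToyNameNode {
    private final int replicationFactor;
    // Metadata: block ID -> DataNodes that have reported holding a replica.
    private final Map<String, Set<String>> blockLocations = new TreeMap<>();

    ToyNameNode(int replicationFactor) {
        this.replicationFactor = replicationFactor;
    }

    /** Records a block in the metadata before any replica has been reported. */
    void registerBlock(String blockId) {
        blockLocations.putIfAbsent(blockId, new TreeSet<>());
    }

    /**
     * Handles a block report sent by a DataNode and returns the commands
     * issued in reply. The NameNode only reacts to reports; it never
     * contacts a DataNode first.
     */
    List<String> processBlockReport(String dataNodeId, Set<String> reportedBlocks) {
        // Refresh the replica locations known for this DataNode.
        for (Map.Entry<String, Set<String>> entry : blockLocations.entrySet()) {
            if (reportedBlocks.contains(entry.getKey())) {
                entry.getValue().add(dataNodeId);
            } else {
                entry.getValue().remove(dataNodeId);
            }
        }
        // Ask the reporting node to copy any under-replicated block it holds.
        // Blocks that were never registered are ignored in this sketch.
        List<String> commands = new ArrayList<>();
        for (String blockId : new TreeSet<>(reportedBlocks)) {
            Set<String> holders = blockLocations.get(blockId);
            if (holders != null && holders.size() < replicationFactor) {
                commands.add("REPLICATE " + blockId);
            }
        }
        return commands;
    }

    public static void main(String[] args) {
        ToyNameNode nameNode = new ToyNameNode(2);
        nameNode.registerBlock("blk_1");
        // Only one replica is known so far, so a replicate command is returned.
        System.out.println(nameNode.processBlockReport("dn1", Collections.singleton("blk_1")));
        // A second DataNode reports the block; the replication factor is met.
        System.out.println(nameNode.processBlockReport("dn2", Collections.singleton("blk_1")));
    }
}
```

In this sketch the replication command appears only as the return value of `processBlockReport`, which mirrors the point that the NameNode acts in response to a DataNode's report rather than on its own initiative.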