- Apache Spark 2.x for Java Developers
- Sourav Gulati Sumit Kumar
- 151字
- 2021-07-02 19:01:52
Defining HDFS
The Hadoop Distributed File System (HDFS), as the name suggests, is a distributed filesystem based on the lines of the Google File System written in Java. In practice, HDFS resembles closely any other UNIX filesystem with support for common file operations such as ls, cp, rm, du, cat, and so on. However what makes HDFS stand out, despite its simplicity, is its mechanism to handle node failure in the Hadoop cluster without effectively changing the search time for accessing stored files. The HDFS cluster consists of two major components: DataNodes and NameNode.
HDFS has a unique way of storing data on HDFS clusters (cheap commodity networked commodity computers). It splits the regular file in smaller chunks called blocks and then makes an exact number of copies of such chunks depending on the replication factor for that file. After that, it copies such chunks to different DataNodes of the cluster.
- INSTANT Mock Testing with PowerMock
- 一步一步學Spring Boot 2:微服務項目實戰
- 大學計算機基礎(第三版)
- 在最好的年紀學Python:小學生趣味編程
- Python數據可視化:基于Bokeh的可視化繪圖
- Django:Web Development with Python
- 單片機應用技術
- 青少年學Python(第1冊)
- 西門子S7-200 SMART PLC編程從入門到實踐
- R語言:邁向大數據之路(加強版)
- Laravel Application Development Blueprints
- Java Web從入門到精通(第2版)
- SFML Game Development
- Microsoft XNA 4.0 Game Development Cookbook
- C語言程序設計實驗指導