  • Hadoop Beginner's Guide
  • Garry Turkington
  • 2021-07-29 16:51:35

Time for action – changing the base HDFS directory

Let's first set the base directory, that is, the location on the local filesystem under which Hadoop will keep all its data. Carry out the following steps:

  1. Create a directory into which Hadoop will store its data:
    $ mkdir /var/lib/hadoop
    
  2. Ensure the directory is writeable by any user:
    $ chmod 777 /var/lib/hadoop
    
  3. Modify core-site.xml once again to add the following property:
    <property>
      <name>hadoop.tmp.dir</name>
      <value>/var/lib/hadoop</value>
    </property>
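
The steps above can be wrapped in a couple of small shell functions, shown here as a rough sketch rather than a definitive script. The names prepare_base_dir and has_tmp_dir_property are hypothetical helpers introduced for illustration, and the location of core-site.xml depends on your installation, so it is passed in as an argument instead of being assumed.

```shell
#!/bin/sh
# Hypothetical helper: create the base directory and make it writable by
# any user, mirroring steps 1 and 2. Unlike plain mkdir, "mkdir -p" also
# creates missing parent directories and does not fail if the directory
# already exists.
prepare_base_dir() {
    dir="$1"
    mkdir -p "$dir" && chmod 777 "$dir"
}

# Hypothetical helper: succeed if the given configuration file mentions the
# hadoop.tmp.dir property. This is a plain text match, not an XML parse, so
# it only confirms that step 3 was not forgotten.
has_tmp_dir_property() {
    conf="$1"
    grep -q '<name>hadoop.tmp.dir</name>' "$conf"
}

# Example calls (commented out because they need suitable permissions and a
# real configuration path):
#   prepare_base_dir /var/lib/hadoop
#   has_tmp_dir_property /path/to/core-site.xml && echo "property is set"
```

Creating a directory under /var/lib usually requires elevated privileges, so the first call may need to be run as a user who is allowed to write there.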

What just happened?

As we will be storing data in Hadoop and all the various components are running on our local host, this data will need to be stored on our local filesystem somewhere. Regardless of the mode, Hadoop by default uses the hadoop.tmp.dir property as the base directory under which all files and data are written.

MapReduce, for example, uses a mapred directory under this base directory; HDFS uses dfs. The danger is that the default value of hadoop.tmp.dir is /tmp/hadoop-${user.name}, which sits under /tmp, and some Linux distributions delete the contents of /tmp on each reboot. So it's safer to explicitly state where the data is to be held.
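
As a small illustration of the point above, the following sketch flags a base directory that lives under /tmp and prints the component subdirectories mentioned in the text. Both function names (is_under_tmp and component_dirs) are hypothetical, and the script does not read any Hadoop configuration; it only works on the path you give it.

```shell
#!/bin/sh
# Hypothetical helper: succeed if the candidate base directory is /tmp
# itself or anywhere beneath it, where a reboot may wipe the data.
is_under_tmp() {
    case "$1" in
        /tmp|/tmp/*) return 0 ;;
        *) return 1 ;;
    esac
}

# Hypothetical helper: print the MapReduce and HDFS subdirectories that
# the text describes for a given base directory, one per line.
component_dirs() {
    printf '%s/mapred\n%s/dfs\n' "$1" "$1"
}

# Example calls (commented out so that sourcing this file has no output):
#   is_under_tmp /var/lib/hadoop || echo "base directory is outside /tmp"
#   component_dirs /var/lib/hadoop
```

The match is purely textual, so a path that reaches /tmp through a symbolic link would not be detected.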
