Time for action – changing the base HDFS directory

Let's first set the base directory that specifies the location on the local filesystem under which Hadoop will keep all its data. Carry out the following steps:

  1. Create a directory into which Hadoop will store its data:
    $ mkdir /var/lib/hadoop
    
  2. Ensure the directory is writeable by any user:
    $ chmod 777 /var/lib/hadoop
    
  3. Modify core-site.xml once again to add the following property:
    <property>
      <name>hadoop.tmp.dir</name>
      <value>/var/lib/hadoop</value>
    </property>
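
The first two steps can be wrapped in a small helper that also confirms the result. This is only a sketch: the function name prepare_base_dir is hypothetical and is not part of Hadoop, and on most systems creating a directory under /var/lib requires root privileges.

```shell
#!/bin/sh
# Sketch only: prepare_base_dir is a hypothetical helper, not a Hadoop command.
# It creates the directory, opens its permissions as in step 2, and then
# checks that the directory exists and is writable by the current user.
prepare_base_dir() {
    dir="$1"
    mkdir -p "$dir" || return 1      # -p: no error if the directory already exists
    chmod 777 "$dir" || return 1     # writable by any user, as in step 2
    [ -d "$dir" ] && [ -w "$dir" ]   # exit status reports whether it is usable
}
```

Calling prepare_base_dir /var/lib/hadoop (as root, if needed) would then cover steps 1 and 2 in one go. Step 3 still has to be done by editing core-site.xml.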

What just happened?

Because all the Hadoop components are running on our local host, any data we store in Hadoop has to live somewhere on the local filesystem. Regardless of the mode, Hadoop by default uses the hadoop.tmp.dir property as the base directory under which all files and data are written.

MapReduce, for example, uses a mapred directory under this base directory, and HDFS uses dfs. The danger is that hadoop.tmp.dir defaults to a location under /tmp (/tmp/hadoop-${user.name}), and some Linux distributions delete the contents of /tmp on each reboot. It is therefore safer to state explicitly where the data is to be held.
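
Once Hadoop has been formatted and started, one way to see this layout is to check for the two subdirectories under the base directory. The sketch below assumes only the dfs and mapred names mentioned above. The function name report_hadoop_dirs is hypothetical, and the subdirectories will not exist until the relevant components have written data.

```shell
#!/bin/sh
# Sketch only: report_hadoop_dirs is a hypothetical helper, not a Hadoop command.
# It reports whether the dfs and mapred subdirectories exist under a base directory.
report_hadoop_dirs() {
    base="$1"
    for sub in dfs mapred; do
        if [ -d "$base/$sub" ]; then
            echo "$base/$sub exists"
        else
            echo "$base/$sub missing"
        fi
    done
}
```

For the configuration in this section, the call would be report_hadoop_dirs /var/lib/hadoop.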