官术网_书友最值得收藏!

Recycle or trash bin configuration

There will also be cases where we need to restore an accidently deleted file or directory. This may be due to a user error or some archiving policy that cleans data periodically.

For such situations, we can configure the recycle bin so that the deleted files can be restored for a specified amount of time. In this recipe, we will see that this can be configured.

Getting ready

This recipe shows the steps needed to edit the configuration file and add new parameters to the file to enable trash in the Hadoop cluster.

How to do it...

  1. ssh to Namenode and edit the core-site.xml file to add the following property to it:
    <property>
    <name>fs.trash.interval</name>
    <value>10080</value>
    </property>
  2. The fs.trash.interval parameter defines the time in minutes after which the checkpoint will be deleted.
  3. Restart the namenode daemon for the property to take effect:
    $ hadoop-daemons.sh stop namenode
    $ hadoop-daemons.sh start namenode
    
  4. Once trash is enabled, delete any unimportant files, as shown in the following screenshot. You will see a different message--rather than saying deleted, it says moved to trash:
    How to do it...
  5. The deleted file can be restored by using the following command:
    $ hadoop fs -cp /user/hadoop/.Trash/Current/input/new.txt /input/
    

How it works...

Any deleted data is moved to the .Trash directory under the home of the user who executed the command. Every time the check pointer runs, it creates a new checkpoint out of current and removes any checkpoints created more than fs.trash.interval minutes ago.

There's more...

In addition to the preceding method, there is a fs.trash.checkpoint.interval parameter that defines the number of minutes between checkpoints.

主站蜘蛛池模板: 通州区| 怀来县| 长阳| 德清县| 两当县| 越西县| 翁牛特旗| 故城县| 宁德市| 迭部县| 稷山县| 南川市| 通河县| 广水市| 兴化市| 松原市| 理塘县| 景宁| 盐池县| 光泽县| 龙山县| 蒙山县| 平利县| 安西县| 深水埗区| 湖州市| 龙南县| 武义县| 洪泽县| 扎赉特旗| 遵义县| 界首市| 灯塔市| 揭东县| 诏安县| 扎兰屯市| 林芝县| 淄博市| 苏尼特右旗| 昌宁县| 醴陵市|