官术网_书友最值得收藏!

Setting up Hadoop v2 on your local machine

This recipe describes how to set up Hadoop v2 on your local machine using the local mode. Local mode is a non-distributed mode that can be used for testing and debugging your Hadoop applications. When running a Hadoop application in local mode, all the required Hadoop components and your applications execute inside a single Java Virtual Machine (JVM) process.

Getting ready

Download and install JDK 1.6 or a higher version, preferably the Oracle JDK 1.7. Oracle JDK can be downloaded from http://www.oracle.com/technetwork/java/javase/downloads/index.html.

How to do it...

Now let's start the Hadoop v2 installation:

  1. Download the most recent Hadoop v2 branch distribution (Hadoop 2.2.0 or later) from http://hadoop.apache.org/releases.html.
  2. Unzip the Hadoop distribution using the following command. You will have to change the x.x. in the filename to the actual release you have downloaded. From this point onward, we will call the unpacked Hadoop directory {HADOOP_HOME}:
    $ tar -zxvf hadoop-2.x.x.tar.gz
    
  3. Now, you can run Hadoop jobs through the {HADOOP_HOME}/bin/hadoop command, and we will elaborate on that further in the next recipe.

How it works...

Hadoop local mode does not start any servers but does all the work within a single JVM. When you submit a job to Hadoop in local mode, Hadoop starts a JVM to execute the job. The output and the behavior of the job is the same as a distributed Hadoop job, except for the fact that the job only uses the current node to run the tasks and the local filesystem is used for the data storage. In the next recipe, we will discover how to run a MapReduce program using the Hadoop local mode.

主站蜘蛛池模板: 揭西县| 镇巴县| 彭州市| 青神县| 仙居县| 梁河县| 全椒县| 特克斯县| 壤塘县| 乌拉特后旗| 邵东县| 海伦市| 罗甸县| 齐河县| 三原县| 庄河市| 莱西市| 涪陵区| 永和县| 苏尼特左旗| 蓬莱市| 兴宁市| 铜鼓县| 北海市| 吴旗县| 蒙自县| 四子王旗| 昭通市| 西华县| 弥渡县| 达尔| 施甸县| 荆门市| 茌平县| 新野县| 钟祥市| 山西省| 沿河| 周至县| 和静县| 教育|