- Apache Hadoop 3 Quick Start Guide
- Hrishikesh Vijay Karambelkar
- 307字
- 2021-06-10 19:18:43
Downloading Hadoop
Once you have completed the prerequisites and SSH keyless entry with all the necessary nodes, you are good to download the Hadoop release. You can download Apache Hadoop from http://www.apache.org/dyn/closer.cgi/hadoop/common/. Hadoop provides two options for downloading—you can either download the source code of Apache Hadoop or you can download binaries. If you download the source code, you need to compile it and create binaries out of it. We will proceed with downloading binaries.
One important question that often arises while downloading Hadoop involves which version to choose. You will find many alpha and beta versions, as well as stable versions. Currently, the stable Hadoop version is 2.9.1, however this may change by the time you read this book. The answer to such a question depends upon usage. For example, if you are evaluating Hadoop for the first time, you may choose to go with the latest Hadoop version (3.1.0) with all-new features, so as to keep yourself updated with the latest trends and skills.
However, if you are looking to set up a production-based cluster, you may need to choose a version of Hadoop that is stable (such as 2.9.1), as well as established, to ensure peaceful project execution. In our case, we will download Hadoop 3.1.0, as shown in the following screenshot:

You can download the binary (tar.gz) from Apache's website, and you can untar it with following command:
hadoop@base0:/$ tar xvzf <hadoop-downloaded-file>.tar.gz
The preceding command will extract the file in a given location. When you list the directory, you should see the following folders:

- The bin/ folder contains all executable for Hadoop
- sbin/ contains all scripts to start or stop clusters
- etc/ contains all configuration pertaining to Hadoop
- share/ contains all the documentation and examples
- Other folders such as include/, lib/, and libexec/ contain libraries and other dependencies
- 集成架構(gòu)中型系統(tǒng)
- JMAG電機電磁仿真分析與實例解析
- Mastering Elastic Stack
- 水下無線傳感器網(wǎng)絡(luò)的通信與決策技術(shù)
- 網(wǎng)絡(luò)管理工具實用詳解
- Linux嵌入式系統(tǒng)開發(fā)
- 所羅門的密碼
- PostgreSQL 10 High Performance
- PostgreSQL High Performance Cookbook
- Flink內(nèi)核原理與實現(xiàn)
- 工業(yè)機器人技術(shù)
- Learning Couchbase
- Ubuntu 9 Linux應(yīng)用基礎(chǔ)
- 網(wǎng)頁配色萬用寶典
- AI成“神”之日:人工智能的終極演變