- Apache Hadoop 3 Quick Start Guide
- Hrishikesh Vijay Karambelkar
- 2021-06-10 19:18:43
Downloading Hadoop
Once you have completed the prerequisites and set up passwordless SSH access to all the necessary nodes, you are ready to download the Hadoop release. You can download Apache Hadoop from http://www.apache.org/dyn/closer.cgi/hadoop/common/. Hadoop offers two download options: the source code or prebuilt binaries. If you download the source code, you need to compile it and build the binaries yourself. We will proceed with the binaries.
One important question that often arises while downloading Hadoop is which version to choose. You will find many alpha and beta versions, as well as stable versions. Currently, the stable Hadoop version is 2.9.1; however, this may change by the time you read this book. The right choice depends on how you intend to use Hadoop. For example, if you are evaluating Hadoop for the first time, you may choose the latest version (3.1.0), which includes all the new features, so that you stay up to date with the latest trends and skills.
However, if you are setting up a production cluster, you may need to choose a version of Hadoop that is both stable (such as 2.9.1) and well established, to keep the project running smoothly. In our case, we will download Hadoop 3.1.0.

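If you prefer the command line to the browser, the download can be sketched as follows. This is a minimal sketch, not the book's own procedure: the helper names hadoop_tarball_url and download_hadoop are hypothetical, it assumes that wget is installed, and it assumes the release is served from the Apache archive under the hadoop-<version>/hadoop-<version>.tar.gz layout. A mirror suggested by the download page above may be faster.

```shell
# Hypothetical helper: print the download URL of a Hadoop binary release.
# Assumption: the Apache archive stores each release under
# hadoop-<version>/hadoop-<version>.tar.gz.
hadoop_tarball_url() {
  printf 'https://archive.apache.org/dist/hadoop/common/hadoop-%s/hadoop-%s.tar.gz\n' "$1" "$1"
}

# Hypothetical helper: download the tarball into the current directory.
# Assumption: wget is available on this node.
download_hadoop() {
  wget "$(hadoop_tarball_url "$1")"
}

# Usage (requires network access):
#   download_hadoop 3.1.0
```

Keeping the version as a parameter makes it easy to switch between an evaluation release such as 3.1.0 and a production release such as 2.9.1.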
You can download the binary (tar.gz) from Apache's website and untar it with the following command:
hadoop@base0:/$ tar xvzf <hadoop-downloaded-file>.tar.gz
The preceding command extracts the archive into the current directory. When you list the extracted directory, you should see the following folders:

- bin/ contains all the executables for Hadoop
- sbin/ contains all the scripts to start or stop clusters
- etc/ contains all the configuration files pertaining to Hadoop
- share/ contains all the documentation and examples
- Other folders such as include/, lib/, and libexec/ contain libraries and other dependencies
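To confirm that the extraction produced the folders listed above, a quick check along the following lines may help. This is only a sketch: check_hadoop_layout is a hypothetical helper, not part of Hadoop, and the directory you pass to it is whatever folder tar created on your machine.

```shell
# Hypothetical helper: verify that an extracted Hadoop directory contains
# the folders described above. Prints each missing folder and returns a
# non-zero status if any is absent.
check_hadoop_layout() {
  hadoop_home="$1"
  missing=0
  for dir in bin sbin etc share include lib libexec; do
    if [ ! -d "$hadoop_home/$dir" ]; then
      echo "missing: $dir/"
      missing=1
    fi
  done
  return "$missing"
}

# Usage (the path is an assumption; use the folder created by tar):
#   check_hadoop_layout /hadoop-3.1.0 && echo "layout looks complete"
```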