- Fast Data Processing with Spark 2(Third Edition)
- Krishna Sankar
- 2021-08-20 10:27:06
Installing the prebuilt distribution
Let's download a prebuilt Spark distribution and install it. Later, we will also build a version from source. The download page is at http://spark.apache.org/downloads.html. Select the options as shown in the following screenshot:

We will use wget from the command line. You can do a direct download as well:
cd /opt
sudo wget http://www-us.apache.org/dist/spark/spark-2.0.0/spark-2.0.0-bin-hadoop2.7.tgz
We are downloading the prebuilt version for Apache Hadoop 2.7 from one of the possible mirrors. We could have easily downloaded other prebuilt versions as well, as shown in the following screenshot:
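Since the tarball name encodes both the Spark version and the Hadoop version, switching to another prebuilt package mostly means changing two numbers. The following is a minimal sketch of that idea, not part of the Spark distribution: the function names spark_tarball_name and spark_download_url and the SPARK_MIRROR variable are hypothetical, and the mirror base is simply the one used in the command above, which may not stay available.

```shell
#!/bin/sh
# Hypothetical helpers (not shipped with Spark): build the tarball name and
# download URL from a Spark version and a Hadoop version.
# Assumption: the mirror keeps the layout <base>/spark-<version>/<tarball>,
# as in the wget command in the text.
SPARK_MIRROR="${SPARK_MIRROR:-http://www-us.apache.org/dist/spark}"

spark_tarball_name() {
    # $1 = Spark version (e.g. 2.0.0), $2 = Hadoop version (e.g. 2.7)
    printf 'spark-%s-bin-hadoop%s.tgz\n' "$1" "$2"
}

spark_download_url() {
    # Same arguments as spark_tarball_name
    printf '%s/spark-%s/%s\n' "$SPARK_MIRROR" "$1" "$(spark_tarball_name "$1" "$2")"
}

# Example usage (not executed here, since it needs network access and sudo):
#   cd /opt
#   sudo wget "$(spark_download_url 2.0.0 2.7)"
```

Calling spark_download_url 2.0.0 2.7 reproduces the URL used above. Other version combinations only work if the mirror actually hosts that package.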

To uncompress it, execute the following command:
sudo tar xvf spark-2.0.0-bin-hadoop2.7.tgz
To test the installation, run the following command:
/opt/spark-2.0.0-bin-hadoop2.7/bin/run-example SparkPi 10
It will fire up the Spark stack and calculate the value of Pi. The result will be as shown in the following screenshot:
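The example prints a lot of log output around the one line that carries the result. As a small sketch, assuming the SparkPi example reports its estimate on a line beginning with "Pi is roughly" and that Spark's default logging goes to stderr, the result can be isolated with a filter. The helper name extract_pi_line is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: read run-example output on stdin and keep only the
# line that reports the estimate.
# Assumption: SparkPi prints its result as "Pi is roughly <value>".
extract_pi_line() {
    grep 'Pi is roughly'
}

# Example usage (not executed here, since it needs the Spark installation):
#   /opt/spark-2.0.0-bin-hadoop2.7/bin/run-example SparkPi 10 2>/dev/null | extract_pi_line
```

If the filter prints nothing, rerun the command without 2>/dev/null and read the log output for errors.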
