- Fast Data Processing with Spark 2(Third Edition)
- Krishna Sankar
- 213字
- 2021-08-20 10:27:07
A single machine
A single machine is the simplest use case for Spark. It is also a great way to sanity check your build. In spark/bin
, there is a shell script called run-example
, which can be used to launch a Spark job. The run-example
script takes the name of a Spark class and some arguments. Earlier, we used the run-example
script from the /bin
directory to calculate the value of Pi. There is a collection of the sample Spark jobs in examples/src/main/scala/org/apache/spark/examples/
.
All of the sample programs take the parameter, master
(the cluster manager), which can be the URL of a distributed cluster or local[N]
, where N
is the number of threads.
Going back to our run-example
script, it invokes the more general bin/spark-submit
script. For now, let's stick with the run-example
script.
To run GroupByTest
locally, try running the following command:
bin/run-example GroupByTest
This line will produce an output like this given here:
14/11/15 06:28:40 INFO SparkContext: Job finished: count at GroupByTest.scala:51, took 0.494519333 s 2000
Note
All the examples in this book can be run on a Spark installation on a local machine. So you can read through the rest of the chapter for additional information after you have gotten some hands-on exposure to Spark running on your local machine.
- C++ Primer習(xí)題集(第5版)
- Apache ZooKeeper Essentials
- Progressive Web Apps with React
- C# 從入門到項(xiàng)目實(shí)踐(超值版)
- 區(qū)塊鏈架構(gòu)與實(shí)現(xiàn):Cosmos詳解
- Web全棧工程師的自我修養(yǎng)
- 云計(jì)算通俗講義(第3版)
- Building Cross-Platform Desktop Applications with Electron
- 差分進(jìn)化算法及其高維多目標(biāo)優(yōu)化應(yīng)用
- C語言程序設(shè)計(jì)案例精粹
- 小程序開發(fā)原理與實(shí)戰(zhàn)
- Bootstrap 4 Cookbook
- HTML5秘籍(第2版)
- Procedural Content Generation for C++ Game Development
- 動(dòng)手學(xué)數(shù)據(jù)結(jié)構(gòu)與算法