- Fast Data Processing with Spark 2 (Third Edition)
- Krishna Sankar
- 51 words
- 2021-08-20 10:27:09
Spark on YARN
YARN is Apache Hadoop's NextGen Resource Manager. The Spark project provides an easy way to schedule jobs on YARN once you have a Spark assembly built. The Spark documentation page, http://spark.apache.org/docs/latest/running-on-yarn.html, has the configuration details for running on YARN. We enabled YARN support earlier, when we compiled Spark with the -Pyarn switch.
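
As a rough illustration of what scheduling a job on YARN can look like, the following Python sketch assembles a spark-submit command line that targets YARN. The flags used (--master yarn, --deploy-mode, --class, --num-executors, --executor-memory) are standard spark-submit options, but the helper name build_yarn_submit, the jar path, and the resource values are assumptions made for this example and do not come from the book. It also assumes that HADOOP_CONF_DIR or YARN_CONF_DIR points at your cluster's configuration before the command is actually run.

```python
import shlex


def build_yarn_submit(app_jar, main_class, deploy_mode="cluster",
                      num_executors=2, executor_memory="1g", app_args=()):
    """Return the argv list for a spark-submit invocation that targets YARN.

    Hypothetical helper for illustration only; it builds the command
    and does not run it.
    """
    if deploy_mode not in ("client", "cluster"):
        raise ValueError("deploy_mode must be 'client' or 'cluster'")
    return [
        "spark-submit",
        "--master", "yarn",              # let YARN manage the resources
        "--deploy-mode", deploy_mode,    # where the driver runs
        "--class", main_class,           # application entry point
        "--num-executors", str(num_executors),
        "--executor-memory", executor_memory,
        app_jar,
        *app_args,
    ]


# Hypothetical jar path; SparkPi is the example class shipped with Spark.
cmd = build_yarn_submit(
    "path/to/spark-examples.jar",
    "org.apache.spark.examples.SparkPi",
    app_args=["10"],
)
print(shlex.join(cmd))
```

To run the job, the printed command can be pasted into a shell on a machine that has Spark and the Hadoop client configuration installed, or the list can be passed to subprocess.run.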