- Fast Data Processing with Spark 2(Third Edition)
- Krishna Sankar
- 238字
- 2021-08-20 10:27:11
Building Spark applications
Using Spark in an interactive mode with the Spark shell is very good for quick prototyping; however for developing applications, we need an IDE. The choices for the Spark IDE have come a long way since the days of Spark 1.0. One can use an array of the Spark IDEs for developing algorithms, data wrangling (that is, exploring data), and modeling analytics applications. As a general rule of thumb, iPython and Zeppelin are used for data exploration IDEs. The language of choice for iPython is Python and Scala/Java for Zeppelin. This is a general observation; all of them can handle the major languages; Scala, Java, Python, and SQL. For developing Scala and Java, the preferred IDE is Eclipse and IntelliJ. We will mostly use the Spark shell (and occasionally iPython) in this book, as our focus is data wrangling and understanding the Spark APIs. Of course, deploying Spark applications require compiling for Java and Scala.
Building the Spark jobs is a bit trickier than building a normal application as all dependencies have to be available on all the machines that are in your cluster.
In this chapter, we will first look at iPython and Eclipse, and then cover the process of building a Java and Scala Spark job with Maven, and learn to build the Spark jobs with a non-Maven aware build system. A reference website for building Spark is at http://spark.apache.org/docs/latest/building-spark.html.
- Web Application Development with R Using Shiny(Second Edition)
- 64位匯編語言的編程藝術
- 深度強化學習算法與實踐:基于PyTorch的實現
- TypeScript實戰指南
- 大學計算機基礎實驗指導
- Julia高性能科學計算(第2版)
- Mastering Web Application Development with AngularJS
- Learning AWS
- MINECRAFT編程:使用Python語言玩轉我的世界
- Web App Testing Using Knockout.JS
- OpenCV 3計算機視覺:Python語言實現(原書第2版)
- Python自然語言理解:自然語言理解系統開發與應用實戰
- IoT Projects with Bluetooth Low Energy
- Photoshop CC移動UI設計案例教程(全彩慕課版·第2版)
- Java程序設計實用教程(第2版)