- Machine Learning with Spark (Second Edition)
- Rajdeep Dua, Manpreet Singh Ghotra, Nick Pentreath
- 2021-07-09 21:07:56
Performance improvements in Spark ML over Spark MLlib
Spark 2.0 uses the second-generation Tungsten engine, which builds on ideas from modern compilers and MPP databases. At runtime it emits optimized bytecode that collapses an entire query into a single function. This eliminates virtual function calls and lets intermediate data stay in CPU registers. The technique is called whole-stage code generation.
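
To make the idea concrete, here is a minimal sketch in plain Python; it is not Spark's actual generated code. It contrasts an operator-at-a-time (Volcano-style) pipeline, where every row passes through several dynamically dispatched calls, with the same query collapsed by hand into a single function. The class names `Scan`, `Filter`, and `Sum` and the function `fused_sum` are hypothetical and exist only for this illustration.

```python
# Illustrative sketch only: plain Python, not Spark's generated bytecode.
from typing import Iterator, List


class Operator:
    """Volcano-style operator: rows are pulled from a child one at a time."""

    def rows(self) -> Iterator[int]:
        raise NotImplementedError


class Scan(Operator):
    def __init__(self, data: List[int]) -> None:
        self.data = data

    def rows(self) -> Iterator[int]:
        yield from self.data


class Filter(Operator):
    def __init__(self, child: Operator, threshold: int) -> None:
        self.child = child
        self.threshold = threshold

    def rows(self) -> Iterator[int]:
        # Each row crosses an operator boundary via dynamic dispatch.
        for value in self.child.rows():
            if value > self.threshold:
                yield value


class Sum(Operator):
    def __init__(self, child: Operator) -> None:
        self.child = child

    def rows(self) -> Iterator[int]:
        total = 0
        for value in self.child.rows():
            total += value
        yield total


def fused_sum(data: List[int], threshold: int) -> int:
    """The same scan -> filter -> sum query collapsed into one function."""
    total = 0  # intermediate state lives in a local variable
    for value in data:
        if value > threshold:
            total += value
    return total


data = list(range(10))
volcano_result = next(Sum(Filter(Scan(data), 5)).rows())
fused_result = fused_sum(data, 5)
print(volcano_result, fused_result)  # prints "30 30"
```

Both paths return the same result. The fused version does the work in one loop with no per-row calls between operators, which is the shape of code that whole-stage code generation aims to produce.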

Reference: https://databricks.com/blog/2016/05/11/apache-spark-2-0-technical-preview-easier-faster-and-smarter.html
The following chart and table show the performance improvements for single-line functions between Spark 1.6 and Spark 2.0:

Chart comparing performance improvements in single-line functions between Spark 1.6 and Spark 2.0

Table comparing performance improvements in single-line functions between Spark 1.6 and Spark 2.0