- Mastering Apache Spark 2.x(Second Edition)
- Romeo Kienzler
- 88字
- 2021-07-02 18:55:32
The Catalyst Optimizer
The Catalyst Optimizer is one of the most exciting developments in Apache Spark. This is because it basically frees your mind from writing effective data processing pipelines, and lets the optimizer do it for you.
In this chapter, we will like to introduce the Catalyst Optimizer of Apache Spark SQL running on top of SQL, DataFrames, and Datasets.
This chapter will cover the following topics:
- The catalog
- Abstract syntax trees
- The optimization process on logical and physical execution plans
- Code generation
- One practical code walk-through
推薦閱讀
- 精通Nginx(第2版)
- Learning C# by Developing Games with Unity 2020
- Apache ZooKeeper Essentials
- Magento 2 Theme Design(Second Edition)
- 精通搜索分析
- Python程序設計案例教程
- 3D少兒游戲編程(原書第2版)
- Learning Selenium Testing Tools(Third Edition)
- 精通網絡視頻核心開發技術
- 前端HTML+CSS修煉之道(視頻同步+直播)
- 運用后端技術處理業務邏輯(藍橋杯軟件大賽培訓教材-Java方向)
- Learning Apache Karaf
- 打開Go語言之門:入門、實戰與進階
- Python 3 數據分析與機器學習實戰
- Mobile Test Automation with Appium