
What this book covers

Chapter 1, Getting Started with Breeze, serves as an introduction to the Breeze linear algebra library's API.

Chapter 2, Getting Started with Apache Spark DataFrames, introduces the DataFrame, a powerful yet intuitive data abstraction that resembles a relational table.

Chapter 3, Loading and Preparing Data – DataFrame, showcases the loading of datasets into Spark DataFrames from a variety of sources, while also introducing the Parquet serialization format.

Chapter 4, Data Visualization, introduces Apache Zeppelin for interactive data visualization using Spark SQL and Spark user-defined functions (UDFs). We also briefly discuss Bokeh-Scala, a Scala port of Bokeh (a highly customizable visualization library).

Chapter 5, Learning from Data, focuses on machine learning using Spark MLlib.

Chapter 6, Scaling Up, walks through various deployment alternatives for Spark applications: standalone, YARN, and Mesos.

Chapter 7, Going Further, briefly introduces Spark Streaming and GraphX.
