官术网_书友最值得收藏!

Preface

JVM has become a clear winner in the race between different methods of scalable data analysis. The power of JVM, strong typing, simplicity of code, composability, and availability of highly abstracted distributed and machine learning frameworks make Scala a clear contender for the top position in large-scale data analysis. Thanks to its dynamic-looking, yet static type system, scientists and programmers coming from Python backgrounds feel at ease with Scala.

This book aims to provide easy-to-use recipes in Apache Spark, a massively scalable distributed computation framework, and Breeze, a linear algebra library on which Spark's machine learning toolkit is built. The book will also help you explore data using interactive visualizations in Apache Zeppelin.

Other than the handful of frameworks and libraries that we will see in this book, there's a host of other popular data analysis libraries and frameworks that are available for Scala. They are by no means lesser beasts, and they could actually fit our use cases well. Unfortunately, they aren't covered as part of this book.

主站蜘蛛池模板: 天峨县| 商城县| 许昌市| 普安县| 阳西县| 车致| 社会| 德惠市| 原平市| 南溪县| 陈巴尔虎旗| 蕉岭县| 邵东县| 鹤庆县| 海城市| 木里| 临夏市| 汶上县| 沁源县| 淮安市| 陆川县| 永平县| 邻水| 三台县| 灵川县| 临沂市| 渑池县| 正蓝旗| 海门市| 伽师县| 克什克腾旗| 水富县| 新龙县| 吐鲁番市| 丹江口市| 澎湖县| 昆山市| 江陵县| 疏附县| 永寿县| 蓬溪县|