官术网_书友最值得收藏!

Preface

JVM has become a clear winner in the race between different methods of scalable data analysis. The power of JVM, strong typing, simplicity of code, composability, and availability of highly abstracted distributed and machine learning frameworks make Scala a clear contender for the top position in large-scale data analysis. Thanks to its dynamic-looking, yet static type system, scientists and programmers coming from Python backgrounds feel at ease with Scala.

This book aims to provide easy-to-use recipes in Apache Spark, a massively scalable distributed computation framework, and Breeze, a linear algebra library on which Spark's machine learning toolkit is built. The book will also help you explore data using interactive visualizations in Apache Zeppelin.

Other than the handful of frameworks and libraries that we will see in this book, there's a host of other popular data analysis libraries and frameworks that are available for Scala. They are by no means lesser beasts, and they could actually fit our use cases well. Unfortunately, they aren't covered as part of this book.

主站蜘蛛池模板: 凉山| 甘德县| 丰都县| 太和县| 洪雅县| 温泉县| 台州市| 广丰县| 英山县| 乌恰县| 锡林郭勒盟| 湘潭市| 桂林市| 大渡口区| 南召县| 彭水| 平利县| 上思县| 余干县| 巴里| 凤阳县| 柳河县| 罗城| 嘉鱼县| 同德县| 景德镇市| 宁夏| 泗洪县| 灵武市| 邵东县| 随州市| 区。| 肇源县| 普宁市| 张家川| 大安市| 砀山县| 郓城县| 贺州市| 开阳县| 称多县|