官术网_书友最值得收藏!

Preface

Apache Spark is a flexible in-memory framework that allows the processing of both batch and real-time data in a distributed way. Its unified engine has made it quite popular for big data use cases.

This book will help you to quickly get started with Apache Spark 2.x and help you write efficient big data applications for a variety of use cases. You will get to grip with the low-level details as well as core concepts of Apache Spark, and the way they can be used to solve big data problems. You will be introduced to RDD and DataFrame APIs, and their corresponding transformations and actions. 

This book will help you learn Spark's components for machine learning, stream processing, and graph analysis. At the end of the book, you'll learn different optimization techniques for writing efficient Spark code.

主站蜘蛛池模板: 称多县| 光泽县| 平潭县| 罗城| 穆棱市| 石城县| 吉水县| 榆树市| 独山县| 隆安县| 高密市| 温州市| 涪陵区| 宁阳县| 梁河县| 吴桥县| 双柏县| 宁河县| 彭州市| 广灵县| 丰顺县| 合作市| 宜昌市| 雷波县| 苏尼特右旗| 伊金霍洛旗| 正安县| 南江县| 阳曲县| 常熟市| 东源县| 梁河县| 泾川县| 石泉县| 永德县| 嘉峪关市| 莎车县| 南宫市| 响水县| 茌平县| 汉川市|