Preface

This book is a practical guide to realizing the power of analytics in Big Data. It walks the Big Data technologist in you through the process of getting the data ready, applying analytics, and creating value from the data. All of this is done using appropriate design patterns in Pig. We chose Pig for its usefulness, which is evident from the following:

  • Pig's simple language constructs, which are easy to learn, together with its extensibility and its applicability to both structured and unstructured Big Data, make it the preferred choice over other tools.
  • Patterns can be implemented in Pig easily and quickly to derive meaning from the apparent randomness in Big Data.
  • This book guides system architects and developers toward becoming more proficient at creating complex analytics solutions using Pig. It does so by exposing them to a variety of Pig design patterns, UDFs, tools, and best practices.

By reading this book, you will achieve the following goals:

  • Simplify the process of creating complex data pipelines by performing data movement across platforms, data ingestion, profiling, validation, transformations, data reduction, and egress, and use Pig to implement each of these design patterns
  • Create solutions that use patterns for exploratory analysis of multistructured, unmodeled data to derive structure from it and move the data to downstream systems for further analysis
  • Understand how Pig can coexist with other tools in the Hadoop ecosystem to create Big Data solutions using design patterns