官术网_书友最值得收藏!

  • Pig Design Patterns
  • Pradeep Pasupuleti
  • 257字
  • 2021-07-16 12:07:56

Preface

This book is a practical guide to realizing the power of analytics in Big Data. It walks the Big Data technologist in you through the process of getting the data ready, applying analytics, and creating a value out of the data. All of this is done using appropriate design patterns in Pig. We have chosen Pig to demonstrate how useful it is, which is evident from the following:

  • The inherent amenability of Pig through its simple language constructs, which can be learned very easily, and its extensibility and applicability to structured and unstructured Big Data makes it the preferred choice over others.
  • The ease and speed with which patterns can be implemented by Pig to derive meaning out of the apparent randomness in any Big Data is commendable.
  • This book guides system architects and developers so they become more proficient at creating complex analytics solutions using Pig. It does so by exposing them to a variety of Pig design patterns, UDFs, tools, and best practices.

By reading this book, you will achieve the following goals:

  • Simplify the process of creating complex data pipelines by performing data movement across platforms, data ingestion, profiling, validation, transformations, data reduction, and egress; you'll also be able to use Pig in these design patterns
  • Create solutions that use patterns for exploratory analysis of multistructured unmodeled data to derive structure from it and move the data to downstream systems for further analysis
  • Decipher how Pig can coexist with other tools in the Hadoop ecosystem to create Big Data solutions using design patterns
主站蜘蛛池模板: 神农架林区| 通辽市| 安康市| 慈利县| 北流市| 万盛区| 永定县| 柯坪县| 宁德市| 壶关县| 威宁| 宁津县| 阿拉善左旗| 容城县| 襄汾县| 平果县| 大洼县| 西乡县| 新化县| 鸡泽县| 长阳| 焉耆| 襄城县| 新密市| 邵武市| 永胜县| 秦皇岛市| 寻乌县| 宜兰市| 新丰县| 长丰县| 香港| 山东省| 芦溪县| 浦城县| 大化| 山东| 会昌县| 依兰县| 襄樊市| 宁德市|