Chapter 1. What It's All About

This book is about Hadoop, an open source framework for large-scale data processing. Before we get into the details of the technology and its use in later chapters, it is important to spend a little time exploring the trends that led to Hadoop's creation and its enormous success.

Hadoop was not created in a vacuum; it exists because of the explosion in the amount of data being created and consumed, and because this data deluge now reaches small startups and not just huge multinationals. At the same time, other trends have changed how software and systems are deployed, with cloud resources used alongside, or even in preference to, more traditional infrastructure.

This chapter will explore some of these trends and explain in detail the specific problems Hadoop seeks to solve and the drivers that shaped its design.

In the rest of this chapter we shall:

  • Learn about the big data revolution
  • Understand what Hadoop is and how it can extract value from data
  • Look into cloud computing and understand what Amazon Web Services provides
  • See how powerful the combination of big data processing and cloud computing can be
  • Get an overview of the topics covered in the rest of this book

So let's get on with it!
