官术网_书友最值得收藏!

What you need for this book

Practical exercises in this book are demonstrated on virtual machines (VM) from Cloudera, Hortonworks, MapR, or prebuilt Spark for Hadoop for getting started easily. The same exercises can be run on a bigger cluster as well.

Prerequisites for using virtual machines on your laptop:

  • RAM: 8 GB and above
  • CPU: At least two virtual CPUs
  • The latest VMWare player or Oracle VirtualBox must be installed for Windows or Linux OS
  • Latest Oracle VirtualBox, or VMWare Fusion for Mac
  • Virtualization enabled in BIOS
  • Browser: Chrome 25+, IE 9+, Safari 6+, or Firefox 18+ recommended (HDP Sandbox will not run on IE 10)
  • Putty
  • WinScP

The Python and Scala programming languages are used in chapters, with more focus on Python. It is assumed that readers have a basic programming background in Java, Scala, Python, SQL, or R, with basic Linux experience. Working experience within Big Data environments on Hadoop platforms would provide a quick jump start for building Spark applications.

主站蜘蛛池模板: 武穴市| 肥城市| 科技| 五大连池市| 稷山县| 罗江县| 遵义县| 都匀市| 从江县| 昌邑市| 庆云县| 普宁市| 盐津县| 博爱县| 靖西县| 博湖县| 德庆县| 河源市| 乌拉特前旗| 泸溪县| 于田县| 安国市| 贵德县| 静乐县| 武威市| 绥德县| 乌拉特后旗| 南安市| 辽源市| 凤台县| 夏河县| 丘北县| 定兴县| 青川县| 营口市| 贡觉县| 延安市| 余庆县| 达拉特旗| 牟定县| 肥乡县|