官术网_书友最值得收藏!

What you need for this book

Practical exercises in this book are demonstrated on virtual machines (VM) from Cloudera, Hortonworks, MapR, or prebuilt Spark for Hadoop for getting started easily. The same exercises can be run on a bigger cluster as well.

Prerequisites for using virtual machines on your laptop:

  • RAM: 8 GB and above
  • CPU: At least two virtual CPUs
  • The latest VMWare player or Oracle VirtualBox must be installed for Windows or Linux OS
  • Latest Oracle VirtualBox, or VMWare Fusion for Mac
  • Virtualization enabled in BIOS
  • Browser: Chrome 25+, IE 9+, Safari 6+, or Firefox 18+ recommended (HDP Sandbox will not run on IE 10)
  • Putty
  • WinScP

The Python and Scala programming languages are used in chapters, with more focus on Python. It is assumed that readers have a basic programming background in Java, Scala, Python, SQL, or R, with basic Linux experience. Working experience within Big Data environments on Hadoop platforms would provide a quick jump start for building Spark applications.

主站蜘蛛池模板: 绿春县| 新密市| 和龙市| 庆元县| 平利县| 黑山县| 梁河县| 石楼县| 驻马店市| 文安县| 永泰县| 永昌县| 临夏县| 密云县| 苏尼特右旗| 洛隆县| 五台县| 九台市| 临西县| 桃园市| 德钦县| 晋宁县| 曲松县| 万盛区| 内黄县| 克什克腾旗| 民权县| 揭阳市| 黎川县| 新巴尔虎右旗| 周宁县| 海林市| 深水埗区| 深水埗区| 文昌市| 玉环县| 保定市| 金塔县| 汶川县| 百色市| 怀柔区|