官术网_书友最值得收藏!

To get the most out of this book

All the examples have been implemented in Scala with some open source libraries, including Apahe Spark MLlib/ML and Deeplearning4j. However, to get the best out of this, you should have a powerful computer and software stack.

A Linux distribution is preferable (for example, Debian, Ubuntu, or CentOS). For example, for Ubuntu, it is recommended to have at least a 14.04 (LTS) 64-bit complete installation on VMware Workstation Player 12 or VirtualBox. You can run Spark jobs on Windows (7/8/10) or macOS X (10.4.7+) as well.

A computer with a Core i5 processor, enough storage (for example, for running Spark jobs, you'll need at least 50 GB of free disk storage for standalone cluster and for the SQL warehouse), and at least 16 GB RAM are recommended. And optionally, if you want to perform the neural network training on the GPU (for the last chapter only), the NVIDIA GPU driver has to be installed with CUDA and CuDNN configured.

The following APIs and tools are required in order to execute the source code in this book:

  • Java/JDK, version 1.8
  • Scala, version 2.11.8
  • Spark, version 2.2.0 or higher
  • Spark csv_2.11, version 1.3.0
  • ND4j backend version nd4j-cuda-9.0-platform for GPU; otherwise, nd4j-native
  • ND4j, version 1.0.0-alpha
  • DL4j, version 1.0.0-alpha          
  • Datavec, version 1.0.0-alpha
  • Arbiter, version 1.0.0-alpha
  • Eclipse Mars or Luna (latest version) or IntelliJ IDEA
  • Maven Eclipse plugin (2.9 or higher)
  • Maven compiler plugin for Eclipse (2.3.2 or higher)
  • Maven assembly plugin for Eclipse (2.4.1 or higher)
主站蜘蛛池模板: 三江| 西和县| 宝清县| 惠东县| 休宁县| 闸北区| 扎赉特旗| 繁峙县| 库尔勒市| 宝鸡市| 清镇市| 永泰县| 金塔县| 黄石市| 神池县| 扎囊县| 永定县| 象山县| 施甸县| 舒兰市| 普安县| 磴口县| 黄石市| 铜川市| 常州市| 辉县市| 余庆县| 定边县| 三都| 安新县| 开原市| 来安县| 峨山| 大港区| 延寿县| 怀宁县| 龙岩市| 枣阳市| 依兰县| 高陵县| 株洲县|