- Hands-On Deep Learning with Apache Spark
- Guglielmo Iozzia
- 160字
- 2021-07-02 13:34:20
The Apache Spark Ecosystem
Apache Spark (http://spark.apache.org/) is an open source, fast cluster-computing platform. It was originally created by AMPLab at the University of California, Berkeley. Its source code was later donated to the Apache Software Foundation (https://www.apache.org/). Spark comes with a very fast computation speed because data is loaded into distributed memory (RAM) across a cluster of machines. Not only can data be quickly transformed, but also cached on demand for a variety of use cases. Compared to Hadoop MapReduce, it runs programs up to 100 times faster when the data fits in memory, or 10 times faster on disk. Spark provides support for four programming languages: Java, Scala, Python, and R. This book covers the Spark APIs (and deep learning frameworks) for Scala (https://www.scala-lang.org/) and Python (https://www.python.org/) only.
This chapter will cover the following topics:
- Apache Spark fundamentals
- Getting Spark
- Resilient Distributed Dataset (RDD) programming
- Spark SQL, Datasets, and DataFrames
- Spark Streaming
- Cluster mode using a different manager
- 高效能辦公必修課:Word圖文處理
- 嵌入式系統應用
- 商戰數據挖掘:你需要了解的數據科學與分析思維
- IoT Penetration Testing Cookbook
- 21天學通C++
- 大數據技術與應用
- CompTIA Network+ Certification Guide
- 菜鳥起飛系統安裝與重裝
- Deep Reinforcement Learning Hands-On
- 悟透AutoCAD 2009案例自學手冊
- 基于企業網站的顧客感知服務質量評價理論模型與實證研究
- C++程序設計基礎(上)
- Photoshop CS5圖像處理入門、進階與提高
- INSTANT Adobe Story Starter
- Hands-On Business Intelligence with Qlik Sense