- Hands-On Deep Learning with Apache Spark
- Guglielmo Iozzia
- 160字
- 2021-07-02 13:34:20
The Apache Spark Ecosystem
Apache Spark (http://spark.apache.org/) is an open source, fast cluster-computing platform. It was originally created by AMPLab at the University of California, Berkeley. Its source code was later donated to the Apache Software Foundation (https://www.apache.org/). Spark comes with a very fast computation speed because data is loaded into distributed memory (RAM) across a cluster of machines. Not only can data be quickly transformed, but also cached on demand for a variety of use cases. Compared to Hadoop MapReduce, it runs programs up to 100 times faster when the data fits in memory, or 10 times faster on disk. Spark provides support for four programming languages: Java, Scala, Python, and R. This book covers the Spark APIs (and deep learning frameworks) for Scala (https://www.scala-lang.org/) and Python (https://www.python.org/) only.
This chapter will cover the following topics:
- Apache Spark fundamentals
- Getting Spark
- Resilient Distributed Dataset (RDD) programming
- Spark SQL, Datasets, and DataFrames
- Spark Streaming
- Cluster mode using a different manager
- 高效能辦公必修課:Word圖文處理
- Cinema 4D R13 Cookbook
- 基于LPC3250的嵌入式Linux系統(tǒng)開發(fā)
- WOW!Illustrator CS6完全自學(xué)寶典
- 腦動(dòng)力:PHP函數(shù)速查效率手冊
- 網(wǎng)上生活必備
- 計(jì)算機(jī)網(wǎng)絡(luò)應(yīng)用基礎(chǔ)
- Maya 2012從入門到精通
- 大型數(shù)據(jù)庫管理系統(tǒng)技術(shù)、應(yīng)用與實(shí)例分析:SQL Server 2005
- INSTANT Autodesk Revit 2013 Customization with .NET How-to
- 具比例時(shí)滯遞歸神經(jīng)網(wǎng)絡(luò)的穩(wěn)定性及其仿真與應(yīng)用
- 精通數(shù)據(jù)科學(xué)算法
- Learning Azure Cosmos DB
- INSTANT Puppet 3 Starter
- Web編程基礎(chǔ)