- Apache Spark Machine Learning Blueprints
- Alex Liu
- 156字
- 2021-07-16 10:39:48
Chapter 1. Spark for Machine Learning
This chapter provides an introduction to Apache Spark from a Machine Learning (ML) and data analytics perspective, and also discusses machine learning in relation to Spark computing. Here, we first present an overview of Apache Spark, as well as Spark's advantages for data analytics, in comparison to MapReduce and other computing platforms. Then we discuss five main issues, as below:
- Machine learning algorithms and libraries
- Spark RDD and dataframes
- Machine learning frameworks
- Spark pipelines
- Spark notebooks
All of the above are the most important topics that any data scientist or machine learning professional is expected to master, in order to fully take advantage of Apache Spark computing. Specifically, this chapter will cover all of the following six topics.
- Spark overview and Spark advantages
- ML algorithms and ML libraries for Spark
- Spark RDD and dataframes
- ML Frameworks, RM4Es and Spark computing
- ML workflows and Spark pipelines
- Spark notebooks introduction
推薦閱讀
- 腦動力:C語言函數速查效率手冊
- 基于LabWindows/CVI的虛擬儀器設計與應用
- Dreamweaver 8中文版商業案例精粹
- 自動檢測與傳感技術
- Python Algorithmic Trading Cookbook
- OpenStack Cloud Computing Cookbook(Second Edition)
- Hybrid Cloud for Architects
- 工業控制系統測試與評價技術
- 學練一本通:51單片機應用技術
- 電腦上網入門
- Java組件設計
- 機器學習案例分析(基于Python語言)
- Apache Spark Quick Start Guide
- Getting Started with Tableau 2018.x
- 巧學活用Photoshop