官术网_书友最值得收藏!

Chapter 1. Spark for Machine Learning

This chapter provides an introduction to Apache Spark from a Machine Learning (ML) and data analytics perspective, and also discusses machine learning in relation to Spark computing. Here, we first present an overview of Apache Spark, as well as Spark's advantages for data analytics, in comparison to MapReduce and other computing platforms. Then we discuss five main issues, as below:

  • Machine learning algorithms and libraries
  • Spark RDD and dataframes
  • Machine learning frameworks
  • Spark pipelines
  • Spark notebooks

All of the above are the most important topics that any data scientist or machine learning professional is expected to master, in order to fully take advantage of Apache Spark computing. Specifically, this chapter will cover all of the following six topics.

  • Spark overview and Spark advantages
  • ML algorithms and ML libraries for Spark
  • Spark RDD and dataframes
  • ML Frameworks, RM4Es and Spark computing
  • ML workflows and Spark pipelines
  • Spark notebooks introduction
主站蜘蛛池模板: 名山县| 工布江达县| 盘山县| 保定市| 洛川县| 娱乐| 阿拉善盟| 遂平县| 永寿县| 承德县| 民丰县| 江西省| 道孚县| 东莞市| 盐亭县| 衡阳县| 金秀| 武威市| 荆州市| 板桥市| 新乡市| 景谷| 湖口县| 湾仔区| 英超| 汶上县| 襄城县| 普陀区| 吴川市| 通江县| 忻州市| 黄浦区| 环江| 宜春市| 舒兰市| 高清| 绥棱县| 沙河市| 洛隆县| 东乡县| 桂东县|