- Apache Spark 2.x for Java Developers
- Sourav Gulati, Sumit Kumar
- 320 words
- 2021-07-02 19:01:50
What this book covers
Chapter 1, Introduction to Spark, covers the history of big data, its dimensions, and basic concepts of Hadoop and Spark.
Chapter 2, Revisiting Java, refreshes the concepts of core Java and focuses on the newer features of Java 8 that are leveraged while developing Spark applications.
Chapter 3, Let Us Spark, provides step-by-step instructions so that the reader becomes familiar with installing Apache Spark in standalone mode along with its dependencies.
Chapter 4, Understanding the Spark Programming Model, explains the word count problem in Apache Spark using Java while setting up an IDE along the way.
Chapter 5, Working with Data and Storage, teaches you how to read/store data in Spark from/to different storage systems.
Chapter 6, Spark on Cluster, discusses the cluster setup process and some popular cluster managers available with Spark in detail. After this chapter, you will be able to execute Spark jobs effectively in distributed mode.
Chapter 7, Spark Programming Model – Advanced, covers partitioning concepts in RDD along with advanced transformations and actions in Spark.
Chapter 8, Working with Spark SQL, discusses Spark SQL and its related concepts such as DataFrame, Dataset, and UDF. We will also discuss SQLContext and the newly introduced SparkSession.
Chapter 9, Near-Real-Time Processing with Spark Streaming, covers the internals of Spark Streaming, reading streams of data in Spark from various data sources with examples, and the newer extension of stream processing in Spark known as Structured Streaming.
Chapter 10, Machine Learning Analytics with Spark MLlib, introduces the concepts of machine learning and then moves on to their implementation using the Apache Spark MLlib libraries. We also discuss some real-world problems using Spark MLlib.
Chapter 11, Learning Spark GraphX, looks into another module of Spark, GraphX; we will discover types of GraphX RDD and various operations associated with them. We will also discuss the use cases of GraphX implementation.
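To give a flavor of the Java 8 style that Chapter 2 revisits and the word count problem that Chapter 4 works through, here is a minimal sketch in plain Java, without Spark. The class name `Java8WordCount`, the method `countWords`, and the sample input are assumptions made for illustration; they are not taken from the book, and the book's own Spark-based version will differ.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.function.Function;
import java.util.stream.Collectors;

// Hypothetical illustration: word count with Java 8 lambdas and streams (no Spark involved).
public class Java8WordCount {

    // Counts how often each whitespace-separated word occurs across all lines,
    // ignoring case. The result is sorted by word because a TreeMap is used.
    static Map<String, Long> countWords(List<String> lines) {
        return lines.stream()
                // Split every line into words and flatten into one stream of words.
                .flatMap(line -> Arrays.stream(line.trim().split("\\s+")))
                // Drop the empty token produced by blank lines.
                .filter(word -> !word.isEmpty())
                // Normalize case with a method reference.
                .map(String::toLowerCase)
                // Group identical words and count the members of each group.
                .collect(Collectors.groupingBy(Function.identity(), TreeMap::new, Collectors.counting()));
    }

    public static void main(String[] args) {
        // Sample input, assumed for this sketch.
        List<String> lines = Arrays.asList("Spark with Java", "Java 8 lambdas with Spark");
        countWords(lines).forEach((word, count) -> System.out.println(word + " -> " + count));
    }
}
```

The split, normalize, and count-by-key steps here loosely mirror the shape of a distributed word count, which is one reason lambdas and method references are useful background before writing Spark applications in Java.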