目錄(70章)
倒序
- coverpage
- Scala Data Analysis Cookbook
- Credits
- About the Author
- About the Reviewers
- www.PacktPub.com
- Support files eBooks discount offers and more
- Preface
- Apache Flink
- Scalding
- Saddle
- Spire
- Akka
- Accord
- What this book covers
- What you need for this book
- Who this book is for
- Sections
- Conventions
- Reader feedback
- Customer support
- Chapter 1. Getting Started with Breeze
- Introduction
- Getting Breeze – the linear algebra library
- Working with vectors
- Working with matrices
- Vectors and matrices with randomly distributed values
- Reading and writing CSV files
- Chapter 2. Getting Started with Apache Spark DataFrames
- Introduction
- Getting Apache Spark
- Creating a DataFrame from CSV
- Manipulating DataFrames
- Creating a DataFrame from Scala case classes
- Chapter 3. Loading and Preparing Data – DataFrame
- Introduction
- Loading more than 22 features into classes
- Loading JSON into DataFrames
- Storing data as Parquet files
- Using the Avro data model in Parquet
- Loading from RDBMS
- Preparing data in Dataframes
- Chapter 4. Data Visualization
- Introduction
- Visualizing using Zeppelin
- Creating scatter plots with Bokeh-Scala
- Creating a time series MultiPlot with Bokeh-Scala
- Chapter 5. Learning from Data
- Introduction
- Supervised and unsupervised learning
- Gradient descent
- Predicting continuous values using linear regression
- Binary classification using LogisticRegression and SVM
- Binary classification using LogisticRegression with Pipeline API
- Clustering using K-means
- Feature reduction using principal component analysis
- Chapter 6. Scaling Up
- Introduction
- Building the Uber JAR
- Submitting jobs to the Spark cluster (local)
- Running the Spark Standalone cluster on EC2
- Running the Spark Job on Mesos (local)
- Running the Spark Job on YARN (local)
- Chapter 7. Going Further
- Introduction
- Using Spark Streaming to subscribe to a Twitter stream
- Using Spark as an ETL tool
- Using StreamingLogisticRegression to classify a Twitter stream using Kafka as a training stream
- Using GraphX to analyze Twitter data
- Index 更新時間:2021-07-09 21:24:31
推薦閱讀
- AngularJS入門與進階
- 玩轉Scratch少兒趣味編程
- Design Principles for Process:driven Architectures Using Oracle BPM and SOA Suite 12c
- C語言程序設計(第2 版)
- 數據庫原理及應用(Access版)第3版
- Java面向對象軟件開發
- 算法精粹:經典計算機科學問題的Java實現
- 單片機應用技術
- Python Network Programming Cookbook(Second Edition)
- TMS320LF240x芯片原理、設計及應用
- Node.js 12實戰
- C++程序設計教程
- 實戰Python網絡爬蟲
- 你好!Java
- Mastering Citrix? XenDesktop?
- 嵌入式網絡編程
- Java無難事:詳解Java編程核心思想與技術(第2版)
- WordPress Responsive Theme Design
- Learning ArcGIS Geodatabases
- 大學計算機基礎
- 數據庫系統原理及應用(SQL Server 2012)
- CAE分析大系:ABAQUS有限元分析從入門到精通
- Learning Python
- C++程序設計習題解析與實踐教程
- 深入淺出Java編程
- Go語言定制指南
- Linux服務器配置與管理項目化教程
- Visual C#.NET程序設計教程(第2版)
- Spring Essentials
- Python游戲設計案例實戰