目錄(70章)
倒序
- coverpage
- Scala Data Analysis Cookbook
- Credits
- About the Author
- About the Reviewers
- www.PacktPub.com
- Support files eBooks discount offers and more
- Preface
- Apache Flink
- Scalding
- Saddle
- Spire
- Akka
- Accord
- What this book covers
- What you need for this book
- Who this book is for
- Sections
- Conventions
- Reader feedback
- Customer support
- Chapter 1. Getting Started with Breeze
- Introduction
- Getting Breeze – the linear algebra library
- Working with vectors
- Working with matrices
- Vectors and matrices with randomly distributed values
- Reading and writing CSV files
- Chapter 2. Getting Started with Apache Spark DataFrames
- Introduction
- Getting Apache Spark
- Creating a DataFrame from CSV
- Manipulating DataFrames
- Creating a DataFrame from Scala case classes
- Chapter 3. Loading and Preparing Data – DataFrame
- Introduction
- Loading more than 22 features into classes
- Loading JSON into DataFrames
- Storing data as Parquet files
- Using the Avro data model in Parquet
- Loading from RDBMS
- Preparing data in Dataframes
- Chapter 4. Data Visualization
- Introduction
- Visualizing using Zeppelin
- Creating scatter plots with Bokeh-Scala
- Creating a time series MultiPlot with Bokeh-Scala
- Chapter 5. Learning from Data
- Introduction
- Supervised and unsupervised learning
- Gradient descent
- Predicting continuous values using linear regression
- Binary classification using LogisticRegression and SVM
- Binary classification using LogisticRegression with Pipeline API
- Clustering using K-means
- Feature reduction using principal component analysis
- Chapter 6. Scaling Up
- Introduction
- Building the Uber JAR
- Submitting jobs to the Spark cluster (local)
- Running the Spark Standalone cluster on EC2
- Running the Spark Job on Mesos (local)
- Running the Spark Job on YARN (local)
- Chapter 7. Going Further
- Introduction
- Using Spark Streaming to subscribe to a Twitter stream
- Using Spark as an ETL tool
- Using StreamingLogisticRegression to classify a Twitter stream using Kafka as a training stream
- Using GraphX to analyze Twitter data
- Index 更新時間:2021-07-09 21:24:31
推薦閱讀
- Modular Programming with Python
- FreeSWITCH 1.8
- 面向STEM的Scratch創新課程
- 云計算通俗講義(第3版)
- 小程序,巧運營:微信小程序運營招式大全
- Visual Basic程序設計實驗指導(第4版)
- bbPress Complete
- INSTANT Passbook App Development for iOS How-to
- Mastering JavaScript Design Patterns(Second Edition)
- Mastering Unity 2D Game Development(Second Edition)
- Mastering Python Design Patterns
- Kivy Cookbook
- Learning Jakarta Struts 1.2: a concise and practical tutorial
- Oracle Database XE 11gR2 Jump Start Guide
- 安卓工程師教你玩轉Android
- 零基礎學Java(第5版)
- Python全棧開發:數據分析
- Learning Puppet
- Instant OpenCV for iOS
- Django 3 Web應用開發從零開始學(視頻教學版)
- ASP.NET MVC 4 Mobile App Development
- QGIS Python Programming Cookbook
- Java高級程序設計實戰教程
- SpringBoot+Vue.js+分布式組件全棧開發訓練營(視頻教學版)
- C語言程序設計教程
- 解決方案架構師修煉之道
- 3D Game Design with Unreal Engine 4 and Blender
- Python數據分析、挖掘與可視化(慕課版)
- 測試有道:微軟測試技術心得
- 軟件設計模式(慕課版)