- Hands-On Big Data Analytics with PySpark
- Rudy Lai, Bartłomiej Potaczek
- 144 words
- 2021-06-24 15:52:32
Installing PySpark and Setting up Your Development Environment
In this chapter, we introduce Spark and its core concepts, such as SparkContext, along with Spark tools such as SparkConf and the Spark shell. The only prerequisites are a knowledge of basic Python concepts and the desire to seek insight from big data. We will learn how to analyze and discover patterns with Spark SQL to improve our business intelligence. You will also be able to iterate quickly on your solutions by setting up PySpark on your own computer. By the end of the book, you will be able to work with real-life messy datasets using PySpark to get practical big data experience.
In this chapter, we will cover the following topics:
- An overview of PySpark
- Setting up Spark on Windows and PySpark
- Core concepts in Spark and PySpark