官术网_书友最值得收藏!

Installing Pyspark and Setting up Your Development Environment

In this chapter, we are going to introduce Spark and learn the core concepts, such as, SparkContext, and Spark tools such as SparkConf and Spark shell. The only prerequisite is the knowledge of basic Python concepts and the desire to seek insight from big data. We will learn how to analyze and discover patterns with Spark SQL to improve our business intelligence. Also, you will be able to quickly iterate through your solution by setting to PySpark for your own computer. By the end of the book, you will be able to work with real-life messy data sets using PySpark to get practical big data experience.

In this chapter, we will cover the following topics:

  • An overview of PySpark
  • Setting up Spark on Windows and PySpark
  • Core concepts in Spark and PySpark
主站蜘蛛池模板: 探索| 海门市| 花垣县| 新化县| 定兴县| 方城县| 安岳县| 钟山县| 崇信县| 乌拉特中旗| 石嘴山市| 维西| 鲁甸县| 丹寨县| 青阳县| 无极县| 基隆市| 格尔木市| 平远县| 富阳市| 晋江市| 黄平县| 鄂伦春自治旗| 绍兴市| 隆子县| 汤阴县| 瑞昌市| 牡丹江市| 永寿县| 红桥区| 安顺市| 辽阳市| 屏山县| 沧州市| 禹城市| 新源县| 抚宁县| 定襄县| 闸北区| 郴州市| 阳曲县|