官术网_书友最值得收藏!

What this book covers

Chapter 1, Programming with Data, discusses the context of data wrangling and offers a high-level overview of the rest of the book's content.

Section 1: A generalized programming approach to data wrangling

Chapter 2, Introduction to Programming in Python, introduces programming using the Python programming language, which used in most of the chapters of the book.

Chapter 3, Reading, Exploring, and Modifying Data - Part I, is an overview of the steps for processing a data file and an introduction to JSON data.

Chapter 4, Reading, Exploring, and Modifying Data - Part II, continues from the previous chapter, extending to the CSV and XML data formats.

Chapter 5, Manipulating Text Data - An Introduction to Regular Expressions, is an introduction to regular expressions with the application of extracting street names from street addresses.

Section 2: A formulated approach to data wrangling

Chapter 6, Cleaning Numerical Data - An Introduction to R and RStudio, introduces R and RStudio with the application of cleaning numerical data.

Chapter 7, Simplifying Data Manipulation with dplyr, is an introduction to the dplyr package for R, which can be used to express multiple data processing steps elegantly and concisely.

Section 3: Advanced methods for retrieving and storing data

Chapter 8, Getting Data from the Web, is an introduction to APIs. This chapter shows how to extract data from APIs using Python.

Chapter 9, Working with Large Datasets, has an overview of the issues when working with large amounts of data and a very brief introduction to MongoDB.

主站蜘蛛池模板: 蓝山县| 彩票| 安图县| 收藏| 温宿县| 策勒县| 剑河县| 大邑县| 松溪县| 新竹市| 无锡市| 冕宁县| 安新县| 城口县| 崇礼县| 珠海市| 肇东市| 宣汉县| 塔城市| 康平县| 商河县| 株洲县| 泸溪县| 公安县| 蒙山县| 南江县| 和平县| 深圳市| 阳谷县| 诸城市| 永春县| 万载县| 托克逊县| 五台县| 阿合奇县| 武隆县| 修武县| 资兴市| 孙吴县| 博野县| 洱源县|