Cleaning data

When working with data, you can generally expect to find human errors, missing entries, and numerical outliers. These types of errors usually need to be corrected, handled, or removed to prepare a dataset for analysis.

In Chapter 5, Manipulating Text Data - An Introduction to Regular Expressions, I will demonstrate how to use regular expressions, a tool for identifying, extracting, and modifying patterns in text data. The chapter includes a project that uses regular expressions to extract street names.
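As a rough preview of the idea, the following Python sketch pulls street names out of a few address strings. The sample addresses, the pattern, and the helper name `extract_street_name` are all assumptions made for illustration; they are not taken from the Chapter 5 project, and real address data would need a broader pattern.

```python
import re

# Hypothetical address strings; not data from the book's project.
addresses = [
    "123 Main Street",
    "45 Elm Avenue, Apt 2",
    "9 Oak Road",
    "no address given",
]

# Assumed pattern: a house number, then one or more words ending in a
# street-type suffix (only Street, Avenue, and Road are recognized here).
STREET_PATTERN = re.compile(
    r"^\d+\s+((?:[A-Za-z]+\s)+(?:Street|Avenue|Road))\b"
)


def extract_street_name(address):
    """Return the street name in address, or None if the pattern does not match."""
    match = STREET_PATTERN.search(address)
    return match.group(1) if match else None


street_names = [extract_street_name(a) for a in addresses]
print(street_names)
```

Entries that do not fit the pattern come back as `None`, which makes it easy to review the strings the pattern failed to handle.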

In Chapter 6, Cleaning Numerical Data - An Introduction to R and RStudio, I will demonstrate how to use RStudio to perform two common tasks in cleaning numerical data: outlier detection and NA handling.
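Chapter 6 works in R, so the Python sketch below is only meant to illustrate the two tasks, not to reproduce the chapter's method. It assumes missing entries are represented as `None` and are simply dropped, and it flags outliers with the common 1.5 x IQR rule; both choices, along with the sample readings, are assumptions for this example.

```python
import statistics

# Hypothetical readings; None stands in for a missing (NA) entry.
readings = [10.0, 12.0, None, 11.0, 13.0, 95.0, 12.0, None, 11.0]

# NA handling (assumed strategy): drop the missing entries.
observed = [x for x in readings if x is not None]

# Outlier detection (assumed rule): flag values more than 1.5 * IQR
# below the first quartile or above the third quartile.
q1, _, q3 = statistics.quantiles(observed, n=4)
iqr = q3 - q1
lower = q1 - 1.5 * iqr
upper = q3 + 1.5 * iqr

outliers = [x for x in observed if x < lower or x > upper]
cleaned = [x for x in observed if lower <= x <= upper]

print("outliers:", outliers)
print("cleaned:", cleaned)
```

Dropping missing values is the simplest option; depending on the dataset, replacing them with a typical value such as the median may be more appropriate.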
