官术网_书友最值得收藏!

Introduction

In the previous chapter, we covered how to integrate data from various data sources. However, simply collecting data is not enough; you also have to ensure the quality of the collected data. If the quality of data used is insufficient, the results of the analysis may be misleading due to biased samples or missing values. Moreover, if the collected data is not well structured and shaped, you may find it hard to correlate and investigate the data. Therefore, data preprocessing and preparation is an essential task that you must perform prior to data analysis.

Those of you familiar with how SQL operates may already understand how to use databases to process data. For example, SQL allows users to add new records with the insert operation, modify data with the update operation, and remove records with the delete operation. However, we do not need to move collected data back to the database; R already provides more powerful and convenient preprocessing functions and packages. In this chapter, we will cover how simple it is to perform data preprocessing in R.

主站蜘蛛池模板: 宁都县| 永丰县| 霸州市| 台东市| 安国市| 辽宁省| 金寨县| 剑阁县| 沐川县| 建宁县| 龙岩市| 庐江县| 黎城县| 翁牛特旗| 子长县| 宜黄县| 河东区| 砚山县| 繁峙县| 玉林市| 赣州市| 余庆县| 江山市| 海晏县| 安乡县| 察雅县| 庆云县| 灵石县| 图们市| 新疆| 崇信县| 灌南县| 衡阳县| 长阳| 平泉县| 额济纳旗| 镇宁| 上高县| 延寿县| 南投县| 两当县|