官术网_书友最值得收藏!

Data cleaning

Data cleaning, also known as data cleansing or data scrubbing, is a process consisting of the following steps:

  1. Identifying inaccurate, incomplete, irrelevant, or corrupted data to remove it from further processing
  2. Parsing data, extracting information of interest, or validating whether a string of data is in an acceptable format
  3. Transforming data into a common encoding format, for example, UTF-8 or int32, time scale, or a normalized range
  4. Transforming data into a common data schema; for instance, if we collect temperature measurements from different types of sensors, we might want them to have the same structure
主站蜘蛛池模板: 彰化县| 普定县| 隆德县| 青冈县| 霸州市| 南投市| 临清市| 西丰县| 奎屯市| 黄陵县| 延吉市| 武清区| 秭归县| 汝南县| 丹东市| 中江县| 闻喜县| 巴塘县| 进贤县| 安岳县| 定结县| 泽普县| 东乌珠穆沁旗| 德惠市| 大宁县| 南华县| 聂荣县| 拉孜县| 剑阁县| 南部县| 南充市| 平阳县| 鄱阳县| 星子县| 搜索| 高雄县| 万宁市| 云梦县| 卢湾区| 财经| 张家口市|