官术网_书友最值得收藏!

Data cleaning

Data cleaning, also known as data cleansing or data scrubbing, is a process consisting of the following steps:

  1. Identifying inaccurate, incomplete, irrelevant, or corrupted data to remove it from further processing
  2. Parsing data, extracting information of interest, or validating whether a string of data is in an acceptable format
  3. Transforming data into a common encoding format, for example, UTF-8 or int32, time scale, or a normalized range
  4. Transforming data into a common data schema; for instance, if we collect temperature measurements from different types of sensors, we might want them to have the same structure
主站蜘蛛池模板: 布尔津县| 鹰潭市| 上蔡县| 湘潭市| 陕西省| 梅河口市| 白玉县| 永济市| 昆山市| 郁南县| 永胜县| 邢台县| 贵溪市| 舞钢市| 晴隆县| 综艺| 南昌县| 边坝县| 泾源县| 油尖旺区| 九台市| 无锡市| 望城县| 福海县| 漠河县| 西峡县| 兴山县| 甘德县| 昂仁县| 新蔡县| 临高县| 成都市| 公安县| 高唐县| 曲阳县| 定西市| 营山县| 微山县| 邓州市| 宁武县| 兴海县|