- Machine Learning in Java
- AshishSingh Bhatia Bostjan Kaluza
- 97字
- 2021-06-10 19:29:56
Data cleaning
Data cleaning, also known as data cleansing or data scrubbing, is a process consisting of the following steps:
- Identifying inaccurate, incomplete, irrelevant, or corrupted data to remove it from further processing
- Parsing data, extracting information of interest, or validating whether a string of data is in an acceptable format
- Transforming data into a common encoding format, for example, UTF-8 or int32, time scale, or a normalized range
- Transforming data into a common data schema; for instance, if we collect temperature measurements from different types of sensors, we might want them to have the same structure
推薦閱讀
- Hands-On Internet of Things with MQTT
- Getting Started with Clickteam Fusion
- Matplotlib 3.0 Cookbook
- Hands-On Linux for Architects
- Multimedia Programming with Pure Data
- 計算機網絡技術基礎
- Learn CloudFormation
- The Python Workshop
- HTML5 Canvas Cookbook
- Excel 2010函數與公式速查手冊
- Mastering Exploratory Analysis with pandas
- Excel 2007終極技巧金典
- 網絡服務器搭建與管理
- Mastering Ceph
- Photoshop CS4數碼照片處理入門、進階與提高