- Statistics for Data Science
- James D. Miller
- 184字
- 2021-07-02 14:58:57
Contextual data issues
A lot of the previously mentioned data issues can be automatically detected and even corrected. The issues may have been originally caused by user entry errors, by corruption in transmission or storage, or by different definitions or understandings of similar entities in different data sources. In data science, there is more to think about.
During data cleaning, a data scientist will attempt to identify patterns within the data, based on a hypothesis or assumption about the context of the data and its intended purpose. In other words, any data that the data scientist determines to be either obviously disconnected with the assumption or objective of the data or obviously inaccurate will then be addressed. This process is reliant upon the data scientist's judgment and his or her ability to determine which points are valid and which are not.
- Instant Raspberry Pi Gaming
- 工業機器人虛擬仿真實例教程:KUKA.Sim Pro(全彩版)
- 輕松學C語言
- Dreamweaver CS3+Flash CS3+Fireworks CS3創意網站構建實例詳解
- 我的J2EE成功之路
- 圖解PLC控制系統梯形圖和語句表
- Learning C for Arduino
- 信息物理系統(CPS)測試與評價技術
- 網絡化分布式系統預測控制
- Enterprise PowerShell Scripting Bootcamp
- Windows Server 2003系統安全管理
- Mastering MongoDB 3.x
- 人工智能:智能人機交互
- Effective Business Intelligence with QuickSight
- 玩轉PowerPoint