- Hands-On Unsupervised Learning with Python
- Giuseppe Bonaccorso
- 265字
- 2021-07-02 12:32:00
Descriptive analysis
The first problem to solve in almost any data science scenario concerns understanding its nature. We need to know how the system works or what a dataset is describing. Without this analysis, our knowledge is too limited to make any assumption or hypothesis. For example, we can observe a chart of the average temperature in a city for several years. If we are unable to describe the time series discovering the correlation, seasonalities, and trends, any other question remains unsolved. In our specific context, if we don't discover the similarities between groups of objects, we cannot try to find out a way to summarize their common features. The data scientist has to employ specific tools for every particular problem, but, at the end of this stage, all possible (and helpful) questions must be answered.
Moreover, as this process must have clear business value, it's important to involve different stakeholders with the purpose of gathering their knowledge and converting it into a common language. For example, when working with healthcare data, a physician might talk about hereditary factors, but for our purpose, it's preferable to say that there's a correlation among some samples, so we're not fully authorized to treat them as statistically independent elements. In general, the outcome of descriptive analysis is a summary containing all metric evaluations and conclusions that are necessary to qualify the context, and reducing uncertainty. In the example of the temperature chart, the data scientist should be able to answer the auto-correlation, the periodicity of the peaks, the number of potential outliers, and the presence of trends.
- 用“芯”探核:龍芯派開發實戰
- 24小時學會電腦組裝與維護
- Learning Cocos2d-x Game Development
- FPGA從入門到精通(實戰篇)
- 顯卡維修知識精解
- Instant uTorrent
- Manage Partitions with GParted How-to
- Svelte 3 Up and Running
- Mastering Adobe Photoshop Elements
- 面向對象分析與設計(第3版)(修訂版)
- Creating Flat Design Websites
- Source SDK Game Development Essentials
- BeagleBone Robotic Projects
- 單片機原理及應用:基于C51+Proteus仿真
- Intel FPGA權威設計指南:基于Quartus Prime Pro 19集成開發環境