官术网_书友最值得收藏!

Key objectives of data science

As mentioned in Chapter 1, Transitioning from Data Developer to Data Scientist, the idea of how data science is defined is a matter of opinion.

I personally like the explanation that data science is a progression or, even better, an evolution of thought or steps, as shown in the following figure:

This data science evolution (depicted in the preceding figure) consists of a series of steps or phases that a data scientist tracks, comprising the following:

  • Collecting data
  • Processing data
  • Exploring and visualizing data
  • Analyzing (data) and/or applying machine learning (to data)
  • Deciding (or planning) based on acquired insight

Although a progression or evolution implies a sequential journey, in practice, this is an extremely fluid process; each of the phases may inspire the data scientist to reverse and repeat one or more of the phases until they are satisfied. In other words, all or some phases of the process may be repeated until the data scientist determines that the desired outcome is reached.

For example, after a careful review of a generated visualization (during the Exploring and visualizing data phase), one may determine that additional processing of the data is required or that additional data needs to be collected before any reasonable analysis or learning could be of value.

You might loosely compare the data science process to the agile software development mythology where a developer performs various tasks, the results are analyzed, more work is done, the work is again reviewed, and the process is repeated until the desired results or outcomes are obtained.

Let's explain each of the phases of the data science evolution.

主站蜘蛛池模板: 中西区| 双鸭山市| 东城区| 太白县| 工布江达县| 定州市| 绿春县| 深州市| 荆门市| 鹤岗市| 武隆县| 宁波市| 四会市| 金沙县| 磴口县| 潼南县| 图木舒克市| 汤阴县| 洪雅县| 板桥市| 二手房| 晋江市| 普兰店市| 英山县| 兴和县| 嘉黎县| 彭泽县| 定远县| 常熟市| 新疆| 柘城县| 高雄市| 武定县| 晋江市| 大兴区| 维西| 平利县| 巴彦淖尔市| 溧阳市| 镇沅| 栖霞市|