官术网_书友最值得收藏!

Introduction to data science

The term, data science, as mentioned earlier, was first proposed in the 1960s and 1970s by Peter Naur. In the late 1990s, Jeff Wu, while at the University of Michigan, Ann Arbor, proposed the term in a formal paper titled Statistics = Data Science?. The paper, which Prof. Wu subsequently presented at the seventh series of P.C. Mahalonobis Lectures at the Indian Statistical Institute in 1998, raised some interesting questions about what an appropriate definition of statistics might be in light of the tasks that a statistician did beyond numerical calculations.

In the paper Prof. Wu highlighted the concept of Statistical Trilogy, consisting of data collection, data modeling and analysis, and problem solving. The following sections reflected upon the future directions in which Dr. Wu raised the prospects of neural network models to model complex, non-linear relationships, the use of cross validation to improve model performance, and data mining of large-scale data among others. [Source: https://www2.isye.gatech.edu/~jeffwu/presentations/datascience.pdf].

The paper, although written more than 20 years ago, is a reflection of the foresight that a few academicians such as Dr. Wu had at the time, which has been realized in full, almost verbatim as it was propositioned back then, both in thought and practical concepts. A copy of Dr. Wu's paper is available at https://www2.isye.gatech.edu/~jeffwu/presentations/datascience.pdf.

主站蜘蛛池模板: 渝北区| 博湖县| 彩票| 息烽县| 梁河县| 仁怀市| 丰都县| 汉寿县| 临邑县| 拜泉县| 冷水江市| 昭苏县| 漯河市| 蓝山县| 永州市| 务川| 绥宁县| 福州市| 攀枝花市| 乌拉特中旗| 玉龙| 柘荣县| 绿春县| 金湖县| 安仁县| 呼伦贝尔市| 壤塘县| 沙湾县| 成安县| 安阳市| 红安县| 苍溪县| 深水埗区| 泗水县| 万盛区| 泸西县| 灵武市| 平顺县| 舒兰市| 安达市| 白河县|