Summary

In this chapter, we revisited the fundamental theory behind data analysis and exploratory data analysis (EDA). EDA is one of the most prominent steps in the data analysis process, which comprises data requirements, data collection, data processing, data cleaning, exploratory data analysis, modeling and algorithms, data production, and communication. It is crucial to identify the type of data under analysis. Different disciplines store different kinds of data for different purposes. For example, medical researchers store patients' data, universities store students' and teachers' data, and real estate industries store house and building datasets. A dataset contains many observations about a particular object. Most datasets can be divided into numerical and categorical data. There are four types of data measurement scales: nominal, ordinal, interval, and ratio.
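The distinction between numerical and categorical data, and between the four measurement scales, can be sketched in pandas. This is a minimal illustration and not an example taken from the chapter: the toy dataset, its column names, and its values are all hypothetical, and encoding the ordinal column as an ordered categorical is just one reasonable way to preserve its ranking.

```python
import pandas as pd

# Hypothetical toy dataset: column names and values are illustrative only.
df = pd.DataFrame({
    "blood_type": ["A", "O", "B", "O"],              # nominal: labels with no order
    "pain_level": ["low", "high", "medium", "low"],  # ordinal: labels with an order
    "temperature_c": [36.6, 38.2, 37.1, 36.9],       # interval: no true zero
    "weight_kg": [70.5, 82.0, 64.3, 90.1],           # ratio: true zero exists
})

# Nominal data: an unordered categorical.
df["blood_type"] = df["blood_type"].astype("category")

# Ordinal data: an ordered categorical, so comparisons respect the ranking.
df["pain_level"] = pd.Categorical(
    df["pain_level"],
    categories=["low", "medium", "high"],
    ordered=True,
)

# Split the columns into numerical and categorical groups by dtype.
numerical_cols = df.select_dtypes(include="number").columns.tolist()
categorical_cols = df.select_dtypes(include="category").columns.tolist()

# Order-aware operations are only meaningful on the ordinal column.
highest_pain = df["pain_level"].max()
above_low = int((df["pain_level"] > "low").sum())

print(numerical_cols)
print(categorical_cols)
print(highest_pain, above_low)
```

Note that pandas cannot tell interval data from ratio data: both are stored as plain numbers, so that distinction has to come from knowledge of what the column measures.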

In this book, we are going to use several Python libraries, including NumPy, pandas, SciPy, and Matplotlib, to perform simple to complex exploratory data analysis. In the next chapter, we are going to learn about various types of visualization aids for exploratory data analysis.
