
Inspection

Once you have acquired your data, the next step is to inspect it. The primary goal at this stage is to sanity-check the data, and the best way to do this is to look for values that are either impossible or highly unlikely. For example, if the data has a unique identifier, check that each value really does appear only once; if the data is price-based, check that every price is positive; and whatever the data type, look at the most extreme cases. Do they make sense? It is good practice to run some simple statistical tests on the data and to visualize it. Your models are only as good as the data you put in, so it is crucial to get this step right.
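The checks described above can be sketched in a few lines of Python. This is a minimal illustration using only the standard library, not a prescribed method: the sample records, the field names `id` and `price`, and the helper `inspect_rows` are all hypothetical and stand in for whatever your own data looks like.

```python
from collections import Counter
from statistics import mean, median

# Hypothetical sample records; the field names "id" and "price" are assumptions.
records = [
    {"id": 1, "price": 1200.0},
    {"id": 2, "price": 950.0},
    {"id": 2, "price": 1100.0},     # duplicate identifier
    {"id": 3, "price": -50.0},      # impossible (negative) price
    {"id": 4, "price": 250000.0},   # suspiciously extreme value
]


def inspect_rows(rows):
    """Return a small report of sanity-check findings for a list of dicts."""
    id_counts = Counter(row["id"] for row in rows)
    prices = [row["price"] for row in rows]
    return {
        # Identifiers that appear more than once are not unique.
        "duplicate_ids": sorted(i for i, n in id_counts.items() if n > 1),
        # Prices should always be positive.
        "non_positive_prices": [p for p in prices if p <= 0],
        # Extremes and simple summary statistics to eyeball.
        "min_price": min(prices),
        "max_price": max(prices),
        "mean_price": mean(prices),
        "median_price": median(prices),
    }


report = inspect_rows(records)
for name, value in report.items():
    print(f"{name}: {value}")
```

A large gap between the mean and the median, as in this sample, is one quick hint that a few extreme values deserve a closer look before any modeling begins.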
