官术网_书友最值得收藏!

Introduction

Various statistical distributions have been invented, which are the equivalent of the wheel for data analysts. Just as whatever I think of comes out differently in print, data in our world doesn't follow strict mathematical laws. Nevertheless, after visualizing our data, we can see that the data follows (to certain extent) a distribution. Even without visualization, we can find a candidate distribution using rules of thumb. The next step is to try to fit the data to a known distribution. If the data is very complex, possibly due to a high number of variables, it is useful to estimate its kernel density (also useful with one variable). In all scenarios, it is good to estimate the confidence intervals or p-values of our results. When we have at least two variables, it is sometimes appropriate to have a look at the correlation between variables. In this chapter, we will apply three types of correlation.

主站蜘蛛池模板: 九寨沟县| 思茅市| 邵阳县| 德安县| 义乌市| 天祝| 黔江区| 建始县| 新营市| 盐池县| 江孜县| 济宁市| 武冈市| 建平县| 高雄市| 马关县| 永州市| 临清市| 衡水市| 溧水县| 乌苏市| 丰城市| 河西区| 南江县| 永宁县| 江达县| 教育| 化隆| 靖西县| 博兴县| 广宗县| 巫溪县| 西安市| 毕节市| 铁岭市| 洪湖市| 平邑县| 云阳县| 铜川市| 雷州市| 麻江县|