官术网_书友最值得收藏!

Variance

Let's look at a histogram, because variance and standard deviation are all about the spread of the data, the shape of the distribution of a dataset. Take a look at the following histogram:

Let's say that we have some data on the arrival frequency of airplanes at an airport, for example, and this histogram indicates that we have around 4 arrivals per minute and that happened on around 12 days that we looked at for this data. However, we also have these outliers. We had one really slow day that only had one arrival per minute, we only had one really fast day where we had almost 12 arrivals per minute. So, the way to read a histogram is look up the bucket of a given value, and that tells you how frequently that value occurred in your data, and the shape of the histogram could tell you a lot about the probability distribution of a given set of data.

We know from this data that our airport is very likely to have around 4 arrivals per minute, but it's very unlikely to have 1 or 12, and we can also talk specifically about the probabilities of all the numbers in between. So not only is it unlikely to have 12 arrivals per minute, it's also very unlikely to have 9 arrivals per minute, but once we start getting around 8 or so, things start to pick up a little bit. A lot of information can be had from a histogram.

Variance measures how spread-out the data is.
主站蜘蛛池模板: 三明市| 商南县| 衡阳市| 龙里县| 涟源市| 云安县| 枣阳市| 虎林市| 渭源县| 凤庆县| 密云县| 林周县| 涪陵区| 德格县| 宿迁市| 高陵县| 汉川市| 聂拉木县| 昭平县| 万宁市| 山西省| 通海县| 嫩江县| 得荣县| 兴化市| 杂多县| 丰县| 九龙坡区| 滦南县| 兰西县| 武夷山市| 从江县| 灵武市| 体育| 留坝县| 彭泽县| 新郑市| 天水市| 于都县| 漳平市| 正定县|