官术网_书友最值得收藏!

Population variance versus sample variance

There is a little nuance to standard deviation and variance, and that's when you're talking about population versus sample variance. If you're working with a complete set of data, a complete set of observations, then you do exactly what I told you. You just take the average of all the squared variances from the mean and that's your variance.

However, if you're sampling your data, that is, if you're taking a subset of the data just to make computing easier, you have to do something a little bit different. Instead of dividing by the number of samples, you divide by the number of samples minus 1. Let's look at an example.

We'll use the sample data we were just studying for people standing in a line. We took the sum of the squared variances and divided by 5, that is the number of data points that we had, to get 5.04.

σ2 = (11.56 + 0.16 + 0.36 + 0.16 + 12.96) / 5 = 5.04

If we were to look at the sample variance, which is designated by S2, it is found by the sum of the squared variances divided by 4, that is (n - 1). This gives us the sample variance, which comes out to 6.3.

S2 = (11.56 + 0.16 + 0.36 + 0.16 + 12.96) / 4 = 6.3

So again, if this was some sort of sample that we took from a larger dataset, that's what you would do. If it was a complete dataset, you divide by the actual number. Okay, that's how we calculate population and sample variance, but what's the actual logic behind it?

主站蜘蛛池模板: 金塔县| 深水埗区| 米泉市| 乌鲁木齐市| 巴彦淖尔市| 长治县| 大余县| 陆良县| 莱州市| 无极县| 汤原县| 河池市| 瓮安县| 敦化市| 奎屯市| 兴城市| 阿合奇县| 揭东县| 普兰店市| 兰西县| 临夏县| 芜湖县| 灵宝市| 通江县| 永嘉县| 来安县| 临邑县| 宣汉县| 临邑县| 辰溪县| 寻乌县| 利川市| 微山县| 五峰| 成安县| 玉龙| 永平县| 玉树县| 高密市| 繁昌县| 禄劝|