官术网_书友最值得收藏!

The chi-square distribution

The chi-square statistics are defined by the following formula:

The chi-square distribution

Here, n is the size of the sample, s is the standard deviation of the sample, and σ is the standard deviation of the population.

If we repeatedly take samples and define the chi-square statistics, then we can form a chi-square distribution, which is defined by the following probability density function:

The chi-square distribution

Here, Y0 is a constant that depends on the number of degrees of freedom, Χ2 is the chi-square statistic, v = n - 1 is the number of degrees of freedom, and e is a constant equal to the base of the natural logarithm system.

Y0 is defined so that the area under the chi-square curve is equal to one.

The chi-square distribution

Chi-square for the goodness of fit

The Chi-square test can be used to test whether the observed data differs significantly from the expected data. Let's take the example of a dice. The dice is rolled 36 times and the probability that each face should turn upwards is 1/6. So, the expected distribution is as follows:

>>> expected = np.array([6,6,6,6,6,6])

The observed distribution is as follows:

>>> observed = observed = np.array([7, 5, 3, 9, 6, 6])

The null hypothesis in the chi-square test is that the observed value is similar to the expected value.

The chi-square can be performed using the chisquare function in the SciPy package:

>>> stats.chisquare(observed,expected)
(3.333333333333333, 0.64874235866759344)

The first value is the chi-square value and the second value is the p-value, which is very high. This means that the null hypothesis is valid and the observed value is similar to the expected value.

主站蜘蛛池模板: 长兴县| 明水县| 尼玛县| 惠来县| 房产| 闵行区| 喀喇| 本溪市| 贡觉县| 巴塘县| 毕节市| 临朐县| 瓮安县| 微博| 宜昌市| 许昌县| 宜兰市| 同仁县| 鸡东县| 民县| 磐安县| 清流县| 河南省| 浦城县| 东乡县| 金沙县| 崇州市| 阿拉善左旗| 舞阳县| 布拖县| 龙南县| 介休市| 平潭县| 昌图县| 武隆县| 沿河| 大田县| 兰州市| 吉木萨尔县| 万宁市| 玉龙|