官术网_书友最值得收藏!

The chi-square distribution

The chi-square statistics are defined by the following formula:

The chi-square distribution

Here, n is the size of the sample, s is the standard deviation of the sample, and σ is the standard deviation of the population.

If we repeatedly take samples and define the chi-square statistics, then we can form a chi-square distribution, which is defined by the following probability density function:

The chi-square distribution

Here, Y0 is a constant that depends on the number of degrees of freedom, Χ2 is the chi-square statistic, v = n - 1 is the number of degrees of freedom, and e is a constant equal to the base of the natural logarithm system.

Y0 is defined so that the area under the chi-square curve is equal to one.

The chi-square distribution

Chi-square for the goodness of fit

The Chi-square test can be used to test whether the observed data differs significantly from the expected data. Let's take the example of a dice. The dice is rolled 36 times and the probability that each face should turn upwards is 1/6. So, the expected distribution is as follows:

>>> expected = np.array([6,6,6,6,6,6])

The observed distribution is as follows:

>>> observed = observed = np.array([7, 5, 3, 9, 6, 6])

The null hypothesis in the chi-square test is that the observed value is similar to the expected value.

The chi-square can be performed using the chisquare function in the SciPy package:

>>> stats.chisquare(observed,expected)
(3.333333333333333, 0.64874235866759344)

The first value is the chi-square value and the second value is the p-value, which is very high. This means that the null hypothesis is valid and the observed value is similar to the expected value.

主站蜘蛛池模板: 武陟县| 吉首市| 德令哈市| 莆田市| 安平县| 福海县| 茂名市| 迁西县| 资兴市| 札达县| 兴和县| 东台市| 孟津县| 疏附县| 鹤岗市| 佛山市| 慈利县| 金堂县| 临海市| 元谋县| 读书| 杭锦后旗| 民丰县| 长春市| 顺义区| 汾西县| 松滋市| 左权县| 阿合奇县| 乌审旗| 偏关县| 泽库县| 韶山市| 老河口市| 阳西县| 海口市| 荆门市| 安塞县| 崇左市| 藁城市| 清河县|