- Mastering Python for Data Science
- Samir Madhavan
- 270字
- 2021-07-16 20:14:21
The chi-square distribution
The chi-square statistics are defined by the following formula:

Here, n is the size of the sample, s is the standard deviation of the sample, and σ is the standard deviation of the population.
If we repeatedly take samples and define the chi-square statistics, then we can form a chi-square distribution, which is defined by the following probability density function:

Here, Y0 is a constant that depends on the number of degrees of freedom, Χ2 is the chi-square statistic, v = n - 1 is the number of degrees of freedom, and e is a constant equal to the base of the natural logarithm system.
Y0 is defined so that the area under the chi-square curve is equal to one.

Chi-square for the goodness of fit
The Chi-square test can be used to test whether the observed data differs significantly from the expected data. Let's take the example of a dice. The dice is rolled 36 times and the probability that each face should turn upwards is 1/6. So, the expected distribution is as follows:

>>> expected = np.array([6,6,6,6,6,6])
The observed distribution is as follows:

>>> observed = observed = np.array([7, 5, 3, 9, 6, 6])
The null hypothesis in the chi-square test is that the observed value is similar to the expected value.
The chi-square can be performed using the chisquare
function in the SciPy package:
>>> stats.chisquare(observed,expected) (3.333333333333333, 0.64874235866759344)
The first value is the chi-square value and the second value is the p-value, which is very high. This means that the null hypothesis is valid and the observed value is similar to the expected value.
- 黑客攻防從入門到精通(實戰秘笈版)
- jQuery EasyUI網站開發實戰
- 深入淺出Windows API程序設計:編程基礎篇
- Python高效開發實戰:Django、Tornado、Flask、Twisted(第3版)
- Creating Mobile Apps with jQuery Mobile(Second Edition)
- Building Machine Learning Systems with Python(Second Edition)
- Learning Unreal Engine Android Game Development
- Learning Material Design
- Data Science Algorithms in a Week
- 金融商業數據分析:基于Python和SAS
- JavaScript前端開發基礎教程
- C/C++程序設計教程
- Learning Scrapy
- 實戰圖解MACD波段交易技術
- 區塊鏈社會:區塊鏈助力國家治理能力現代化