- Python: Advanced Predictive Analytics
- Ashish Kumar, Joseph Babcock
- 2021-07-02 20:09:27
Summary
In this chapter, we skimmed through the basic concepts of statistics. Here is a brief summary of the concepts we learned:
- Hypothesis testing is used to test the statistical significance of a hypothesis. The hypothesis that already exists or is assumed to be true is the null hypothesis; the one that is uncertain or is being proposed as an alternative premise is the alternative hypothesis.
- To conduct the test, one calculates a test statistic and its associated p-value, then compares the p-value with a chosen significance level.
- In a regression model, hypothesis testing (through p-values) is used to test the significance of the coefficient estimates calculated by the model.
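- The chi-square test is used to test whether a categorical predictor and a categorical output variable are associated. It indicates association, not causation. It can also be used as a goodness-of-fit test, to check whether observed data matches an expected distribution (for example, whether the data is fair or fake).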
- The correlation coefficient can range from -1 to 1. The closer it is to either extreme, the stronger the linear relationship between the two variables; a value near 0 indicates little or no linear relationship.
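The points above can be made concrete with a small sketch. The code below is an illustration only, not the book's own code: the function names `pearson_r` and `chi_square_2x2` and all of the data are made up for this example, and only the standard library is used. It computes a correlation coefficient, then a chi-square statistic and p-value for a 2x2 table, where the null hypothesis is that the two variables are independent.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def chi_square_2x2(table):
    """Chi-square statistic and p-value for a 2x2 contingency table.

    Uses 1 degree of freedom and no continuity correction.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Count expected in this cell if the variables were independent
            expected = row_totals[i] * col_totals[j] / total
            stat += (observed - expected) ** 2 / expected
    # With 1 degree of freedom, P(chi2 > stat) = erfc(sqrt(stat / 2))
    p_value = math.erfc(math.sqrt(stat / 2))
    return stat, p_value


# Hypothetical data: hours studied versus exam score
hours = [1, 2, 3, 4, 5]
scores = [52, 55, 61, 64, 70]
r = pearson_r(hours, scores)

# Hypothetical 2x2 table: rows are predictor levels, columns are outcomes
observed_counts = [[30, 10], [20, 40]]
stat, p_value = chi_square_2x2(observed_counts)

# Reject the null hypothesis of independence at the 5% significance level?
reject_null = p_value < 0.05
print(f"r = {r:.3f}, chi2 = {stat:.2f}, p = {p_value:.5f}, reject = {reject_null}")
```

In practice a statistics library would normally be used instead of hand-written formulas; the manual version is shown only to make each step of the calculation visible.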
Linear regression belongs to the family of supervised algorithms, so called because the dataset on which the model is built includes an output variable. In a sense, this output variable governs, or supervises, the development of the model, hence the name. More on this is covered in the next chapter.
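As a rough illustration of how an output variable supervises the fit, here is a minimal least-squares sketch for a single predictor. The function name `fit_simple_linear_regression` and the data are hypothetical and chosen only for this example; the next chapter's treatment may differ.

```python
def fit_simple_linear_regression(xs, ys):
    """Return (intercept, slope) of the least-squares line y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # The known outputs ys determine the coefficients: this is the supervision
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sum(
        (x - mean_x) ** 2 for x in xs
    )
    intercept = mean_y - slope * mean_x
    return intercept, slope


# Hypothetical data: advertising spend (predictor) and sales (output variable)
ad_spend = [1.0, 2.0, 3.0, 4.0, 5.0]
sales = [3.1, 4.9, 7.2, 8.8, 11.0]

intercept, slope = fit_simple_linear_regression(ad_spend, sales)

# Predict the output for a predictor value the model has not seen
predicted_sales = intercept + slope * 6.0
print(f"sales = {intercept:.2f} + {slope:.2f} * ad_spend")
```

Without the `sales` column there would be nothing to fit against, which is the sense in which the output variable supervises the model.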