- R Programming By Example
- Omar Trejo Navarro
- 236字
- 2021-07-02 21:30:42
Understanding interactions with correlations
The correlation is a measure of the linear relation among two variables. Its value ranges from -1, representing a perfect inverse relation, to 1, representing a perfect direct relation. Just as we created a matrix of scatter plots, we will now create a matrix of correlations, and resulting graph is shown below. Large circles mean high absolute correlation. Blue circles mean positive correlation, while red circles mean negative correlation.
To create this plot we will use the corrplot() function from the corrplot package, and pass it the correlations data computed by the cor() function in R, and optionally some parameters for the text labels (tl), such as color (color) and size (cex).
Variable Correlations
Now, let's look at the following code:
library(corrplot) corrplot(corr = cor(data_numerical), tl.col = "black", tl.cex = 0.6)
If we look at the relation among the Proportion variable and the other variables, variables in large blue circles are positively correlated with it, meaning that the more that variable increases, the more likely it is for the Proportion variable to also increase. For examples of this type, look at the relations among AdultMeanAge and NoQuals with Proportion. If we find large red circles among Proportion and other variables, it means that the more that variable increases, the more Proportion is likely to decrease. For examples of this type, look at the relations among Age_25to29, Age_30to44, and L4Quals_plus with Proportion:
- 工業機器人產品應用實戰
- Mastercam 2017數控加工自動編程經典實例(第4版)
- Hands-On Linux for Architects
- 統計策略搜索強化學習方法及應用
- 完全掌握AutoCAD 2008中文版:綜合篇
- 水晶石精粹:3ds max & ZBrush三維數字靜幀藝術
- JavaScript典型應用與最佳實踐
- 工業機器人應用案例集錦
- Grome Terrain Modeling with Ogre3D,UDK,and Unity3D
- Hadoop應用開發基礎
- Mastering MongoDB 3.x
- INSTANT Adobe Story Starter
- 穿越計算機的迷霧
- Instant Slic3r
- 單片機C51應用技術