- Hands-On Data Science with R
- Vitor Bianchi Lanzetta Nataraj Dasgupta Ricardo Anjoleto Farias
- 166字
- 2021-06-10 19:12:31
Measures of dispersion
While measures of central tendency try to give an idea about where data is centered, measures of dispersion are meant to give a general idea about how data is distributed around the center. Standard deviation and variance are the most popular measures of dispersion. The square root of the variance equals the standard deviation. It's very easy to get both values with R:
sd(big_sample, na.rm = T)
# outputs [1] 5.01836
var(big_sample, na.rm = T)
# outputs [1] 25.18394
The sd() function estimates the standard deviation while var() estimates the variation. In most cases, we find ourselves with a DataFrame full of variables we want to analyze. One way out of this is to use a function that will quickly summarize the whole dataset. These functions usually work equally well both with vectors and DataFrame objects. The next section introduces a couple of them.
- Instant Raspberry Pi Gaming
- 電力自動化實用技術(shù)問答
- 大數(shù)據(jù)管理系統(tǒng)
- 大數(shù)據(jù)技術(shù)基礎(chǔ)
- Docker Quick Start Guide
- Hadoop Real-World Solutions Cookbook(Second Edition)
- 21天學(xué)通Visual C++
- Ceph:Designing and Implementing Scalable Storage Systems
- 數(shù)據(jù)掘金
- Kubernetes for Developers
- Working with Linux:Quick Hacks for the Command Line
- 生物3D打印:從醫(yī)療輔具制造到細(xì)胞打印
- AMK伺服控制系統(tǒng)原理及應(yīng)用
- 傳感器原理與工程應(yīng)用
- 玩轉(zhuǎn)PowerPoint