- Hands-On Data Science with R
- Vitor Bianchi Lanzetta Nataraj Dasgupta Ricardo Anjoleto Farias
- 166字
- 2021-06-10 19:12:31
Measures of dispersion
While measures of central tendency try to give an idea about where data is centered, measures of dispersion are meant to give a general idea about how data is distributed around the center. Standard deviation and variance are the most popular measures of dispersion. The square root of the variance equals the standard deviation. It's very easy to get both values with R:
sd(big_sample, na.rm = T)
# outputs [1] 5.01836
var(big_sample, na.rm = T)
# outputs [1] 25.18394
The sd() function estimates the standard deviation while var() estimates the variation. In most cases, we find ourselves with a DataFrame full of variables we want to analyze. One way out of this is to use a function that will quickly summarize the whole dataset. These functions usually work equally well both with vectors and DataFrame objects. The next section introduces a couple of them.
- 腦動力:Linux指令速查效率手冊
- Seven NoSQL Databases in a Week
- Getting Started with Oracle SOA B2B Integration:A Hands-On Tutorial
- 計算機網絡技術實訓
- 四向穿梭式自動化密集倉儲系統的設計與控制
- 精通數據科學算法
- Ruby on Rails敏捷開發最佳實踐
- Storm應用實踐:實時事務處理之策略
- Machine Learning with Apache Spark Quick Start Guide
- Microsoft System Center Confi guration Manager
- Salesforce Advanced Administrator Certification Guide
- 從零開始學SQL Server
- JRuby語言實戰技術
- 玩轉PowerPoint
- Qt中的C++技術