- Hands-On Data Science and Python Machine Learning
- Frank Kane
- 2021-07-15 17:15:10
Variance
Let's look at a histogram, because variance and standard deviation are all about the spread of the data, the shape of the distribution of a dataset. Take a look at the following histogram:

Let's say that we have some data on the arrival frequency of airplanes at an airport. This histogram indicates that the most common rate was around 4 arrivals per minute, which happened on around 12 of the days in this data. However, we also have some outliers: one really slow day that had only 1 arrival per minute, and one really fast day that had almost 12 arrivals per minute. So, the way to read a histogram is to look up the bucket for a given value; its height tells you how frequently that value occurred in your data. The shape of the histogram can tell you a lot about the probability distribution of a given set of data.
We know from this data that our airport is very likely to have around 4 arrivals per minute, but it's very unlikely to have 1 or 12, and we can also talk specifically about the probabilities of all the numbers in between. So not only is it unlikely to have 12 arrivals per minute, it's also very unlikely to have 9 arrivals per minute, but once we get down to around 8 or so, things start to pick up a little bit. A lot of information can be had from a histogram.
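To make that reading concrete, here is a minimal sketch that builds the bucket counts by hand and turns them into rough probabilities. The data is made up to resemble the shape described above (a peak of 12 days at 4 arrivals per minute, single outlier days at 1 and 12, nothing at 9); it is not the dataset behind the original figure, and the names `arrivals_per_minute` and `empirical_probability` are hypothetical.

```python
from collections import Counter

# Hypothetical data: one value per observed day, giving that day's
# arrivals per minute. Invented for illustration only.
arrivals_per_minute = (
    [1] + [2] * 3 + [3] * 8 + [4] * 12 + [5] * 9
    + [6] * 5 + [7] * 3 + [8] * 2 + [12]
)

# Each distinct value is a bucket; the count is how many days landed in it.
counts = Counter(arrivals_per_minute)
total_days = len(arrivals_per_minute)


def empirical_probability(value):
    """Fraction of observed days that had this many arrivals per minute."""
    return counts[value] / total_days


# Print a text histogram: one row per bucket, one '#' per day.
for value in range(1, 13):
    bar = "#" * counts[value]
    print(f"{value:2d} | {bar:<12} {empirical_probability(value):.2f}")
```

The longest bar sits at 4, the rows for 9 through 11 are empty, and the fractions in the last column add up to 1, which is what lets us treat the histogram's shape as an estimate of the probability distribution.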