- Hands-On Data Science and Python Machine Learning
- Frank Kane
- 258字
- 2021-07-15 17:15:11
Analyzing standard deviation and variance on a histogram
Let's write some code here and play with some standard deviation and variances. So If you pull up the StdDevVariance.ipynb file IPython Notebook, and follow along with me here. Please do, because there's an activity at the end that I want you to try. What we're going to do here is just like the previous example, so begin with the following code:
%matplotlib inline import numpy as np import matplotlib.pyplot as plt incomes = np.random.normal(100.0, 20.0, 10000) plt.hist(incomes, 50) plt.show()
We use matplotlib to plot a histogram of some normally distributed random data, and we call it incomes. We're saying it's going to be centered around 100 (hopefully that's an hourly rate or something and not annual, or some weird denomination), with a standard deviation of 20 and 10,000 data points.
Let's go ahead and generate that by executing that above code block and plotting it as shown in the following graph:

We have 10,000 data points centered around 100. With a normal distribution and a standard deviation of 20, a measure of the spread of this data, you can see that the most common occurrence is around 100, and as we get further and further from that, things become less and less likely. The standard deviation point of 20 that we specified is around 80 and around 120. You can see in the histogram that this is the point where things start to fall off sharply, so we can say that things beyond that standard deviation boundary are unusual.
- 大話PLC(輕松動漫版)
- 一步一步學Spring Boot 2:微服務項目實戰
- The Android Game Developer's Handbook
- C++面向對象程序設計(微課版)
- PHP基礎案例教程
- 我的第一本算法書
- Django Design Patterns and Best Practices
- Scratch 3.0少兒編程與邏輯思維訓練
- Web Application Development with MEAN
- 軟件測試技術指南
- Java:High-Performance Apps with Java 9
- Android開發:從0到1 (清華開發者書庫)
- RSpec Essentials
- Mastering React
- TypeScript 2.x By Example