官术网_书友最值得收藏!

Creating histograms and density plots

In this recipe, you will learn how to make histograms and density plots, which are useful to look at the distribution of values in a dataset.

How to do it...

The simplest way to demonstrate the use of a histogram is to show a normal distribution:

hist(rnorm(1000))

Another example of a histogram is one that shows a skewed distribution:

hist(islands)

How it works...

The hist() function is also a function of R's base graphics library. It takes only one compulsory argument, that is, the variable whose distribution of values we wish to visualize.

In the first example, we passed the rnorm() function as the variable. The rnorm(1000) function generates a vector of 1000 random numbers with a normal distribution. As you can see in the histogram, it's a bell-shaped curve.

In the second example, we passed the built-in islands dataset (which gives the areas of the world's major landmasses) as the argument to hist(). As you can see from these histograms, islands has a distribution skewed heavily towards the lower value range of 0 to 2000 square miles.

There's more...

As you might have noticed in the preceding examples, the default setting for histograms is to display the frequency or number of occurrences of values in a particular range on the y axis. We can also display probabilities instead of frequencies by setting the prob (for probability) argument to TRUE or the freq (for frequency) argument to FALSE.

Now, let's make a density plot for the same function, rnorm(). To do so, we need to use the density() function and pass it as our first argument to plot() as follows:

plot(density(rnorm(1000)))

See also

We cover more details such as setting the breaks, density, formatting of bars, and other advanced recipes in Chapter 7, Creating Histograms.

主站蜘蛛池模板: 克东县| 宣汉县| 仪陇县| 农安县| 黄龙县| 常宁市| 兴隆县| 廊坊市| 麻阳| 民县| 禄劝| 呼和浩特市| 南雄市| 沾益县| 五台县| 东乡县| 德钦县| 白银市| 凤翔县| 怀安县| 沛县| 溧阳市| 徐水县| 玛多县| 乌拉特中旗| 安吉县| 肇源县| 新晃| 平南县| 洛浦县| 江北区| 攀枝花市| 鸡西市| 那坡县| 大城县| 吉首市| 黔东| 贵定县| 来凤县| 澎湖县| 双峰县|