官术网_书友最值得收藏!

tapply

Now, suppose instead of prices for 2018 only, we have prices for these items for 2017, 2016, and 2015 as well. This new data frame is defined as follows:

all_prices = data.frame(items = rep(c("potato", "rice", "oil"), 4), 
jan_price = c(10, 20, 30, 10, 18, 25, 9, 17, 24, 9, 19,27),
mar_price = c(11, 22, 33, 13, 25, 32, 12, 21, 33, 15, 27,39),
june_price = c(20, 25, 33, 21, 24, 40, 17, 22, 27, 13, 18,23)
)
all_prices

The output for the preceding lines of code can be seen as follows:

Now suppose we want to take the mean price of different items for very March in all years. We can do this by using tapply(numerical_variable, categorical_variable, function). So, we will need to convert the items column of the all_prices data frame to a categorical variable to take the mean price.

tapply(all_prices$mar_price, factor(all_prices$items), mean)

This gives us a mean March price for oil, potato, and rice in all years, as follows:

Note the use of factor() to convert the items column to a factor variable.

There are other apply functions, but that's it for now, folks. We will introduce new functions as and when it will be necessary as we proceed to new chapters for geospatial analysis.

To install a new package, we need to write install.packages("package_name"), and to use any package, we need to write load.packages("package_name").

主站蜘蛛池模板: 万年县| 南宫市| 杭锦后旗| 景德镇市| 东阳市| 依兰县| 吴忠市| 屯门区| 常宁市| 霸州市| 龙陵县| 汪清县| 洮南市| 莱州市| 太湖县| 长垣县| 六盘水市| 抚顺市| 常德市| 中宁县| 郸城县| 万山特区| 洛阳市| 新郑市| 绥化市| 肥乡县| 邓州市| 山东| 自贡市| 响水县| 成安县| 泌阳县| 乐陵市| 伊吾县| 乌鲁木齐市| 芒康县| 吉木乃县| 义乌市| 波密县| 聂拉木县| 措美县|