官术网_书友最值得收藏!

  • Hands-On Data Science with R
  • Vitor Bianchi Lanzetta Nataraj Dasgupta Ricardo Anjoleto Farias
  • 263字
  • 2021-06-10 19:12:36

Using select

The verb select, which is part of the dplyr package (installed automatically when the tidyverse package is installed), can be used to select and rename columns from a dataset, as follows:

# dplyr Verbs 
# select 
 
# Add the state name to the tstate dataset 
tstate$Name   <- state.name 
 
select(tstate, Income, Frost, Area) # selecting specific columns 
 
# # A tibble: 50 x 3 
# Income Frost   Area 
# <dbl> <dbl>  <dbl> 
#   1   3624    20  50708 
# 2   6315   152 566432 
 
select(tstate, Population:Illiteracy) # selecting a range of columns 
 
# # A tibble: 50 x 3 
# Population Income Illiteracy 
# <dbl>  <dbl>      <dbl> 
#   1       3615   3624        2.1 
# 2        365   6315        1.5 
# 3       2212   4530        1.8 
 
select(tstate, -c(Population:Illiteracy)) # excluding a range of columns 
 
# # A tibble: 50 x 7 
# `Life Exp` Murder `HS Grad` Frost   Area    Region        Name 
# <dbl>  <dbl>     <dbl> <dbl>  <dbl>    <fctr>       <chr> 
#   1      69.05   15.1      41.3    20  50708     South     Alabama 
# 2      69.31   11.3      66.7   152 566432      West      Alaska 
# 3      70.55    7.8      58.1    15 113417      West     Arizona 
 
rename(tstate, Pop=Population) # renaming specific columns and selecting all columns 
 
# # A tibble: 50 x 10 
# Pop Income Illiteracy `Life Exp` Murder `HS Grad` Frost   Area    Region        Name 
# <dbl>  <dbl>      <dbl>      <dbl>  <dbl>     <dbl> <dbl>  <dbl>    <fctr>       <chr> 
#   1  3615   3624        2.1      69.05   15.1      41.3    20  50708     South     Alabama 
# 2   365   6315        1.5      69.31   11.3      66.7   152 566432      West      Alaska 
# 

You can also use helper functions such as starts_with, ends_with to select only specific columns matching a criteria, as follows:

select(tstate, starts_with("P")) 

# # A tibble: 50 x 1 # Population # <dbl> # 1 3615 # select(tstate, ends_with("n")) # # A tibble: 50 x 2 # Population Region # <dbl> <fctr> # 1 3615 South # 2 365 West #
主站蜘蛛池模板: 方正县| 亚东县| 泌阳县| 扎囊县| 深圳市| 广河县| 嘉善县| 青州市| 东山县| 清徐县| 福清市| 霸州市| 乌拉特中旗| 普陀区| 册亨县| 周至县| 靖州| 天峻县| 桃园市| 苗栗县| 阿勒泰市| 民和| 墨竹工卡县| 库伦旗| 托克托县| 姚安县| 台前县| 饶阳县| 海门市| 呼图壁县| 西峡县| 襄城县| 盐津县| 承德市| 嘉祥县| 博罗县| 伊吾县| 宜川县| 聂荣县| 弥渡县| 武宣县|