第 7 章 新增變數欄:由現有欄位轉換生成

new variables
- mutate: 新增欄位變數(保留原始data frame欄位)
- transmute: 新增欄位變數(「不」保留原始data frame欄位)

計算dep_delay排名

flights %>% mutate(dep_delay排名=dense_rank(dep_delay))
## # A tibble: 336,776 x 20
##     year month   day dep_time sched_dep_time dep_delay
##    <int> <int> <int>    <int>          <int>     <dbl>
##  1  2013     1     1      517            515         2
##  2  2013     1     1      533            529         4
##  3  2013     1     1      542            540         2
##  4  2013     1     1      544            545        -1
##  5  2013     1     1      554            600        -6
##  6  2013     1     1      554            558        -4
##  7  2013     1     1      555            600        -5
##  8  2013     1     1      557            600        -3
##  9  2013     1     1      557            600        -3
## 10  2013     1     1      558            600        -2
## # ... with 336,766 more rows, and 14 more variables:
## #   arr_time <int>, sched_arr_time <int>,
## #   arr_delay <dbl>, carrier <chr>, flight <int>,
## #   tailnum <chr>, origin <chr>, dest <chr>,
## #   air_time <dbl>, distance <dbl>, hour <dbl>,
## #   minute <dbl>, time_hour <dttm>,
## #   dep_delay排名 <int>
flights %>% transmute(dep_delay排名=dense_rank(dep_delay))
## # A tibble: 336,776 x 1
##    dep_delay排名
##            <int>
##  1            34
##  2            36
##  3            34
##  4            31
##  5            26
##  6            28
##  7            27
##  8            29
##  9            29
## 10            30
## # ... with 336,766 more rows