Execute dplyr operation only if column exists

前端未结

关注

 6  1832

青春惊慌失措 2021-02-07 09:18

Drawing on the discussion on conditional dplyr evaluation I would like conditionally execute a step in pipeline depending on whether the reference column exists in the passed da

6条回答

灰色年华 (楼主)

2021-02-07 09:52
With across() in dplyr > 1.0.0 you can now use any_of when filtering. Compare original with all columns:
```
mtcars %>% 
  filter(am == 1) %>% 
  filter(cyl == 4)
```
With cyl removed, it throws an error:
```
mtcars %>% 
  select(!cyl) %>% 
  filter(am == 1) %>% 
  filter(cyl == 4)
```
Using any_of (note you have to write "cyl" and not cyl):
```
mtcars %>% 
  select(!cyl) %>% 
  filter(am == 1) %>% 
  filter(across(any_of("cyl"), ~.x == 4))
#N.B. this is equivalent to just filtering by `am == 1`.
```
0 讨论(0)

查看其它6个回答
发布评论:

提交评论
- 加载中...