Error when using “diff” function inside of dplyr mutate

六眼飞鱼酱① submitted on 2020-03-14 11:01:36

Question


I am trying to add a new column to a data.frame. I want a column H that labels each row according to whether V is decreasing or increasing, so I use the diff function inside mutate.

# V has 10 values; data.frame() recycles it to match the 30 values of gr
V <- c(seq(30, -10, -10), seq(-10, 30, 10))
gr <- rep(seq(1, 3), each = 10)
df <- data.frame(V, gr)

library(dplyr)
diff_df <- df %>%
  group_by(gr) %>%
  mutate(H = ifelse(diff(V) < 0, "back", "forward"))

However, I get this error:

Error: incompatible size (9), expecting 10 (the group size) or 1

But when I run

diff(df$V)

[1] -10 -10 -10 -10 0 10 10 10 10 0 -10 -10 -10 -10 0 10 10 10 10 0 -10 -10 -10 -10 0 10 10 10 10

the result looks as expected. Why do I get an error when I do the same thing inside dplyr?


Answer 1:


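diff returns a vector one element shorter than its input, so within each group of 10 rows it returns only 9 values, which mutate cannot fit into the group. We need to concatenate one more value to make the lengths equal. For the whole column, for example: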

length(df$V)
#[1] 30
length(diff(df$V))
#[1] 29
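
Inside group_by(gr), diff(V) is evaluated once per group, so the mismatch that mutate reports is 9 versus 10 rather than 29 versus 30. A minimal base R sketch, rebuilding the question's data, that checks the per-group lengths (the names group_sizes and diff_sizes are only illustrative):

```r
# Rebuild the data from the question (V has 10 values and is recycled to 30 rows)
V <- c(seq(30, -10, -10), seq(-10, 30, 10))
gr <- rep(seq(1, 3), each = 10)
df <- data.frame(V, gr)

# Number of rows in each group
group_sizes <- tapply(df$V, df$gr, length)

# Length of diff(V) when computed separately within each group
diff_sizes <- tapply(df$V, df$gr, function(v) length(diff(v)))

group_sizes
diff_sizes
```

Each group has 10 rows while diff yields 9 values per group, which matches the "incompatible size (9), expecting 10" message.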

So we prepend a dummy value (here 0) to the result of diff so that its length matches the group size.

 df %>%
   group_by(gr) %>%
   mutate(H=ifelse(c(0,diff(V))<0,"back","forward"))

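With the dummy value 0, the first row of each group is labelled 'forward'. If the first value should be 'back' instead, change the condition to <= 0; note that this also labels any other row whose difference is 0 as 'back'.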
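
If neither label is appropriate for the first row of a group, one possible alternative (not part of the original answer) is to subtract dplyr::lag(V) from V. The result already has the group's length, with NA in the first position, so no dummy value is needed. A sketch under that assumption, where the column name step and the result name lag_df are hypothetical:

```r
library(dplyr)

# Rebuild the data from the question
V <- c(seq(30, -10, -10), seq(-10, 30, 10))
gr <- rep(seq(1, 3), each = 10)
df <- data.frame(V, gr)

lag_df <- df %>%
  group_by(gr) %>%
  mutate(
    # Same length as V; the first element of each group is NA
    step = V - dplyr::lag(V),
    H = case_when(
      is.na(step) ~ NA_character_,  # no previous row to compare against
      step < 0    ~ "back",
      TRUE        ~ "forward"
    )
  ) %>%
  ungroup()
```

Here the first row of each group is left as NA instead of being forced into 'back' or 'forward', and rows with a zero difference are labelled 'forward', as in the answer's original condition.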



Source: https://stackoverflow.com/questions/35169423/error-when-using-diff-function-inside-of-dplyr-mutate
