Returning first row of group

前端未结

关注

 2  702

I have a dataframe consisting of an ID, that is the same for each element in a group, two datetimes and the time interval between these two. One of the datetime

相关标签:

2条回答

执笔经年

2020-11-30 09:54
As you don't provide any data, here is an example using base R with a sample data frame :
```
df <- data.frame(group=c("a", "b"), value=1:8)
## Order the data frame with the variable of interest
df <- df[order(df$value),]
## Aggregate
aggregate(df, list(df$group), FUN=head, 1)
```
EDIT : As Ananda suggests in his comment, the following call to aggregate is better :
```
aggregate(.~group, df, FUN=head, 1)
```
If you prefer to use plyr, you can replace aggregate with ddply :
```
ddply(df, "group", head, 1)
```
0 讨论(0)
发布评论:

提交评论
- 加载中...
伪装坚强ぢ

2020-11-30 10:05
By reproducing the example data frame and testing it I found a way of getting the needed result:
1. Order data by relevant columns (ID, Start)
  
  ordered_data <- data[order(data$ID, data$Start),]
2. Find the first row for each new ID
  
  final <- ordered_data[!duplicated(ordered_data$ID),]
0 讨论(0)
发布评论:

提交评论
- 加载中...