How to check multiple values using if condition [duplicate]

后端未结

关注

 5  1442

孤街浪徒 2021-01-27 16:38

5条回答

离开以前 (楼主)

2021-01-27 17:05

This approach will scale to a large number of remarks.

Load the data and prepare a matching data frame

The second data frame makes a matching between remarks and their TRUE or FALSE value.

library(readr)
library(dplyr)
library(tidyr)
dtf <- read_table("id        remarks         value
1         ABC             10
1         AAB             12
1         ZZX             15
2         XYZ             12
2         ABB             14")
truefalse <- data_frame(remarks = c("ABC", "AAB", "ABB", "ZZX", "XYZ"),
                        tf = c(TRUE, TRUE, TRUE, FALSE, FALSE))

Group by id and summarise

This is the format as asked in the question

dtf %>% 
    left_join(truefalse, by = "remarks") %>% 
    group_by(id) %>% 
    summarise(true = sum(tf),
              false = sum(!tf),
              value = sum(value)) 

# A tibble: 2 x 4
     id  true false value
     
1     1     2     1    37
2     2     1     1    26

Alternative proposal: group by id, tf and summarise

This option retains more details on the spread of value along the grouping variables id and tf.

    dtf %>% 
        left_join(truefalse, by = "remarks") %>% 
        group_by(id, tf) %>% 
        summarise(n = n(),
                  value = sum(value))
# A tibble: 4 x 4
# Groups:   id [?]
     id tf        n value
     
1     1 FALSE     1    15
2     1 TRUE      2    22
3     2 FALSE     1    12
4     2 TRUE      1    14

0 讨论(0)

查看其它5个回答

热议问题