Is there a method in dplyr to remove columns that are almost duplicated? For example, I would like to remove columns that are greater than 75% duplicated in the following tibbl