The sample data set is heavily unbalanced (for example, group a has 1000 observations while group b has 20). I'm wondering if there are any existing functions in R that I could s