R cut dendrogram into groups with minimum size
Is there an easy way to calculate lowest value of h in cut that produces groupings of a given minimum size? In this example, if I wanted clusters with at least ten members each, I should go with h = 3.80 : # using iris data simply for reproducible example data(iris) d <- data.frame(scale(iris[,1:4])) hc <- hclust(dist(d)) plot(hc) cut(as.dendrogram(hc), h=3.79) # produces 5 groups; group 4 has 7 members cut(as.dendrogram(hc), h=3.80) # produces 4 groups; no group has <10 members Since the heights of the splits are given in hc$height , I could create a set of candidate values using hc$height +