I need to cluster sites by text, and I don\'t know how many clusters will be. Usually I use k-means to cluster text data, but now I don\'t know what to use better for this ta