Effective clustering of a similarity matrix

轻奢々 2021-02-09 19:19

My topic is the similarity and clustering of a collection of texts. In a nutshell: I want to cluster the collected texts so that they end up in meaningful clusters.

3 answers

太阳男子 (original poster)

2021-02-09 19:47

Since you're new to the field, have an unknown number of clusters, and are already using cosine distance, I would recommend the FLAME (Fuzzy clustering by Local Approximation of MEmberships) clustering algorithm.

It's intuitive, easy to implement, and has implementations in a large number of languages (not PHP though, largely because very few people use PHP for data science).

Not to mention, it's good enough to be widely used in research. If nothing else, working with it will show you exactly which shortcomings you would want to address when moving on to another algorithm.
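To give a feel for how FLAME works on a precomputed distance matrix, here is a minimal, simplified sketch in plain Python. It is not a reference implementation: the function name `flame`, the parameters `k`, `iters` and `outlier_density`, the rank-based neighbour weights, the fallback when no object is denser than all its neighbours, and the toy vectors are all assumptions made for illustration. The idea is the usual three steps: estimate a density for each object from its k nearest neighbours, treat local density maxima as cluster supporting objects, then let every other object repeatedly average the fuzzy memberships of its neighbours.

```python
import math

def cosine_distance(u, v):
    """1 - cosine similarity of two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / norm

def flame(dist, k=2, iters=100, outlier_density=None):
    """Simplified FLAME on an n x n distance matrix (list of lists).

    Returns one label per object; -1 marks the outlier group.
    """
    n = len(dist)
    k = min(k, n - 1)
    # k nearest neighbours of each object (itself excluded), nearest first
    knn = [sorted((j for j in range(n) if j != i), key=lambda j: dist[i][j])[:k]
           for i in range(n)]
    # density: inverse of the mean distance to the k nearest neighbours
    density = [1.0 / (sum(dist[i][j] for j in knn[i]) / k + 1e-12)
               for i in range(n)]
    # cluster supporting objects (CSOs): denser than every neighbour
    cso = [i for i in range(n) if all(density[i] > density[j] for j in knn[i])]
    if not cso:  # assumption: fall back to the single densest object
        cso = [max(range(n), key=density.__getitem__)]
    # optional outliers: sparser than every neighbour and below a threshold
    outliers = set()
    if outlier_density is not None:
        outliers = {i for i in range(n)
                    if density[i] < outlier_density
                    and all(density[i] < density[j] for j in knn[i])}
    c = len(cso)  # columns 0..c-1 are clusters, column c is the outlier group
    member = [[1.0 / (c + 1)] * (c + 1) for _ in range(n)]
    for idx, i in enumerate(cso):
        member[i] = [1.0 if col == idx else 0.0 for col in range(c + 1)]
    for i in outliers:
        member[i] = [1.0 if col == c else 0.0 for col in range(c + 1)]
    fixed = set(cso) | outliers
    # assumption: rank-based weights, nearest neighbour weighted highest
    weights = [(k - r) / (0.5 * k * (k + 1)) for r in range(k)]
    for _ in range(iters):
        new = [row[:] for row in member]
        for i in range(n):
            if i in fixed:
                continue
            # membership becomes the weighted average of the neighbours'
            new[i] = [sum(w * member[j][col] for w, j in zip(weights, knn[i]))
                      for col in range(c + 1)]
        member = new
    labels = []
    for row in member:
        best = max(range(c + 1), key=row.__getitem__)
        labels.append(-1 if best == c else best)
    return labels

# Toy data (made up): unit vectors in two well-separated angular groups
angles = [0, 6, 8, 11, 17, 60, 66, 68, 71, 77]
vectors = [(math.cos(math.radians(a)), math.sin(math.radians(a))) for a in angles]
dist = [[cosine_distance(u, v) for v in vectors] for u in vectors]
labels = flame(dist, k=2)
print(labels)
```

With these made-up vectors, the first five objects should share one label and the last five another. On real text you would replace the toy vectors with something like TF-IDF vectors and tune `k`, which is the main knob: a small `k` tends to give many small clusters, a large one fewer and coarser clusters.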
