How to intrepret Clusters results after using Doc2vec?

前端 未结 3 1936
时光说笑
时光说笑 2021-01-29 03:13

I am using doc2vec to convert the top 100 tweets of my followers in vector representation (say v1.....v100). After that I am using the vector representation to do the K-Means cl

3条回答
  •  南方客
    南方客 (楼主)
    2021-01-29 03:42

    Don't use the individual variables. They should be only analyzed together because of the way these embeddings are trained.

    For a starter, find

    1. The most similar document vectors to your centroid to see typical cluster members
    2. The most similar term vectors from the embedding for typical words to describe the cluster
    3. Note the distances to see how good your fit is.

提交回复
热议问题