Interpreting output from mahout clusterdumper

前端 未结 4 2076
面向向阳花
面向向阳花 2021-02-10 03:24

I ran a clustering test on crawled pages (more than 25K docs ; personal data set). I\'ve done a clusterdump :

$MAHOUT_HOME/bin/mahout clusterdump --seqFileDir ou         


        
4条回答
  •  暖寄归人
    2021-02-10 03:48

    To complete the answer:

    • VL-x: is the identifier of the cluster
    • n=y: is the number of elements in the cluster
    • c=[z, ...]: is the centroid of the cluster, with the z's being the weights of the different dimensions
    • r=[z, ...]: is the radius of the cluster.

    More info here: https://mahout.apache.org/users/clustering/cluster-dumper.html

提交回复
热议问题