How to calculate cosine similarity between two frequency vectors in MATLAB?

前端 未结 2 1874
小蘑菇
小蘑菇 2021-01-28 04:19

I need to find the cosine similarity between two frequency vectors in MATLAB.

Example vectors:

a = [2,3,4,4,6,1]
b = [1,3,2,4,6,3]

How

2条回答
  •  别那么骄傲
    2021-01-28 05:02

    Take a quick look at the mathematical definition of Cosine similarity.

    From the definition, you just need the dot product of the vectors divided by the product of the Euclidean norms of those vectors.

    % MATLAB 2018b
    a = [2,3,4,4,6,1]; 
    b = [1,3,2,4,6,3];
    
    cosSim = sum(a.*b)/sqrt(sum(a.^2)*sum(b.^2));            % 0.9436
    

    Alternatively, you could use

    cosSim = (a(:).'*b(:))/sqrt(sum(a.^2)*sum(b.^2));        % 0.9436
    

    which gives the same result.


    After reading this correct answer, to avoid sending you to another castle I've added another approach using MATLAB's built-in linear algebra functions, dot() and norm().

    cosSim = dot(a,b)/(norm(a)*norm(b));                     % 0.9436
    

    See also the tag-wiki for cosine-similarity.


    Performance by Approach:

    1. sum(a.*b)/sqrt(sum(a.^2)*sum(b.^2))
    2. (a(:).'*b(:))/sqrt(sum(a.^2)*sum(b.^2))
    3. dot(a,b)/(norm(a)*norm(b))

    Each point represents the geometric mean of the computation times for 10 randomly generated vectors.

提交回复
热议问题