Mapping 2 vectors - help to vectorize

前端 未结 6 1140
情话喂你
情话喂你 2020-11-30 12:34

Working in Matlab I have 2 vectors of x coordinate with different length. For example:

xm = [15 20 24 25 26 35 81 84 93];
xn = [14 22 26 51 55 59 70 75 89 96         


        
相关标签:
6条回答
  • 2020-11-30 13:14

    The fastest implementation I'm aware of that solves this problem is this one (C code that can be compiled as a .mex file; for me it's about 20 times faster than rescdsk's code in the accepted answer). It's surprising that such a common operation is not a MATLAB built-in function.

    0 讨论(0)
  • 2020-11-30 13:14

    Your xm and xn are sorted. If this is generally the case, then you can do much better than stepping over the entire array.

    For each value in xn, there will be a range of values for which a value in xm will be closer to that number than any other. Compute these intervals beforehand and you can then step through both arrays sequentially.

    0 讨论(0)
  • 2020-11-30 13:20

    Consider this vectorized solution:

    [~, xmap] = min( abs(bsxfun(@minus, xm, xn')) )
    
    0 讨论(0)
  • 2020-11-30 13:30

    Oh! One other option: since you're looking for close correspondences between two sorted lists, you could go through them both simultaneously, using a merge-like algorithm. This should be O(max(length(xm), length(xn)))-ish.

    
    match_for_xn = zeros(length(xn), 1);
    last_M = 1;
    for N = 1:length(xn)
      % search through M until we find a match.
      for M = last_M:length(xm)
        dist_to_curr = abs(xm(M) - xn(N));
        dist_to_next = abs(xm(M+1) - xn(N));
    
        if dist_to_next > dist_to_curr
          match_for_xn(N) = M;
          last_M = M;
          break
        else
          continue
        end
    
      end % M
    end % N
    

    EDIT: See @yuk's comment, the above code is not totally correct!

    0 讨论(0)
  • 2020-11-30 13:30

    It looks like your input vectors are sorted. Use a binary search to find the closest match. This will give you a O(n ln n) run time.

    0 讨论(0)
  • 2020-11-30 13:39

    Taking advantage of being sorted, as David says, will be faster since you have so many points, but for reference one way to vectorize this would be to use meshgrid:

    [X Y] = meshgrid(xn, xm);
    diffs = X - y;
    mins = min(diffs, [], 2);
    

    Note that this will create two 100,000 x 100,000 arrays in memory, so it's probably only feasible for smaller data sets.

    0 讨论(0)
提交回复
热议问题