merging indexed array in Python

后端 未结 1 540
后悔当初
后悔当初 2021-01-02 16:45

Suppose that I have two numpy arrays of the form

x = [[1,2]
     [2,4]
     [3,6]
     [4,NaN]
     [5,10]]

y = [[0,-5]
     [1,0]
     [2,5]
     [5,20]
           


        
相关标签:
1条回答
  • 2021-01-02 17:23

    See numpy.lib.recfunctions.join_by

    It only works on structured arrays or recarrays, so there are a couple of kinks.

    First you need to be at least somewhat familiar with structured arrays. See here if you're not.

    import numpy as np
    import numpy.lib.recfunctions
    
    # Define the starting arrays as structured arrays with two fields ('key' and 'field')
    dtype = [('key', np.int), ('field', np.float)]
    x = np.array([(1, 2),
                 (2, 4),
                 (3, 6),
                 (4, np.NaN),
                 (5, 10)],
                 dtype=dtype)
    
    y = np.array([(0, -5),
                 (1, 0),
                 (2, 5),
                 (5, 20),
                 (6, 25)],
                 dtype=dtype)
    
    # You want an outer join, rather than the default inner join
    # (all values are returned, not just ones with a common key)
    join = np.lib.recfunctions.join_by('key', x, y, jointype='outer')
    
    # Now we have a structured array with three fields: 'key', 'field1', and 'field2'
    # (since 'field' was in both arrays, it renamed x['field'] to 'field1', and
    #  y['field'] to 'field2')
    
    # This returns a masked array, if you want it filled with
    # NaN's, do the following...
    join.fill_value = np.NaN
    join = join.filled()
    
    # Just displaying it... Keep in mind that as a structured array,
    #  it has one dimension, where each row contains the 3 fields
    for row in join: 
        print row
    

    This outputs:

    (0, nan, -5.0)
    (1, 2.0, 0.0)
    (2, 4.0, 5.0)
    (3, 6.0, nan)
    (4, nan, nan)
    (5, 10.0, 20.0)
    (6, nan, 25.0)
    

    Hope that helps!

    Edit1: Added example Edit2: Really shouldn't join with floats... Changed 'key' field to an int.

    0 讨论(0)
提交回复
热议问题