Deepcopy pandas DataFrame containing python objects (such as lists)

前端 未结 1 1560
说谎
说谎 2021-01-21 13:06

Need help understanding variable assignment, pointers, ...

The following is reproducible.

import pandas as pd

df          


        
1条回答
  •  时光取名叫无心
    2021-01-21 13:58

    When you run

    df['listDataSort'] = df['listData']
    

    All you do is copy the references of the lists to new columns. This means only a shallow copy is performed and both columns reference the same lists. So any change to one column will likely affect another.

    You can use a list comprehension with sorted which returns a copy of the data. This should be the easiest option for you.

    df['listDataSort'] = [sorted(x) for x in df['listDataSort']]
    df
    
                 listData        listDataSort
    0  [c, f, d, a, e, b]  [a, b, c, d, e, f]
    1     [5, 2, 1, 4, 3]     [1, 2, 3, 4, 5]
    

    Now, when it comes to the problem of making a copy of the entire DataFrame, things are a little more complicated. I would recommend deepcopy:

    import copy
    df2 = df.apply(copy.deepcopy)
    

    0 讨论(0)
提交回复
热议问题