I have the following data.
ID1 ID2 Value
1 2 5.5
2 1 10
1 3 5
Expected output:
ID1 ID2 Value
1 2 5.5
2
If all values in pairs in columns ID1
and ID2
are unique first create helper DataFrame
with np.sort
and return all duplicated rows with DataFrame.duplicated:
df1 = pd.DataFrame(np.sort(df[['ID1','ID2']], axis=1), index=df.index)
df = df[df1.duplicated(keep=False)]
print (df)
ID1 ID2 Value
0 1 2 5.5
1 2 1 10.0