How to label rows by unique pairs of other rows in pandas 0.19.2

前端未结

关注

 2  980

I have a dataframe df like this but much larger.

  ID_0 ID_1  location
0    a    b     1
1    a    c     1
2    a    b     0
3    d    c     0
4


                      
              相关标签:


      
      
        
          2条回答        

        
                         				            
            
           
            
                              
                
              
              
                
                  暗喜        
                
              
                            
                2021-01-23 19:49
              
            
            
                                                                       
You need GroupBy.ngroup, new in 0.20.2:

df['group_ID'] = df.groupby(['ID_0', 'ID_1']).ngroup()
print (df)
  ID_0 ID_1  location  group_ID
0    a    b         1         0
1    a    c         1         1
2    a    b         0         0
3    d    c         0         2
4    a    c         0         1
5    a    c         1         1




df['group_ID'] = df.groupby(['ID_0', 'ID_1']).grouper.group_info[0]
print (df)
  ID_0 ID_1  location  group_ID
0    a    b         1         0
1    a    c         1         1
2    a    b         0         0
3    d    c         0         2
4    a    c         0         1
5    a    c         1         1

                                                                        
                                                        
            
            
              
                
                0
              
                 
                
               讨论(0)
              
              
                                                   
              
                                                            
            
                      
                    


               
            
    发布评论:
    
         
                        
    
    提交评论 
  
  

                    
                    
                    
                        
                        
                         加载中...
                        
                    
                
          
          	          
            
           
            
                              
                
              
              
                
                  广开言路        
                
              
                            
                2021-01-23 19:58
              
            
            
                                                                       
This should do the trick without using the GroupBy.ngroup which is only supported in newer pandas versions:

df['group_ID'] = df.groupby(['ID_0', 'ID_1']).grouper.group_info[0]

    ID_0    ID_1    location    group_ID
0   a       b       1           0
1   a       c       1           1
2   a       b       0           0
3   d       c       0           2
4   a       c       0           1


Find more information at this SO post: Python Pandas: How can I group by and assign an id to all the items in a group?
                                                                        
                                                        
            
            
              
                
                0
              
                 
                
               讨论(0)
              
              
                                                   
              
                                                            
            
                      
                    


               
            
    发布评论:
    
         
                        
    
    提交评论 
  
  

                    
                    
                    
                        
                        
                         加载中...
                        
                    
                
          
          	          
                             
        
        
          
            
            
              
              
            
    


                                 
              
            
                          
    

        
         
                验证码
                
                  
                
                
                   看不清?
                
              
                                  
                    
   
                 
             
              提交回复