sorting panda columns on ascending and then non 0 duplicates

Question

i have a pandas dataframe that i am trying to sort

here it would be unsorted

col1   col2    col3   col4
  2       0       0    0    
  7    1207    3262    3 
  8    1212    3266    1    
  5    1246    3263    4 
  4       0       0    1 
  6    1232    3265    0

this would be the ideal sort:

This would be the expected result

col1   col2    col3   col4
  6    1232    3265    0    
  2       0      0     0    
  8    1212    3266    1    
  4       0      0     1    
  7    1207    3262    3    
  5    1246    3263    4

What i want to do is sort by col4, but if there is a duplicate, then use the row that has a valid value (non 0) in col2. the problem is that col2 may not be ascending or descending, just containing a valid value.

The only way i can think of doing this is creating new dataframes, but hopefully there is better way.

@ganderson's method ended up working for me

df.sort_values(['col4', 'col2'], ascending=[True,True], inplace=True)

G. Anderson · Accepted Answer

In the provided example, this looks like

df.sort_values(['col4', 'col2'], ascending=[True,False]))

There is some abiguity in how you want the 2nd level of sort to be applied, but this should produce at least the posted output

sorting panda columns on ascending and then non 0 duplicates

Answers (2)

Related Questions