Sort by one column within groups of another without changing the positions of the grouping column

Question

Consider the DataFrame df:

import pandas as pd
df = pd.DataFrame(dict(A=list('babbaa'), B=list('zxyxzy')))
df

I want to sort B within the groups defined by A, but I don't want the positions of A to change.

If I try:

df.groupby('A', sort=False) \
    .apply(pd.DataFrame.sort_values, by='B') \
    .reset_index(drop=True)

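You'll notice that the rows for each value of A are now grouped together. What I want instead is for A to keep its original order (b, a, b, b, a, a) while B is sorted within each group, which gives B = x, x, y, z, y, z.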

Nickil Maveli · Accepted Answer

For your contrived example:

Sort by both A and B, then set A as the index and reset it. This discards the old integer index and leaves a reference DataFrame, sorted by A and then B, on a fresh 0..n-1 index.

A = df.sort_values(['A', 'B']).set_index('A').reset_index()

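Next, add A to the index alongside the existing integer index by passing append=True. Sort on the index level that holds A (level=1), then move that level back into a column with reset_index(level=1). What remains as the index is the original row positions, ordered by group.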

B = df.set_index('A', append=True).sort_index(level=1).reset_index(level=1)
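
To see what this step produces, here is a minimal sketch that runs the same call on the question's df and prints the index it leaves behind. The names by_group and original_positions are hypothetical, chosen only to avoid clashing with the column called B.

```python
import pandas as pd

df = pd.DataFrame(dict(A=list('babbaa'), B=list('zxyxzy')))

# Same call as in the answer, stored under a hypothetical name.
by_group = df.set_index('A', append=True).sort_index(level=1).reset_index(level=1)

# The index now lists the original row positions, group by group:
# first the rows where A == 'a', then the rows where A == 'b'.
original_positions = by_group.index.tolist()
print(original_positions)  # [1, 4, 5, 0, 2, 3]
```

These are the row labels that the sorted reference DataFrame needs in order to be placed back into the original layout.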

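Give A the index of B, then sort by that index so every row returns to its original position. (Here A and B are the two variables defined above, not the columns.)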

A.index = B.index
A.sort_index()
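
Put together, the three steps can be sketched as one runnable script. The variable names reference, positions and result are hypothetical stand-ins for the answer's A and B, used so the variables are not confused with the columns. The final check only confirms that column A is unchanged for this example data.

```python
import pandas as pd

df = pd.DataFrame(dict(A=list('babbaa'), B=list('zxyxzy')))

# Step 1: rows sorted by A, then B, on a fresh 0..n-1 index.
reference = df.sort_values(['A', 'B']).set_index('A').reset_index()

# Step 2: original row positions, ordered by group (A becomes a column again).
positions = df.set_index('A', append=True).sort_index(level=1).reset_index(level=1)

# Step 3: label the sorted rows with their target positions, then restore order.
reference.index = positions.index
result = reference.sort_index()

# Column A must sit exactly where it started.
assert result['A'].tolist() == df['A'].tolist()
print(result)
```

Note that sort_index() returns a new DataFrame rather than sorting in place, so the result has to be assigned (or displayed) to be used.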
