Reputation: 176

Drop all rows of higher level multi-index if certain lower level index doesn't exist

Suppose I have a DataFrame with two MultiIndex levels, such as:

Now, if an index of the level 0 lacks the lower level index 'x' or 'y', I want to drop all rows associated with this index. So in this example I want to drop all rows of 'C' because there is no 'y' under 'C'. So the result should be:

Is there a nice/clean way to do this?

Upvotes: 2

Answers (2)

Anurag Dabas

Reputation: 24314

Try:

mask=df.unstack().isna().any(1)       
#created a mask to check if an index has a missing value or not

Finally:

df=df.loc[~df.index.get_level_values(0).isin(mask[mask].index)]
                                               #^getting index of where value is missing
                      #^excluding that value from the level 0

output of df:

Upvotes: 2

BENY

Reputation: 323226

Try with transform

out = df[df.groupby(level=0)['col1'].transform('count')==2]

Upvotes: 1

Drop all rows of higher level multi-index if certain lower level index doesn&#39;t exist

Answers (2)

Related Questions

Drop all rows of higher level multi-index if certain lower level index doesn't exist