Dropping rows that have a string contained in other rows in pandas

Question

Given a dataframe in this form:

 ID      A
130     Yes
130-1   Yes
130-2   Yes
200     No
201     No
201-10  No
201-101 Yes
201-22  Yes
300     No

I want to drop the rows whose ID value appears before the hyphen (-) in the ID of another row. For example, I would drop 201, since there are 201-10, 201-101, etc.

Expected output:

 ID      A
130-1   Yes
130-2   Yes
200     No
201-10  No
201-101 Yes
201-22  Yes
300     No

user3483203 · Accepted Answer

You can use duplicated with a bitwise XOR. Note that this relies on each value without a hyphen appearing before the hyphenated values that share its prefix.

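# Part of each ID before the first hyphen
s = df['ID'].str.split('-').str[0]
# True only for the first row of each prefix that occurs more than once
m = s.duplicated(keep=False) ^ s.duplicated()

# Keep every row except those flagged by m
df[~m]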

        ID    A
1    130-1  Yes
2    130-2  Yes
3      200   No
5   201-10   No
6  201-101  Yes
7   201-22  Yes
8      300   No
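
If the un-hyphenated rows are not guaranteed to come first, a variant that does not depend on row order may be safer. The following is only a sketch under stated assumptions, not part of the accepted answer: it rebuilds the sample frame from the question, and the names has_hyphen, prefixes, drop_mask and result are illustrative. It assumes ID is stored as strings.

```python
import pandas as pd

# Sample data from the question (ID assumed to be a string column)
df = pd.DataFrame({
    "ID": ["130", "130-1", "130-2", "200", "201",
           "201-10", "201-101", "201-22", "300"],
    "A": ["Yes", "Yes", "Yes", "No", "No", "No", "Yes", "Yes", "No"],
})

# Rows whose ID contains a literal hyphen
has_hyphen = df["ID"].str.contains("-", regex=False)

# Prefixes (text before the first hyphen) taken from hyphenated IDs only
prefixes = df.loc[has_hyphen, "ID"].str.split("-").str[0]

# Drop a row only if it has no hyphen and its ID is a prefix of another row
drop_mask = ~has_hyphen & df["ID"].isin(prefixes)

result = df[~drop_mask]
print(result)
```

Because the mask is built from set membership rather than from duplicated, shuffling the rows should not change which IDs are dropped, and a group of hyphenated IDs with no bare parent row is left untouched.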
