I want to split Pandas Dataframe into 2 dataframes based on a condition

Question

I have a Base dataframe with 4 columns.

   column_A  column_B column_C   id
0         1         1     anna  123
1         2         1     anna    7
2        30         2      bob   42
2        20         2      bob   12
3        10         3  charlie    1
4       100         3    david    2

I want to split it into 2 different dataframes with the following properties.

Dataframe 1:

   column_A  column_B column_C   id
0         1         1     anna  123
1         2         1     anna    7
2        30         2      bob   42
2        20         2      bob   12

where the values in both column_B and column_C match those of another row

Dataframe 2:

   column_A  column_B column_C  id
3        10         3  charlie   1
4       100         3    david   2

where only the value in column_B matches another row (column_C differs)

Zero · Accepted Answer

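You could check for duplicates on column_B and column_C. With keep=False, duplicated marks every row whose (column_B, column_C) pair appears more than once, and grouping by that boolean result gives one dataframe per group.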

In [200]: dfs = {i: n for i, n in df.groupby(
                    df.duplicated(subset=['column_B', 'column_C'], keep=False))}

In [201]: dfs[True]
Out[201]:
   column_A  column_B column_C   id
0         1         1     anna  123
1         2         1     anna    7
2        30         2      bob   42
2        20         2      bob   12

In [202]: dfs[False]
Out[202]:
   column_A  column_B column_C  id
3        10         3  charlie   1
4       100         3    david   2
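
For anyone who wants to run this outside an IPython session, here is a minimal self-contained sketch of the same idea. It rebuilds the frame from the question and indexes with the boolean mask directly instead of going through groupby. The names mask, df_both_match and df_only_b_match are made up for illustration, and the frame uses a default 0..5 index rather than the repeated index label shown in the question.

```python
import pandas as pd

# Sample data copied from the question.
# Assumption: a default RangeIndex is used instead of the repeated label 2.
df = pd.DataFrame(
    {
        "column_A": [1, 2, 30, 20, 10, 100],
        "column_B": [1, 1, 2, 2, 3, 3],
        "column_C": ["anna", "anna", "bob", "bob", "charlie", "david"],
        "id": [123, 7, 42, 12, 1, 2],
    }
)

# True for every row whose (column_B, column_C) pair occurs more than once;
# keep=False flags all members of a duplicate group, not just the later ones.
mask = df.duplicated(subset=["column_B", "column_C"], keep=False)

# Hypothetical names: rows where both columns match another row, and the rest.
df_both_match = df[mask]
df_only_b_match = df[~mask]

print(df_both_match)
print(df_only_b_match)
```

Indexing with mask and ~mask always returns two frames, even when one of them is empty. With the groupby dictionary, dfs[True] or dfs[False] raises a KeyError if no row falls into that group. Note also that the second frame holds every row that is not a (column_B, column_C) duplicate, so it would include rows whose column_B value matches nothing else; in the sample data there are no such rows.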
