python: check dataframe columns: is there more than one value for each group?

Question

the following code:

import numpy as np
import pandas as pd

data=[['A', 1,2 ,5, 'blue'],
        ['A', 5,5,6, 'blue'],
        ['A', 4,6,7, 'blue']
        ,['B', 6,5,4,'yellow'],
        ['B',9,9,3, 'blue'],
        ['B', 7,9,1,'yellow']
        ,['B', 2,3,1,'yellow'],
        ['B', 5,1,2,'yellow'],
        ['C',2,10,9,'green']
        ,['C', 8,2,8,'green'],
        ['C', 5,4,3,'green'],
        ['C', 8,5 ,3,'green']]
df = pd.DataFrame(data, columns=['x','y','z','xy', 'color'])

groups = df.groupby('x')['color'].apply(list)
print(groups)

produces the following output:

x
A                        [blue, blue, blue]
B    [yellow, blue, yellow, yellow, yellow]
C              [green, green, green, green]
Name: color, dtype: object

I now want to check if there is more than one category for each 'x' value. For example, A has only one category but B has two. I am not sure if there is a way to do that.

python: check dataframe columns: is there more than one value for each group?

Answers (1)

Related Questions