Count sum of of patterns matches in pandas

Question

Hello I have a df such as :

Groups COL1
G1   Seq:1
G1   Seq:2
G1   Seq_1
G1   Seq:4
G2   Seq_2
G2   Seq_3
G2   Seq_4
G3   Seq:5
G3   Seq:6
G4   Seq:7
G4   Seq_5

and I would like to count :

Nb Groups with only ":" = 1 (G3)
Nb Groups with not only ":" = 2(G1 and G4 )
Nb Groups without any ":" = 1 (G2)

does someone have na idea ? I guess I should sue a re.sub and do the sum of each Groups in pandas ?

Ch3steR · Accepted Answer

You can use this to count using pd.Series.str.contains then use GroupBy.all and GroupBy.any

om = df['COL1'].str.contains(':')

one = om.groupby(df['Groups']).all().sum() # 1
two = om.groupby(df['Groups']).any().sum() - one # 2 
# minus one because `any` counts all Trues too so we need 
# subtract groups with all Trues.
three = (~om).groupby(df['Groups']).all().sum() # 1

Count sum of of patterns matches in pandas

Answers (2)

Related Questions