Night Walker
Night Walker

Reputation: 21280

count occurrences with dataframe?

I have following Dataframe:

               fk  name
0              105  Coca-Cola1 
1              105  Coca-Cola1
2              106  Coca-Cola2
3              115  Coca-Cola5
4              115  Coca-Cola5
5              115  Coca-Cola5
6              108  Coca-Cola8
7              105  Coca-Cola1

I want to count frequency of occurrences:

[[105, Coca-Cola1, 3],
 [106, Coca-Cola2, 1],
 [115, Coca-Cola5, 3],
 [108, Coca-Cola8, 1]]

Any idea how to get so ?

Upvotes: 3

Views: 379

Answers (1)

jezrael
jezrael

Reputation: 863531

You can use groupby with aggregating size, then reset_index and last convert to numpy array by DataFrame.values with numpy.ndarray.tolist:

df = df.groupby(['fk','name']).size().reset_index().values.tolist()
print (df)
[[105, 'Coca-Cola1', 3], 
 [106, 'Coca-Cola2', 1], 
 [108, 'Coca-Cola8', 1], 
 [115, 'Coca-Cola5', 3]]

If is order important add parameter sort=False to groupby:

df = df.groupby(['fk','name'], sort=False).size().reset_index().values.tolist()
print (df)
[[105, 'Coca-Cola1', 3], 
 [106, 'Coca-Cola2', 1], 
 [115, 'Coca-Cola5', 3], 
 [108, 'Coca-Cola8', 1]]

Upvotes: 6

Related Questions