Reputation: 21280
I have following Dataframe
:
fk name
0 105 Coca-Cola1
1 105 Coca-Cola1
2 106 Coca-Cola2
3 115 Coca-Cola5
4 115 Coca-Cola5
5 115 Coca-Cola5
6 108 Coca-Cola8
7 105 Coca-Cola1
I want to count frequency of occurrences:
[[105, Coca-Cola1, 3],
[106, Coca-Cola2, 1],
[115, Coca-Cola5, 3],
[108, Coca-Cola8, 1]]
Any idea how to get so ?
Upvotes: 3
Views: 379
Reputation: 863531
You can use groupby
with aggregating size
, then reset_index
and last convert to numpy array
by DataFrame.values
with numpy.ndarray.tolist
:
df = df.groupby(['fk','name']).size().reset_index().values.tolist()
print (df)
[[105, 'Coca-Cola1', 3],
[106, 'Coca-Cola2', 1],
[108, 'Coca-Cola8', 1],
[115, 'Coca-Cola5', 3]]
If is order important add parameter sort=False
to groupby
:
df = df.groupby(['fk','name'], sort=False).size().reset_index().values.tolist()
print (df)
[[105, 'Coca-Cola1', 3],
[106, 'Coca-Cola2', 1],
[115, 'Coca-Cola5', 3],
[108, 'Coca-Cola8', 1]]
Upvotes: 6