Use pandas groupby.size() results for arithmetical operation

Question

I'm stuck on the following problem and couldn't resolve it on my own or with the similar questions I found on Stack Overflow.

To keep it simple, I'll give a short example of my problem:

I have a DataFrame with several columns, one of which holds the ID of a user. The same user may have several entries in this data frame:

|   |  userID   |      col2      | col3  |
+---+-----------+----------------+-------+
| 1 | 1         | a              |     b |
| 2 | 1         | c              |     d |
| 3 | 2         | a              |     a |
| 4 | 3         | d              |     e |

Something like this. Now I want to know the number of rows that belong to a given userID. For this I tried df.groupby('userID').size(), whose result I then want to use in another simple calculation, such as a division. But when I try to save the result of the calculation in a separate column, I keep getting NaN values.
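To illustrate where the NaN values can come from, here is a minimal sketch. It assumes the main frame uses the default 0-based index and that the counts are assigned straight into a new column; the names `main`, `counts` and `appearances` are made up for this example and are not from my real code.

```python
import pandas as pd

# Hypothetical reconstruction of the main frame (default RangeIndex 0..3).
main = pd.DataFrame({
    "userID": [1, 1, 2, 3],
    "col2": ["a", "c", "a", "d"],
    "col3": ["b", "d", "a", "e"],
})

# size() returns a Series indexed by userID (1, 2, 3), not by row label.
counts = main.groupby("userID").size()

# Assignment aligns on index labels: row label 0 has no match among the
# userIDs 1, 2, 3, so it becomes NaN, and the other rows receive counts
# that belong to a different user.
main["appearances"] = counts
print(main)
```

The point of the sketch is that the result of size() is keyed by userID, while a column assignment matches on the frame's row labels, so the two do not line up.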

Is there a way to solve this so that I get the result of the calculation in a separate column?

Thanks for your help!

Edit:

To make clear what my output should look like: the data frame above is my main data frame, so to speak. Besides this frame I have a second frame that looks like this:

|   |  userID   |      value     | value/appearances  |
+---+-----------+----------------+--------------------+
| 1 | 1         | 10             |     10 / 2 = 5     |
| 3 | 2         | 20             |     20 / 1 = 20    |
| 4 | 3         | 30             |     30 / 1 = 30    |

So basically I want the column 'value/appearances' to hold the number in the value column divided by the number of times that user appears in the main data frame. For the user with ID=1 this would be 10/2, as this user has a value of 10 and 2 rows in the main data frame. I hope this makes it a bit clearer.
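The calculation described above could be sketched as follows. This is only one possible approach, not a confirmed solution: it assumes the two frames are called `main` and `second` (hypothetical names) and uses `Series.map` to look up each user's row count by userID instead of by row label.

```python
import pandas as pd

# Hypothetical reconstruction of both frames from the tables above.
main = pd.DataFrame(
    {"userID": [1, 1, 2, 3],
     "col2": ["a", "c", "a", "d"],
     "col3": ["b", "d", "a", "e"]},
    index=[1, 2, 3, 4],
)
second = pd.DataFrame(
    {"userID": [1, 2, 3], "value": [10, 20, 30]},
    index=[1, 3, 4],
)

# Number of rows per user in the main frame, indexed by userID.
counts = main.groupby("userID").size()

# map() looks up each userID in counts, so the resulting Series keeps
# second's own index and the division lines up row by row.
second["value/appearances"] = second["value"] / second["userID"].map(counts)
print(second)
```

With the example data this gives 5, 20 and 30 for users 1, 2 and 3. A userID that never occurs in the main frame would still produce NaN, since there is no count to divide by.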
