Divide value having same date in Dataframe to the count of values in each date

Question

I have two DataFrames one have values against various dates which you can replicate for vlaues of first date as following

import pandas as pd
values_df = pd.DataFrame({'date': ['2021-08-16', '2021-08-16', '2021-08-16', '2021-08-16', '2021-08-16', '2021-08-16', '2021-08-16', '2021-08-16','2021-08-16', '2021-08-16'], 
                   'val': ['0.38956', '0.733198', '0.1856580', '-0.759235', '0.112378', '0.7436924' , '0.32941900', '0.7423981', '0.71269175', '0.127698146']})

other one having count of those values for each date, it can be replicated using

count = pd.DataFrame({'date': ['2021-08-16', '2021-08-17', '2021-08-18', '2021-08-19', '2021-08-20', '2021-08-21', '2021-08-22', '2021-08-23','2021-08-24', '2021-08-25'], 
                      'count': ['12', '47', '85', '65', '36', '126' , '75', '85', '175', '63']})

now there are 12 values for the first date 2021-08-16 I need to divide every value 2021-08-16 of with 12 and same with all other values according to their dates and number of counts. I have tried to create new column of values_df that has number of counts so that I can do values_df['val'] / values_df['count']but it doesn't repeat values for each date. Can someone guide me how can I achieve this? Thank you

jezrael · Accepted Answer

If want reuse count DataFrame use Series.map:

values_df['new'] = values_df['val'].astype(float) / values_df['date'].map(count.set_index('date')['count'].astype(int))
print (values_df)
         date          val       new
0  2021-08-16      0.38956  0.032463
1  2021-08-16     0.733198  0.061100
2  2021-08-16    0.1856580  0.015471
3  2021-08-16    -0.759235 -0.063270
4  2021-08-16     0.112378  0.009365
5  2021-08-16    0.7436924  0.061974
6  2021-08-16   0.32941900  0.027452
7  2021-08-16    0.7423981  0.061867
8  2021-08-16   0.71269175  0.059391
9  2021-08-16  0.127698146  0.010642

Or if want count number of values in values_df only use Series.map with Series.value_counts:

values_df['new'] = values_df['val'].astype(float) / values_df['date'].map(values_df['date'].value_counts())
print (values_df)
         date          val       new
0  2021-08-16      0.38956  0.038956
1  2021-08-16     0.733198  0.073320
2  2021-08-16    0.1856580  0.018566
3  2021-08-16    -0.759235 -0.075924
4  2021-08-16     0.112378  0.011238
5  2021-08-16    0.7436924  0.074369
6  2021-08-16   0.32941900  0.032942
7  2021-08-16    0.7423981  0.074240
8  2021-08-16   0.71269175  0.071269
9  2021-08-16  0.127698146  0.012770

Divide value having same date in Dataframe to the count of values in each date

Answers (2)

Related Questions