pandas set value if dates are the same in two diffrent df according to dates range

Question

I have df: while disease is binary column ( 0 or 1)

diagnosis_date    id  disease
2013-05-03         1     0
2013-05-08         1     0
2013-06-08         1     1
2013-01-01         2     0 
.....

and I have range of dates- 2013-01-01 until 2013-12-31:

date_index=pd.date_range(start='1/1/2013', end='31/12/2013')
dates=pd.DataFrame(date_index,columns=['date'])

I want for each id in df, to set the date range as date_index, and if the date is the same as diagnosis date, to set the value like it in the disease column, otherwise the value will be set to zero. the desire df:

date    id    disease
01-01    1       0
02-01     1      0
03-01     1      0
...
05-03     1      0
05-04     1      0
...
06-08    1      1
 ....
12-31     1      0
01-01     2      1
01-02     2      0 
...

Thanks

sharathnatraj · Accepted Answer

Here you go:

date_index=pd.date_range(start='1/1/2013', end='31/12/2013')
dates = pd.DataFrame()
for i in df.id.unique():
    dates=pd.concat([dates,pd.DataFrame({'date':date_index, 'id' : np.full(len(date_index),i)})])
df.diagnosis_date = pd.to_datetime(df['diagnosis_date'])
df1 = pd.merge(dates,df, left_on=['id','date'], right_on=['id','diagnosis_date'], how='left')[['date','id','disease']].fillna(0)
df1['disease'] = df1.disease.astype(int)

Tested and prints correctly.

pandas set value if dates are the same in two diffrent df according to dates range

Answers (2)

Related Questions