Select element from list in Pandas Series based on another column

Question

I have a Pandas DataFrame of the following form:

  Name         Dates        Trigger
  John       [d1,d2,d3]     1
  Mike       [d4]           NaN
  Li         [d1,d4,d5]     2

where the rows in the column Dates are Python lists, where each element in the lists is a DateTime object (e.g. '2019-08-15').

My final goal is to obtain an array with the differences (in days) between the Dates at the index position mentioned in Trigger, resulting in a new column like:

       Date_diff
   [d2-d1,d2-d2,d2-d3]
   [NaN]/d4
   [d5-d1,d5-d4,d5-d5]

No matter what I've tried, I always failed to properly identify the correct element in the list based on the last column. Any suggestions?

mannaroth · Accepted Answer

After handling the NaN's, the following solution works:

df.apply(lambda row: row.Dates[row.Trigger], axis=1)

Select element from list in Pandas Series based on another column

Answers (2)

Related Questions