Iterating through comparison of dates in Python Pandas

Question

Right now I am iterating over a pandas dataframe to patch data inconsistencies with datetime columns, but the runtime is brutal.

def fix(row):
        return row.datetime_column2 if row.datetime_column1 > row.datetime_column2 else row.datetime_column1

df['datetime_column1'] = df.apply(fix, axis = 1)

Is there a smarter way to do this?

MaxU - stand with Ukraine · Accepted Answer

IIUC you can use this vectorized approach:

df['datetime_column1'] =  \
    np.where(df['datetime_column1'] > df['datetime_column2'], 
             df['datetime_column2'],
             df['datetime_column1'])

or:

df['datetime_column1'] = df[['datetime_column1','datetime_column2']].min(1)

Iterating through comparison of dates in Python Pandas

Answers (2)

Related Questions