Iteration with multiple conditions in Pandas

Question

In part of my code, I am searching for subsets of a DataFrame in order to manipulate them later. Part of the code that takes a very long time goes as follow:

for record in records.itertuples():
    matches_ids = df[((df['column_1'] < record.attribute_1) &
                               (record.attribute_2 < df['column_2']) &
                               (df['column_3'] < record.attribute_3) &
                               (record.attribute_4 == df['column_4']) &
                               (df['column_5'] != 'value'))].index

Is there a way to reduce the complexity of the code?

expected output: list of indices that answer all conditions

p.s removing conditions reduce runtime of almost 10-fold for each condition

Iteration with multiple conditions in Pandas

Answers (1)

Related Questions