Replace column value based on value in other column, for all rows in a pandas dataframe

Question

I am having trouble thinking pythonically about something, and would love some guidance.

I have a dataframe that contains columns with dates of events at which certain files should be uploaded, and a column with the names of those events. So events can be X, Y, Z, and files can be 1, 2, 3.

Not all files need to be uploaded at all events, i.e. if it's Event X, then files 1, 2, and 3 need to be uploaded, but if it's Event Y, then only file 3 needs to be uploaded. The date columns either have a date in them, or are blank.

What I want to do is, for all the files for events that are not needed, replace blank with "Not Needed".

Initial:

    File1   File2  File3
X   Aug 1          Sept 1
X   Aug 3   Aug 4  Sept 9
Y                  Sept 10
Z   Aug 12
X   Aug 13  Aug 15
Z   Aug 1

Goal

     File1   File2  File3
X   Aug 1          Sept 1
X   Aug 3   Aug 4  Sept 9
Y   NN      NN     Sept 10
Z   Aug 12  NN     NN
X   Aug 13  Aug 15
Z   Aug 1   NN     NN

So in other words, for the blanks that SHOULD be blank because a file is not expected, replace that value with "Not Needed", while leaving the other blanks alone.

I have tried doing this with .replace(), .apply() with functions, and I am not having any success.

The code below sort of works, but it works not only when there is a match, but even when there is not a match.

Fill in descriptive text for scales not collected at certain visits (where upload dates would be blank)
df_combined['FAQ-Audio-upDate'] = np.where(df_combined['VisitName'] == "Screening", "FAQ Not Expected", "")
df_combined['FAQ-Form-upDate'] = np.where(df_combined['VisitName'] == "Screening", "FAQ Not Expected", "")

How can I change the value in one column based on the value in another column, across the entire dataframe? What I want is basically this:

For every row in the dataframe If the value in the VisitName column == X Change the value in ColumnA to "Not Expected"
Thank you!!

Replace column value based on value in other column, for all rows in a pandas dataframe

Answers (1)

Related Questions