Apply a condition over paired columns

Question

suppose to have the following situation:

    Statistic1       Condition1     Statistic2       Condition2         
      0.00001            Y             0.02              NA      
      0.03               Y             0.0001            NA         
      0.01               NA            0.001              Y       
     ..............

For a total of 20.000 rows and 60 columns. Suppose you want to replace in the column "Condition*" the NA/Y with 0 if the value in the relative Statistic* column is <0.05. The check will involve the paired columns Statistic*-Condition*. How is it possible to do this over a large number of columns and rows?

Thank you in advance

B

Esben Eickhardt · Accepted Answer

You make a boolen for each column and then write and (&) between them. Here a simple example where I check if two columns live up to the condition that the numbers in both columns have to be above three.

# Creating data
df <- data.frame(a = c(1,2,3,4), b = c(2,2,3,2))

# Running conditions on both columns and storing results in a new column
df$c <- df$a>2 & df$b>2

If you want to make replacements in one column based on another column, you can do the following.

# Creating data
df <- data.frame(a = c(1,2,3,4), b = c(2,2,3,2))

# If column a is above 2 column b is set to zero
df$b[df$a>2] <- 0

In the future please supply example data and output such that we can help.

Apply a condition over paired columns

Answers (2)

Related Questions