If statement that checks many variables to create new variable

Question

I created a function in which R looks at many variables and then populates a new column as follows:

-if any of the variables has a "1" entry, the new column should be a "1"

-if all of the variables have NA entries, the new column should have an NA value.

This should be very simple, yet it somehow does not work. I think the issue is in the part of the code where I check whether they are all non-NA values: "if(!((is.na(variable))|..."

Any idea of a better way to code this? Please help!

Note: many more calculations are done inside this function, but to show the function structure and my specific issue I have left only this part in.

#if they answered "1" (yes) to receiving any specific treatment, 
#then say "1" (yes) in a new column called treated_psych

diag_treated <- function(x){
  for (v in 1:length(x)) assign(names(x)[v], x[[v]])

if(!((is.na(CurrTx6.1_Group))|(is.na(CurrTx6.1_Ind))| (is.na(CurrTx6.1_Fam))|
       (is.na(CurrTx6.1_Couples))|(is.na(CurrTx7a_CBTAnx))|(is.na(CurrTx7b_CBTDep))|
       (is.na(CurrTx7c_CBTInsom)))){

    if(CurrTx6.1_Group==1 | CurrTx6.1_Ind==1 | CurrTx6.1_Fam==1 | CurrTx6.1_Couples==1 |
       CurrTx7a_CBTAnx==1 | CurrTx7b_CBTDep==1 | CurrTx7c_CBTInsom==1)
      {
      treated_psych <-1 
      }
    else{treated_psych <- 0}
}else{treated_psych<-NA}

treat <- data.frame(treated_psych)
  return(treat)
}

#call function
diagnoses_treated <- adply(dataset, 1, diag_treated)

Katy Torres · Accepted Answer

I ended up subsetting the columns, running two apply functions, and then using a for loop that goes through both vectors created by the apply calls to make my new variable. Not very elegant or efficient, but it works.

#if they answered "1" (yes) to receiving any specific treatment, 
#then say "1" (yes) in a new column called treated_psych

#subset data by just these columns
df_psych<- dat_with_pcl5[c("CurrTx6.1_Group", "CurrTx6.1_Ind", "CurrTx6.1_Fam", 
"CurrTx6.1_Couples", "CurrTx7a_CBTAnx", "CurrTx7b_CBTDep",  "CurrTx7c_CBTInsom")]

#make one vector if ANY are 1, make another vector if ALL are NA
treated_psych1<- apply(df_psych, 1, function(r) any(r %in% "1"))
treated_psych.na<- apply(df_psych, 1, function(r) all(r %in% NA))

# Loop through both vectors and create new variable
#if true in treated_psych1 then 1, if true in treated_psych.na then NA
#start from 0 (assumed default, matching the else branch in the question)
treated_psych <- rep(0, length(treated_psych1))
for(i in seq_along(treated_psych1)){
if (treated_psych1[i]==TRUE){treated_psych[i] <- 1}
if (treated_psych.na[i] ==TRUE){treated_psych[i] <- NA}
}
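
For what it's worth, the same rule (1 if any column is 1, NA if every column is NA, otherwise 0) can probably be written without a loop. This is only a minimal sketch on made-up data: the short column names Group, Ind and Fam are hypothetical stand-ins for the real CurrTx columns, and the 0 default is an assumption taken from the else branch in the question.

```r
# Toy data; Group/Ind/Fam are hypothetical stand-ins for the real columns
df_psych <- data.frame(
  Group = c(1, 0, NA, NA),
  Ind   = c(0, 0, NA, 1),
  Fam   = c(NA, 0, NA, NA)
)

m <- as.matrix(df_psych)

# TRUE where at least one entry in the row equals 1 (NAs are ignored)
any_one <- rowSums(m == 1, na.rm = TRUE) > 0

# TRUE where the row has no non-NA entry at all
all_na <- rowSums(!is.na(m)) == 0

# NA if everything is missing, 1 if any entry is 1, otherwise 0 (assumed default)
treated_psych <- ifelse(all_na, NA, ifelse(any_one, 1, 0))
treated_psych
# 1 0 NA 1
```

Because rowSums and ifelse work on whole vectors, there is no need for apply or a for loop, and a row with a mix of NA and 1 still comes out as 1 instead of NA.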
