How to change characters into NA?

Question

I have a census dataset in which missing values are indicated with a ?. When I check for incomplete cases, R reports none, because it treats ? as a valid character. Is there any way to change all the ? to NA? I would like to run multiple imputation with the mice package afterwards to fill in the missing data.

Sowmya S. Manian · Accepted Answer

Creating data frame df

df <- data.frame(A=c("?",1,2),B=c(2,3,"?"))
df
#   A B
# 1 ? 2
# 2 1 3
# 3 2 ?

I. Using replace() function

replace(df,df == "?",NA)
#      A    B
# 1 <NA>    2
# 2    1    3
# 3    2 <NA>
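
Note that replace() leaves both columns as character vectors, so they usually need a type conversion before checking for incomplete cases or running imputation. The following is only a minimal sketch on the same toy df; the names df_na and df_num are made up here for illustration, and a real census dataset may need per-column handling.

```r
# Toy data frame from above; stringsAsFactors = FALSE keeps the columns as character
df <- data.frame(A = c("?", 1, 2), B = c(2, 3, "?"), stringsAsFactors = FALSE)

# Replace every "?" with NA (the columns are still character at this point)
df_na <- replace(df, df == "?", NA)

# Convert each column to its natural type (integer for this toy data)
df_num <- df_na
df_num[] <- lapply(df_na, type.convert, as.is = TRUE)

# Rows that contain an NA are now reported as incomplete
complete.cases(df_num)
# [1] FALSE  TRUE FALSE
```

Assigning into df_num[] keeps the data frame structure while swapping in the converted columns.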

II. While importing a file with ?

# "NA" must be quoted: na.strings expects the strings that stand for missing values
data <- read.table("xyz.csv", sep = ",", header = TRUE, na.strings = c("?", "NA"))
data
#    A  B
# 1  1 NA
# 2  2  3
# 3  3  4
# 4 NA NA
# 5 NA NA
# 6  4  5
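
To try the import approach without a file on disk, the same call can be run on inline text. This is a sketch under assumptions: the contents of csv_text are hypothetical and are not the actual xyz.csv from the answer.

```r
# Hypothetical file contents, supplied inline via the text= argument
csv_text <- "A,B
1,?
2,3
?,NA"

# Treat both "?" and the literal string "NA" as missing while reading
data <- read.table(text = csv_text, sep = ",", header = TRUE,
                   na.strings = c("?", "NA"))

str(data)             # both columns come in as integer, not character
colSums(is.na(data))  # number of missing values per column
```

Because the ? entries are converted while the file is read, the columns get numeric types straight away and no separate replacement step is needed.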
