How do I fill in blank cells?

Question

I have a csv file where some of the entries in some columns are blank. I have corresponding columns which have data which could be used to fill in the blank fields. Let's say one of the columns with blanks is called Old Info, and one of the columns with replacement information is called New Info. I don't want to replace Old Info with the New Info, I only want to fill in blanks in Old Info with data from New Info. Data would come from the same row, i.e. if Old Info Row 1 is blank, then information would be taken from New Info Row 1.

Additionally, I have a secondary column that also has replacement information, which could be called Secondary Replacement Info. If Old Info Row 1 is blank, and so is New Info Row 1, then I would want to replace Old Info Row 1 with Secondary Replacement Info Row 1. Here's some example data:

    Old Info     New Info   Secondary Replacement Info
1      Carl         Carl               Carl
2                   Diana              Diana
3      Jeremy       Jeremy             Jeremy
4                                      Jack

And here's the desired outcome:

    Old Info     New Info   Secondary Replacement Info
1      Carl         Carl               Carl
2      Diana        Diana              Diana
3      Jeremy       Jeremy             Jeremy
4      Jack                            Jack

So as you can see, the blanks in Old Info have been filled in. Row 2 was filled in by New Info, but Row 4 was filled in by Secondary Replacement Info, as New Info also had a blank. How would I write a function to accomplish all of this?

Roland · Accepted Answer

#import your data
#don't forget to set stringsAsFactors = FALSE
DF <- read.csv(text = "Old Info,New Info,Secondary Replacement Info
1,Carl,Carl,Carl
2,,Diana,Diana
3,Jeremy,Jeremy,Jeremy
4,,,Jack", stringsAsFactors = FALSE)

#a little function
fun <- function(x, y, z) {
  y[y == ""] <- z[y == ""] #substitute missings in y with values from z
  x[x == ""] <- y[x == ""] #substitute missings in x with values from y
  x #return
}

DF <- within(DF, Old.Info <- fun(Old.Info, New.Info, Secondary.Replacement.Info))
#  Old.Info New.Info Secondary.Replacement.Info
#1     Carl     Carl                       Carl
#2    Diana    Diana                      Diana
#3   Jeremy   Jeremy                     Jeremy
#4     Jack                                Jack

How do I fill in blank cells?

Answers (2)

Related Questions