Fill in missing data that is same across columns

Question

I need to fill in some missing data from a merge that is the same in all columns. After the merge, all the values are NA, but i would like a quick way to fill them in since their values are the same.

Example:

df <- structure(list(date = structure(c(-25932, -25931, -25930, -25929, 
-25928), class = "Date"), year = c(1899, 1899, 1899, 1899, 1899
), month = c(1, 1, 1, 1, 1), day = c(1, 2, 3, 4, 5), test1 = c(NA, 
NA, "VAR1", NA, NA), test2 = c(NA, NA, "VAR2", NA, NA), test3 = c(NA, 
NA, "VAR3", NA, NA)), .Names = c("date", "year", "month", "day", 
"test1", "test2", "test3"), row.names = c(NA, 5L), class = "data.frame")

# Tedious way, but works
df$test1 <- "VAR1"

# Desired output
    date     year month day test1 test2 test3
1 1899-01-01 1899     1   1  VAR1  VAR2  VAR3
2 1899-01-02 1899     1   2  VAR1  VAR2  VAR3
3 1899-01-03 1899     1   3  VAR1  VAR2  VAR3
4 1899-01-04 1899     1   4  VAR1  VAR2  VAR3
5 1899-01-05 1899     1   5  VAR1  VAR2  VAR3

A5C1D2H2I1M1N2O1R2T1 · Accepted Answer

You can try something like the following:

df
#         date year month day test1 test2 test3
# 1 1899-01-01 1899     1   1      
# 2 1899-01-02 1899     1   2      
# 3 1899-01-03 1899     1   3  VAR1  VAR2  VAR3
# 4 1899-01-04 1899     1   4      
# 5 1899-01-05 1899     1   5      

df[grep("test", names(df))] <- lapply(df[grep("test", names(df))], 
                                      function(x) x[!is.na(x)][1])
df
#         date year month day test1 test2 test3
# 1 1899-01-01 1899     1   1  VAR1  VAR2  VAR3
# 2 1899-01-02 1899     1   2  VAR1  VAR2  VAR3
# 3 1899-01-03 1899     1   3  VAR1  VAR2  VAR3
# 4 1899-01-04 1899     1   4  VAR1  VAR2  VAR3
# 5 1899-01-05 1899     1   5  VAR1  VAR2  VAR3

Fill in missing data that is same across columns

Answers (2)

Related Questions