Replacing NA values with a random value picked from another data frame

Question

In my data I have two columns that contain NA values. So I made a new data frame with no NA values by removing the rows that contained NAs.

What I want is that each time there is an NA value in the original data (called metadata here), one value is sampled at random from the new data frame (called temp). (I removed the NAs, so there is no risk of picking NA again.)

However, my original data is not changing; it stays the same after running this:

temp = metadata %>% drop_na()
for (i in length(metadata$Gender)){
  if (is.na(metadata$Gender[[i]])) {
    metadata$Gender[[i]] = sample(temp$Gender, 1)
  }
  
  if (is.na(metadata$Age[[i]])){
    metadata$Age[[i]] = sample(temp$Age, 1)
  }
}
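
A likely cause, sketched under assumptions: `length(metadata$Gender)` returns a single number, so `for (i in length(...))` visits only the last row, and if that row has no NA nothing changes. The sketch below uses `seq_along()` to visit every row. The small `metadata` data frame is a hypothetical stand-in for the real data, and `complete.cases()` is used as a base R substitute for `drop_na()` so the example runs without extra packages.

```r
set.seed(42)  # only to make this illustration reproducible

# Hypothetical stand-in for the real `metadata` (assumed column types)
metadata <- data.frame(
  Gender = c("F", NA, "M", "F", NA, "M"),
  Age    = c(34, 51, NA, 28, NA, 45),
  stringsAsFactors = FALSE
)

# Keep only rows with no NA in any column (base R stand-in for drop_na())
temp <- metadata[complete.cases(metadata), ]

# length(metadata$Gender) is the single number 6, so
# `for (i in length(metadata$Gender))` would visit only i = 6.
# seq_along() yields 1, 2, ..., 6 instead.
for (i in seq_along(metadata$Gender)) {
  if (is.na(metadata$Gender[i])) {
    # Sample a row index rather than the vector itself: sample(x, 1)
    # draws from 1:x when x is a single number.
    metadata$Gender[i] <- temp$Gender[sample.int(nrow(temp), 1)]
  }
  if (is.na(metadata$Age[i])) {
    metadata$Age[i] <- temp$Age[sample.int(nrow(temp), 1)]
  }
}
```

The loop can probably be avoided altogether by assigning to all NA positions at once, for example `metadata$Age[is.na(metadata$Age)] <- sample(temp$Age, sum(is.na(metadata$Age)), replace = TRUE)`, with the same caveat about `sample()` when `temp` has a single row.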

Answers (1)