How do I limit x-range of spline() interpolation to first and last non-NA value in dplyr?

Question

I want to interpolate missing values using dplyr, piping, and spline().

Data:

test <- structure(list(site = structure(c(3L, 3L, 3L, 3L, 3L, 3L, 1L, 
    1L, 1L, 1L, 2L, 2L, 2L, 2L), .Label = c("lake", "stream", "wetland"
    ), class = "factor"), depth = c(0L, -3L, -4L, -8L, -10L, -14L, 
    0L, -1L, -3L, -5L, 0L, -2L, -4L, -6L), var1 = c(NA, 1L, 3L, NA, 
    6L, NA, 1L, 2L, NA, 4L, 1L, NA, NA, 4L), var2 = c(1L, NA, 3L, 
    4L, 8L, NA, NA, NA, NA, NA, NA, 2L, NA, NA)), .Names = c("site", 
    "depth", "var1", "var2"), class = "data.frame", row.names = c(NA, 
    -14L))

Q1: How can I keep the following working code but restrict interpolation to the range between the first and last non-NA value of each variable? For example, for wetland it should interpolate var1 only at depth -8 and return NA at depths 0 and -14.

library(tidyverse)

test_int <- test %>% 
    group_by(site) %>% 
    mutate_at(vars(c(var1, var2)),
              funs("i" = if(sum(!is.na(.)) > 1) 
                             spline(x=depth, y=., xout=depth)[["y"]]
                         else
                             NA))
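One possible way to approach Q1, as a minimal sketch rather than a definitive answer: interpolate from the non-NA points only, then set every output outside their depth range back to NA. The helper name `spline_inside` is hypothetical (it is not part of base R or dplyr). The check below uses the wetland rows from the data above.

```r
# Hypothetical helper (not from the question): spline-interpolate y over x,
# but return NA outside the x-range spanned by the non-NA values of y.
spline_inside <- function(x, y) {
  ok <- !is.na(y)
  # Fewer than two known points: nothing to interpolate, so return all NA
  # (this mirrors the `else NA` branch of the question's code).
  if (sum(ok) < 2) return(rep(NA_real_, length(y)))
  out <- spline(x = x[ok], y = y[ok], xout = x)[["y"]]
  # Blank out anything beyond the first or last known point.
  out[x < min(x[ok]) | x > max(x[ok])] <- NA_real_
  out
}

# Wetland rows from the question's data.
depth <- c(0, -3, -4, -8, -10, -14)
var1  <- c(NA, 1, 3, NA, 6, NA)
var1_i <- spline_inside(depth, var1)
is.na(var1_i)  # depths 0 and -14 stay NA; depth -8 is filled in
```

Inside the grouped pipeline, the helper would take the place of the direct spline() call. Assuming dplyr 1.0 or later, that could look like `mutate(across(c(var1, var2), ~ spline_inside(depth, .x), .names = "{.col}_i"))` after `group_by(site)`.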

Q2: Is there a way to bound my interpolated values to the range 0 to Inf? Or is this not appropriate with spline() (e.g., should I use another interpolation method such as smooth or loess)?
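For Q2, the simplest reading of "bound" is to clamp the interpolated values at zero after the fact. The sketch below assumes that reading; `clamp_nonneg` is a hypothetical name. Note that this only truncates negative overshoot and does not change how the spline itself is fitted, so whether it is appropriate depends on the data.

```r
# Hypothetical post-processing step: clamp interpolated values at zero.
# pmax() keeps NA values as NA, so positions left uninterpolated stay NA.
clamp_nonneg <- function(v) pmax(v, 0)

clamp_nonneg(c(-0.4, 0, 2.5, NA))
```

In the question's pipeline this could wrap the interpolation call, for example `pmax(spline(x = depth, y = ., xout = depth)[["y"]], 0)`.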
