Conditional merging between time values of 2 dataframe in R

Question

I have 2 data frames with different structures. The first contains data from a continuous, repeated analysis over a few samples (multiple rows, each with the time and value of a single measurement); the second reports the sample ID and the start and end time of each measurement.

##example
df.analysis <- data.frame(var= rnorm(321,mean=50),
                  time= seq(strptime("2018-1-1 0:0:0","%Y-%m-%d %H:%M:%S"), strptime("2018-1-1 8:0:0","%Y-%m-%d %H:%M:%S"), by= 90))

df.sample <- data.frame(sample= rep_len(1:8, 30),
                  start=seq(strptime("2018-1-1 0:0:0","%Y-%m-%d %H:%M:%S"), strptime("2018-1-1 7:45:0","%Y-%m-%d %H:%M:%S"),length.out=30),
                  end=seq(strptime("2018-1-1 0:15:0","%Y-%m-%d %H:%M:%S"), strptime("2018-1-1 8:0:0","%Y-%m-%d %H:%M:%S"),length.out=30))

I need to add the sample ID corresponding to each measured value, keeping in mind that not all measurements correspond to a sample. I tried the following code, but it doesn't work because it compares each row of the first data frame with the corresponding row of the second data frame, whereas I need every row of the first data frame to be compared with all the rows of the second.

if (df.analysis$time > df.sample[,"start"] & df.analysis$time < df.sample[,"end"]) {
  df.analysis$sample <-  df.sample$sample
  }

I thought of using a for loop or lapply, but I can't make them work properly.

akrun · Accepted Answer

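We can use a non-equi join from data.table, which matches each row of df.analysis against every start/end interval in df.sample: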

library(data.table)
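# Rows of df.analysis whose time falls strictly between start and end get that sample ID;
# rows that match no interval are left as NA
setDT(df.analysis)[df.sample, sample := sample, on = .(time > start, time < end)]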
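
For readers who want to see what the interval matching does without data.table, here is a minimal base R sketch. It is not part of the accepted answer: the toy data frames analysis and samples and the helper match_one are hypothetical names made up for illustration, and the sketch assumes the sample intervals do not overlap (if they did, only the first matching interval would be used).

```r
# Toy data (hypothetical values, not the asker's data)
origin <- as.POSIXct("2018-01-01 00:00:00", tz = "UTC")
analysis <- data.frame(time = origin + c(60, 600, 1000, 1500, 2500))
samples <- data.frame(
  sample = c(1L, 2L),
  start  = origin + c(0, 1200),
  end    = origin + c(900, 2100)
)

# Hypothetical helper: compare ONE measurement time against ALL intervals
# and return the matching sample ID, or NA when no interval contains it
match_one <- function(t) {
  hit <- which(samples$start < t & t < samples$end)
  if (length(hit) == 0L) NA_integer_ else samples$sample[hit[1L]]
}

# Apply the helper to every row of the measurement table
analysis$sample <- vapply(
  seq_along(analysis$time),
  function(i) match_one(analysis$time[i]),
  integer(1)
)

print(analysis$sample)
```

With these toy values the first two measurements fall in sample 1, the fourth in sample 2, and the other two match no interval and stay NA. This row-by-row approach is easy to follow but checks every measurement against every interval, so the non-equi join is likely the better choice for large tables.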
