Find overlap of multiple ranges in data.table

Question

I would like to find the overlapping part of multiple ranges which are given rowise in a data.table object.

An example would be:

t <- data.table(a = c(3,4,5), b = c(13,12,19))

So we have the ranges:

3 - 13, 4 - 12, 5 - 19

Hence the overlapping range would be:

5 - 12

In case of an additional range 19 - 22 the overlap should return NA - NA or 0 - 0 since there is no overlap.

I found solutions for similar problems like spatstat.utils:: intersect.ranges(). However this works only on two vectors and is hard to implement in a data.table

    DT[,.(o.l = function()[1], o.r = function()[2], by=.()]

manner which I would really like to do if possible,..

As output for this example I would like to have:

t <- data.table(a = c(3,4,5), b = c(13,12,19), o.l = c(5,5,5), o.r = c(12,12,12))

talat · Accepted Answer

Here's a one-line example:

library(data.table)

dt = data.table(a = c(3,4,5), b = c(13,12,19))

dt[, c("o.l", "o.r") := as.list(range(Reduce(intersect, mapply(seq, a, b, 1))))]

dt
#    a  b o.l o.r
# 1: 3 13   5  12
# 2: 4 12   5  12
# 3: 5 19   5  12

Where the core of the problem is

dt = data.table(a = c(3,4,5), b = c(13,12,19))
dt[, Reduce(intersect, mapply(seq, a, b, 1))]
# [1]  5  6  7  8  9 10 11 12

Find overlap of multiple ranges in data.table

Answers (2)

Related Questions