Pulling Multiple Ranges from Dataframe Pandas

Question

Lets say I have the following data set:

I have a the following list of ranges.

[[9,15],[19,22],[39,50]]

How do I efficiently pull rows that lie in those ranges?

Wanted Output

Edit: Needs to work for floating points

jpp · Accepted Answer

Update for modified question

For floats, you can construct a mask using NumPy array operations:

L = np.array([[9,15],[19,22],[39,50]])
A = df['A'].values

mask = ((A >= L[:, 0][:, None]) & (A <= L[:, 1][:, None])).any(0)

res = df[mask]

print(res)

      A   B
0  10.1  53
1  12.5  42
3  20.7   3
6  40.9  19

Previous answer to original question

For integers, you can use numpy.concatenate with numpy.arange:

L = [[9,15],[19,22],[39,50]]

vals = np.concatenate([np.arange(i, j) for i, j in L])

res = df[df['A'].isin(vals)]

print(res)

    A   B
0  10  53
1  12  42
3  20   3
6  40  19

An alternative solution with itertools.chain and range:

from itertools import chain

vals = set(chain.from_iterable(range(i, j) for i, j in L))

res = df[df['A'].isin(vals)]

Pulling Multiple Ranges from Dataframe Pandas

Answers (2)

Related Questions