Fenwick trees to determine which interval a point falls in

Question

Let a₀,...,a_n-1 be a sequence of lengths. We can construct intervals [0,a₀], (a₁,a₂+a₁],(a₂+a₁,a₃+a₂+a₁],... I store the sequence a₁,...,a_n-1 in a Fenwick tree.

I ask the question: given a number m, how can I efficiently (log n time) find into which interval m falls?

For example, given the a: 3, 5, 2, 7, 9, 4.

The Fenwick Tree stores 3, 8, 2, 17, 9, 13.

The intervals are [0,3],(3,8],(8,10],(10,17],(17,26],(26,30].

Given the number 9, the algorithm should return the 3rd index of the Fenwick Tree (2 if 0-based arrays are used, 3 if 1-based arrays are used). Given the number 26, the algorithm should return the 5th index of the Fenwick Tree (4 if 0-based arrays are used or 5 if 1-based arrays are used).

Possibly another data structure might be more suited to this operation. I am using Fenwick Trees because of their seeming simplicity and efficiency.

David Eisenstat · Accepted Answer

We can get an O(log n)-time search operation. The trick is to integrate the binary search with the prefix sum operation.

def get_total(tree, i):
    total = 0
    while i > 0:
        total += tree[i - 1]
        i -= i & (-i)
    return total


def search(tree, total):
    j = 1
    while j < len(tree):
        j <<= 1
    j >>= 1
    i = -1
    while j > 0:
        if i + j < len(tree) and total > tree[i + j]:
            total -= tree[i + j]
            i += j
        j >>= 1
    return i + 1


tree = [3, 8, 2, 17, 9, 13]
print('Intervals')
for i in range(len(tree)):
    print(get_total(tree, i), get_total(tree, i + 1))
print('Searches')
for total in range(31):
    print(total, search(tree, total))

Output is

Fenwick trees to determine which interval a point falls in

Answers (2)

Related Questions