Nested loops causing reduced complexity efficency?

Question

I've written a simple function that removes elements from a vector (V2), based upon the values of another vector (V1):

std::vector V1={6,2,3,4};
std::vector V2={9,4,8,6,7};

for(int i=0; i



My challenge is that the above needs to be O(n) complexity. Currently this is O(n*m), n being V1, m being V2.

N.B. Arrays are not and cannot be sorted as the elements original index values are required.

Questions:


Am I right in saying 'V2.erase' is stopping this function from being O(n)? (Because its a nested iteration within the for loop).
Is there a way around this, by performing the erase operation outside the loop?

Nik Bougalis · Accepted Answer

Why not use std::set_difference:

std::vector test(
    std::vector v1,
    std::vector& v2)
{
    // The algorithm we use requires the ranges to be sorted:
    std::sort (v1.begin(), v1.end());
    std::sort (v2.begin(), v2.end());

    // our output vector: reserve space to avoid copying:
    std::vector v3;
    v3.reserve (v2.size());

    // Use std::set_difference to copy the elements from
    // v2 that are not in v1 into v3:
    std::set_difference (
        v2.begin(), v2.end(),
        v1.begin(), v1.end(),
        std::back_inserter(v3));

    return v3;
}

If v1.size() == n and v2.size() == m the runtime of this works out to roughly:

OK, so then how about this:

void test2(
    std::vector v1,
    std::vector v2)
{
    // We can still sort this without affecting the indices
    // in v2:
    std::sort (v1.begin(), v1.end());

    // Replace all the elements in v1 which appear in v2
    // with -1:
    std::replace_if (v2.begin(), v2.end(),
        [&v1] (int v)
        {
            return std::binary_search(v1.begin(), v1.end(), v);
        }, -1);
}

Not linear; estimating the complexity is left as an exercise for the OP.

A third alternative is this:

void test3(
    std::vector v1,
    std::vector& v2)
{
    // We can still sort this without affecting the indices
    // in v2:
    std::sort (v1.begin(), v1.end());

    auto ret = std::stable_partition (
        v2.begin(), v2.end(),
        [&v1] (int v)
        {
            return !std::binary_search(v1.begin(), v1.end(), v);
        });

    v2.erase (ret, v2.end());
}

Again, not linear, but options...

Nested loops causing reduced complexity efficency?

Answers (2)

Related Questions