Vectorizing a pareto front algorithm

Question

First of all, here's my setup:

x is an n x 1 vector containing the values of a first cost function.
y is another n x 1 vector containing the values of a second cost function.
a is an m x 1 vector containing the indices of x and y to be checked, this is used to selectively exclude values from the algorithm. Unless needed, this can be replaced with 1:n.
It's safe to assume all combinations (x,y) are unique.

The task is to find the pareto optimal set of value pairs (x,y), that is, all pairs that are not dominated. A pair is called dominated if there exists another pair (u,v) such that u <= x && v <= y and one of these comparisons is strict: u < x || v < y. In other words, a pair is dominated if another pair is better in one value and not worse in the other.

My research up to this point has yielded three working algorithms, that are unfortunately all reliant on loops. Here's how they work and what time I got for running them with x, y and a of length 1e8:

Sort x in ascending order. Add the first pair to the pareto set.

Loop through x. Add each pair to the pareto set where y is lower than the previous pareto pair's y.

Elapsed time is 80.204052 seconds.

Find min(x). Add that pair to the pareto set.

Select all pairs where y is lower than the previously added pair's y.

Go to step 1 unless step 2 resulted in the empty set.

Elapsed time is 2.993350 seconds.

Loop through all pairs (x,y).

Remove all pairs (u,v) with x >= u && y >= v.

Elapsed time is 105.924814 seconds.

Now what I'm attempting to do is create a vectorized algorithm. It doesn't have to be based on one of the above, but I wasn't able to find any other working algorithm. The best I could do is this:

ap = a(y < min(y(x == min(x))) | x < min(x(y == min(y))));

which does usually find all pareto optimal pairs, but includes all pairs not dominated by the pair at min(x) or min(y), even if one dominates another. I say usually because it fails entirely if there is only one globally optimal pair that dominates every other pair. Replacing the < with <= cures the second problem, but finds even more dominated pairs (those that have only one worse value). I also ran this through the same timer as above:

Elapsed time is 0.800385 seconds.

Here's a test script I'm using to check how the algorithm does, feel free to use it

for i=1:25
    x = randi(8,10,1);
    y = randi(8,10,1);
    a = 1:10;
    ap = a(y < min(y(x == min(x))) | x < min(x(y == min(y)))); %// algorithm here
    figure(1);
    subplot(5,5,i);
    plot(a,x,'b',a,y,'r',ap,x(ap),'b.',ap,y(ap),'r.','MarkerSize',20);
    axis([0,11,0,9]);
    set(gca,'XGrid','on','YGrid','on','XTick',1:10,'YTick',0:8);
    figure(2);
    subplot(5,5,i);
    plot(x,y,'b.',x(ap),y(ap),'ro','MarkerSize',10);
    axis([0,9,0,9]);
end

Vectorizing a pareto front algorithm

Answers (1)

Related Questions