juju
juju

Reputation: 1239

compare elements of two lists

A = [a1, a2, a3...]  #a1<a2<a3...
B = [b1, b2...]  #b1<b2<b3...

A and B are disjoint. are I do not know the number of elements and the value of them in A/B in advance. I want to compare the value of the elements in both list and delete elements iff:

delete a[i+1] if there is no b[j] such that a[i]<b[j]<a[i+1]
delete b[i+1] if there is no a[j] such that b[i]<a[j]<b[i+1]

At the end, I want to separate list, not a combination of A and B.

For example, If A[0] < B[0], A = [1, 10, 40], B = [15, 30]. Compare A[1] and B[0] first. Delete 10 because no element in B are in between 1 and 15. Then delete 15 since no element exist anymore btw 15 and 30. The output should be: if you try to order the elements of the new 2 lists, it should be A[0]<B[0]<A[1]<B[1]<...

If A[0] > B[0], vice versa.

Upvotes: 3

Views: 1537

Answers (4)

Emmanuel
Emmanuel

Reputation: 14209

I knew using the bisect module could be a good solution:

>>> def sort_relative(L1, L2):
    # Switch if needed
    if L1[0] > L2[0]:
        L1, L2 = L2, L1
    i = 0
    while i + 1 < max(len(L1), len(L2)):
        try:
            # We know that L1[i] < L2[i]
            # Get indexes where L2[i] and L2[i + 1] should be inserted
            i11 = bisect.bisect_left(L1, L2[i])
            i12 = bisect.bisect_left(L1, L2[i + 1])

            # This condition allows to know if one element of L1
            # was found between L2[i] and L2[i + 1]:
            # - if so, we have L1[i] < L2[i] < L1[i + 1] < L2[i + 1]
            # - else we have L1[i] < L2[i] < L1[i + 1] but
            # we don't know between L1[i + 1] and L2[i + 1]
            if L1[i11] < L2[i + 1]:
                L1 = L1[:i + 1] + [L1[i11]] + L1[i12:]
                index1, index2 = i + 1, i + 2
            else:
                L1 = L1[:i + 1] + L1[i12:]
                index1, index2 = i, i + 1

            # Do the same kind of symetric search,
            # with indexes computed above
            i21 = bisect.bisect_left(L2, L1[index1])
            i22 = bisect.bisect_left(L2, L1[index2])
            if L2[i21] < L1[index2]:
                L2 = L2[:index1] + [L2[i21]] + L2[i22:]
            else:
                L2 = L2[:index1] + L2[i22:]
        # Little trick not to test indexes everywhere:
        # lists are browsed at the same time
        except IndexError:
            pass

        # Next index !
        i += 1

    # Show result
    print 'L1:', L1, '- L2:', L2


>>> sort_relative([1, 10, 50], [15, 30])
L1: [1, 50] - L2: [15]
>>> sort_relative([17, 18, 50], [15, 30])
L1: [15, 30] - L2: [17, 50]
>>> sort_relative([1, 10, 12, 25, 27, 50], [15, 30, 70])
L1: [1, 25, 50] - L2: [15, 30, 70]
>>> sort_relative([1, 10, 12, 25, 27, 50], [15, 30, 34, 70])
L1: [1, 25, 50] - L2: [15, 30, 70]
>>> 

I didn't take into account the case when a number is both in A and B.

Upvotes: 0

selllikesybok
selllikesybok

Reputation: 1225

So, if I am reading correctly, the desired output from your example is [1,40] and [15], yes?

If so, the following would get the right result, but I am certain there is a tighter way to do it.

a = [1, 10, 40]
b = [15, 30]
c = sorted([[e_a,'a'] for e_a in a] + [[e_b,'b'] for e_b in b])
indices = []

for i in range(len(c)-1):
    if c[i][1] == c[i+1][1]:
        indices.append(i+1)

for e in sorted(indices, reverse=True):
    del c[e]

a,b = [e[0] for e in c if e[1]=='a'],[e[0] for e in c if e[1]=='b']

First - Merging the lists and sorting them, while keeping track of which list they came from.

Second - Then remove all instances where the next item in the merged list is from the same source list.

Third - updating a and b.

Upvotes: 0

eumiro
eumiro

Reputation: 212885

a = [1, 10, 40]
b = [15, 30]

srcs = [a, b]
dsts = [[], []]
prev_which = -1
while all(srcs):
    which = int(srcs[0][0] > srcs[1][0])
    elem = srcs[which].pop(0)
    if prev_which != which:
        dsts[which].append(elem)
    prev_which = which
for src, dst in zip(srcs,dsts):
    if src:
        dst.append(src.pop(0))
a, b = dsts

returns:

a = [1, 40]
b = [15]

and for

a = [3, 4, 6, 7, 8, 9]
b = [1, 2, 5, 10]

it returns [3, 6] and [1, 5, 10].

EDIT: another possibility:

import itertools as it
import operator as op

a = [3, 4, 6, 7, 8, 9]
b = [1, 2, 5, 10]
srcs = [a, b]
dsts = [[], []]

for which, elems in it.groupby(sorted((x, i) for i in (0,1) for x in srcs[i]), key=op.itemgetter(1)):
    dsts[which].append(next(elems)[0])
a, b = dsts

Upvotes: 1

jadkik94
jadkik94

Reputation: 7068

I came up with this before you edit. But it seems the output is not what you expect. Anyway, it might help you get on the right track:

a = range(0, 30, 3)
b = range(0, 20, 2)

a.sort()
b.sort()

A = [a[i+1] for i in range(len(a)-1) if any(a[i]<b[j]<a[i+1] for j in range(len(b)-1))]
B = [b[i+1] for i in range(len(b)-1) if any(b[i]<a[j]<b[i+1] for j in range(len(a)-1))]

result = sorted(A+B)

print a, b
print result

This is "literally" what you expressed, but the result here is not what you expect. I'll try to improve this.

Upvotes: 1

Related Questions