Merge ordered Lists in Python without prior knowledge about the ordering function

Question

I am having a hard time to solve the following issue: I have to merge N lists. Each list contains some string objects. For each list, although I do not know which is the ordering function, I know that it is ordered. Moreover, the final list should respect all the ordering of the child that generated it. For instance:

l1 = ['This','world']
l2 = ['This','is','a','world','!']
l3 = ['a','hello','world']

merged_list = merge_function(l1,l2,l3)

The results I would like to achieve is to receive a list containing

merged_list # ['This','is','a','hello','world','!']

But I cannot figure out the way to do it, as the lists are not following any rule beside the order in which the elements are provided. Any help would be appreciated.

EDIT: The focus of my question is not on how to merge some lists. The problem is that the elements should be merged in a way that they respect all the ordering of the original lists. So I cannot use sets as they have their own ordering policy. I think I have to stick to lists because they are more flexible, but I need to find a way to insert new elements in the correct position within the final list, as I cannot simply append them.

For instance:

l1 = ['This','world']
l2 = ['This','is','a','world','!']
merged_list = l1

If I then simply append the missing element to the merged_list I would obtain:

merged_list # ['This','world','is','a','!']

and this list breaks the ordering of l2. I hope now I explained the issue a bit better.

Olivier Melan&#231;on · Accepted Answer

The main difficulty with your problem is if we have a case like this:

l1 = ['a', 'world']
l2 = ['This', 'is']
l3 = ['is', 'a']

In that case the expected result would be ['This', 'is', 'a', 'world']. Although, the relation 'This' < 'world' must be obtained by transitivity.

Here is a solution which determines your ordering by generating the full table of predecessors using a fixed point algorithm.

def merge_function(*lists):
    predecessors = {w: set() for w in set(w for lst in lists for w in lst)}

    # Add trivial predecessors from the lists
    # For example in ['foo', 'bar', 'baz']:
    # we know 'foo' is a predecessor of 'baz'
    for l in lists:
        tail = l[::-1]
        for word in l:
            tail.pop()
            for successor in tail:
                predecessors[successor].add(word)

    # Use transitive property to update predecessor
    # This is the fixed point part of the algorithm
    change_occured = True
    while change_occured:
        change_occured = False
        for word, word_predecessors in predecessors.items():
            for predecessors_set in predecessors.values():
                if word in predecessors_set and any(w not in predecessors_set for w in word_predecessors):
                    change_occured = True
                    predecessors_set.update(word_predecessors)

    return sorted(predecessors, key=lambda w: predecessors[w])

l1 = ['a', 'world']
l2 = ['This', 'is']
l3 = ['is', 'a']

merged_list = merge_function(l1,l2,l3)

print(merged_list) # ['This', 'is', 'a', 'world']

Merge ordered Lists in Python without prior knowledge about the ordering function

Answers (2)

Related Questions