Filtering a dictionary based on the first occurence of the value elements

Question

So I have this tricky dictionary of tuples which I want to filter based on the first occurrence of the informative flag in the value elements. If the flag (which is the element occupying the first position of the tuple) is observed in other keys I will only retain only the first key-value pair in which it occurs and subsequent key-value pairs which contain the flag would be skipped.

old_dict = {'abc':[('abc', '1', '5'), ('def', '1', '5'), ('abcd', '2', '5')],
            'def':[('abc', '2', '5'), ('def', '1', '5'), ('abcd', '1', '5')],
            'ghi':[('ghi', '1', '5'), ('jkl', '1', '4'), ('mno', '2', '4')]}

I have struggled with a lot of attempts and this latest attempt does not produce anything meaningful.

flgset = set()
new_dict = {}

for elem, tp in old_dict.items():
       for flg in tp:
              flgset.add(flg[0])

counter = 0
for elem, tp in old_dict.items():
       for (item1, item2, item3) in tp:
              for flg in flgset:
                     if flg == item1:
                            counter = 1
                            new_dict[elem] = [(item1, item2, item3)]
                            break

Expected results should be:

new_dict = {'abc':[('abc', '1', '5'), ('def', '1', '5'), ('abcd', '2', '5')],
            'ghi':[('ghi', '1', '5'), ('jkl', '1', '4'), ('mno', '2', '4')]}

Thanks in advance.

Toukenize · Accepted Answer

If i get you correctly, the following should do what you want:

flgset = set()
new_dict = {}

for k, tuple_list in old_dict.items():

    # if the key is not in flgset, just keep the k, tuple_list pair
    if k not in flgset:
        new_dict[k] = tuple_list

        # update the elements into flgset
        # item in this case is ('abc', '2', '5'), 
        # since you only want to add the first element, use item[0]
        for item in tuple_list:
            flgset.add(item[0])

Output as such:

new_dict = {'abc': [('abc', '1', '5'), ('def', '1', '5'), ('abcd', '2', '5')],
            'ghi': [('ghi', '1', '5'), ('jkl', '1', '4'), ('mno', '2', '4')]}

flgset = {'abc', 'abcd', 'def', 'ghi', 'jkl', 'mno'}

Filtering a dictionary based on the first occurence of the value elements

Answers (2)

Related Questions