re-implement __eq__ to compare sets with symmetric_difference in python

Question

I have a set of filenames coming from two different directories.

currList=set(['pathA/file1', 'pathA/file2', 'pathB/file3', etc.])

My code is processing the files, and need to change currList by comparing it to its content at the former iteration, say processLst. For that, I compute a symmetric difference:

toProcess=set(currList).symmetric_difference(set(processList))

Actually, I need the symmetric_difference to operate on the basename (file1...) not on the complete filename (pathA/file1).

I guess I need to reimplement the __eq__ operator, but I have no clue how to do that in python.

is reimplementing __eq__ the right approach? or
is there another better/equivalent approach?

Zarkonnen · Accepted Answer

You can do this with the magic of generator expressions.

def basename(x):
    return x.split("/")[-1]

result = set(x for x in set(currList).union(set(processList)) if (basename(x) in [basename(y) for y in currList]) != (basename(x) in [basename(y) for y in processList]))

should do the trick. It gives you all the elements X that appear in one list or the other, and whose basename-presence in the two lists is not the same.

Edit: Running this with:

currList=set(['pathA/file1', 'pathA/file2', 'pathB/file3'])
processList=set(['pathA/file1', 'pathA/file9', 'pathA/file3'])

returns:

set(['pathA/file2', 'pathA/file9'])

which would appear to be correct.

re-implement eq to compare sets with symmetric_difference in python

Answers (2)

Related Questions