Reputation: 362
I am using Python 3.5.1 on 64 bit windows. My problem is Python seems to ignore __eq__
and __lt__
operators on a user-defined class when sorting. Using a custom sorting key is a workaround that doesn't work when trying to sort tuples containing instances of this class.
Example:
class Symbol:
def __init__(self, name, is_terminal = False):
self.name = name
self.is_terminal = is_terminal
def __eq__(self, other):
return (self.is_terminal, self.name) == (other.is_terminal, other.name)
def __lt__(self, other):
return (self.is_terminal, self.name) < (other.is_terminal, other.name)
symbols = set()
for s in "abcdef":
symbols.add(Symbol(s))
sorted_symbols = sorted(symbols)
# sorted_symbols now contain the symbols in random order
Using the functools.total_ordering
decorator does not help.
My question is how do I define ordering for a user class in Python 3?
Upvotes: 2
Views: 2339
Reputation: 1121654
Python does not ignore __eq__
and __lt__
, at least not if you actually used @functools.total_ordering
:
>>> from functools import total_ordering
>>> @total_ordering
... class Symbol:
... def __init__(self, name, is_terminal=False):
... self.name = name
... self.is_terminal = is_terminal
... def __repr__(self):
... return 'Symbol({0.name!r}, is_terminal={0.is_terminal!r})'.format(self)
... def __hash__(self):
... return hash(self.name) ^ hash(self.is_terminal)
... def __eq__(self, other):
... print('{} __eq__ {}'.format(self, other))
... return (self.is_terminal, self.name) == (other.is_terminal, other.name)
... def __lt__(self, other):
... print('{} __lt__ {}'.format(self, other))
... return (self.is_terminal, self.name) < (other.is_terminal, other.name)
...
>>> symbols = set()
>>> for s in "abcdef":
... symbols.add(Symbol(s))
...
>>> sorted(symbols)
Symbol('f', is_terminal=False) __lt__ Symbol('c', is_terminal=False)
Symbol('a', is_terminal=False) __lt__ Symbol('f', is_terminal=False)
Symbol('a', is_terminal=False) __lt__ Symbol('f', is_terminal=False)
Symbol('a', is_terminal=False) __lt__ Symbol('c', is_terminal=False)
Symbol('b', is_terminal=False) __lt__ Symbol('c', is_terminal=False)
Symbol('b', is_terminal=False) __lt__ Symbol('a', is_terminal=False)
Symbol('d', is_terminal=False) __lt__ Symbol('c', is_terminal=False)
Symbol('d', is_terminal=False) __lt__ Symbol('f', is_terminal=False)
Symbol('e', is_terminal=False) __lt__ Symbol('c', is_terminal=False)
Symbol('e', is_terminal=False) __lt__ Symbol('f', is_terminal=False)
Symbol('e', is_terminal=False) __lt__ Symbol('d', is_terminal=False)
[Symbol('a', is_terminal=False), Symbol('b', is_terminal=False), Symbol('c', is_terminal=False), Symbol('d', is_terminal=False), Symbol('e', is_terminal=False), Symbol('f', is_terminal=False)]
In fact, sorting even works without @total_ordering
because the TimSort implementation only uses __lt__
; this is explicitly document:
This method sorts the list in place, using only < comparisons between items.
A sort key is also an option, just return the (is_terminal, name)
tuple from the key:
>>> sorted(symbols, key=lambda s: (s.is_terminal, s.name))
[Symbol('a', is_terminal=False), Symbol('b', is_terminal=False), Symbol('c', is_terminal=False), Symbol('d', is_terminal=False), Symbol('e', is_terminal=False), Symbol('f', is_terminal=False)]
Note that now the __lt__
method is never called, because the sort key is used instead.
Upvotes: 4