Is it overall more efficient to convert list to set when checking if a value is in the set

Question

I was reading up on some best practices for speed in python and found this which says:

Membership testing with sets and dictionaries is much faster, O(1), than searching sequences, O(n).

When testing "a in b", b should be a set or dictionary instead of a list or tuple.

But if say i have a list long_list and I want to find out if item list_item is in long_list like:

list_item in long_list

Would it under any circumstance be faster to do:

list_item in Set(long_list)

Seeing as I think list to set or dict conversion on average should be O(n) in itself. (?)

Or is it always better to just go with whichever data-type I'm already working with?

0x5453 · Accepted Answer

If you are going to be doing multiple lookups on long_list, it is worth it. Otherwise, it is not.

$ python3 -m timeit -s 'x = list(range(10000))' '1234 in x'
100000 loops, best of 3: 5.71 usec per loop

$ python3 -m timeit -s 'x = list(range(10000))' '1234 in set(x)'
10000 loops, best of 3: 61.4 usec per loop

$ python3 -m timeit -s 'x = set(list(range(10000)))' '1234 in x'
10000000 loops, best of 3: 0.0198 usec per loop

Is it overall more efficient to convert list to set when checking if a value is in the set

Answers (2)

Related Questions