user2668284

Copying to pre-allocated list slower than append which seems counterintuitive

This question has nothing to do with how to copy one list to another. It's about performance. Consider these two functions:

def funcA(data):
    A = []
    for d in data:
        A.append(d)
    return A


def funcB(data):
    A = [None] * len(data)
    i = 0
    for d in data:
        A[i] = d
        i += 1
    return A

funcA represents what I suppose one might call the classic way to make a shallow copy (ignoring the copy module for the sake of this discussion): the new list holds references to the same elements. It's easy to understand and performs extremely well. However, the final size of list A is unknown up front, so its storage has to grow dynamically as elements are appended to it.

Now consider funcB. It's more complex, but list A is pre-allocated and items are assigned by index. I would expect this to be faster because it avoids any memory allocation/reallocation that could otherwise occur during appending.

However, I have determined empirically that funcB runs ~30% slower than funcA.

I'm interested to know if anyone has thoughts on why this might be. Below is the driver code I used:

from datetime import datetime

M = 1_000
ARRAY_SIZE = 100_000
DATA = [0] * ARRAY_SIZE

for func in [funcA, funcB]:
    _s = datetime.now()
    for _ in range(M):
        func(DATA)
    _e = datetime.now()
    print(f'{_e-_s}')

Typical timings are 4.2 seconds for funcA and 5.4 seconds for funcB. This is obviously a CPU-bound activity and therefore results will vary considerably across platforms.
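For reference, the same comparison can be written with the timeit module, which handles the repetition and avoids the manual datetime bookkeeping; this is a sketch of an alternative driver, not the one used for the numbers above:

```python
import timeit

ARRAY_SIZE = 100_000
DATA = [0] * ARRAY_SIZE

def funcA(data):
    A = []
    for d in data:
        A.append(d)
    return A

def funcB(data):
    A = [None] * len(data)
    i = 0
    for d in data:
        A[i] = d
        i += 1
    return A

# number=10 keeps the sketch quick; raise it for steadier timings.
for func in [funcA, funcB]:
    t = timeit.timeit(lambda: func(DATA), number=10)
    print(f'{func.__name__}: {t:.3f}s')
```

Absolute numbers will differ from the datetime-based driver, but the relative gap between funcA and funcB should be similar.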

Upvotes: 1

Views: 105

Answers (1)

Karl Knechtel

Reputation: 61557

Timings on my system, running the reference implementation of Python 3.8, look like:

funcA: 6.0 seconds

funcB: 7.2 seconds

Optimizing funcA by caching the bound-method lookup, thus:

def funcD(data):
    A = []
    append = A.append
    for d in data:
        append(d)
    return A

4.3 seconds

Using a list comprehension: 2.5 seconds

Using any kind of built-in copy (data.copy(), data[:], list(data)): 0.9 seconds
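Sketches of the two faster variants timed above (the names funcC and funcE are assumed here, not from the answer):

```python
def funcC(data):
    # List comprehension: the append loop runs in specialized bytecode.
    return [d for d in data]

def funcE(data):
    # Built-in copy: the pointer copying happens entirely in C.
    # data[:] and list(data) behave equivalently for this purpose.
    return data.copy()

sample = list(range(5))
print(funcC(sample), funcE(sample))
```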

It seems pretty clear to me that the bottleneck is in the interpretation of bytecodes.

Your funcB, while it avoids resizing the internal storage, needs to fill the memory with copies of the pointer to the None object (whereas when the underlying storage grows normally, the unused portion of the memory can be left uninitialized). A function that does only the [None] * len(data) step already takes longer than the built-in copies (under the hood it can be optimized with memset() or equivalent, but then the built-ins can use memcpy() or equivalent). funcB also has to maintain the i variable and use it for indexing, interpreting bytecodes for each of those steps on every iteration.

Meanwhile, the list type uses the same exponential reallocation strategy seen in std::vector in C++. Although funcA needs to reallocate memory, it only needs to do so O(lg N) times, copying O(N) pointers in total between the allocations.
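You can observe the over-allocation indirectly with sys.getsizeof, which reports the list's allocated size in bytes; the size jumps only at a handful of lengths rather than on every append. (The exact growth pattern is a CPython implementation detail and varies between versions.)

```python
import sys

A = []
growth_points = []
for i in range(100):
    before = sys.getsizeof(A)
    A.append(i)
    after = sys.getsizeof(A)
    if after != before:
        # A reallocation happened on this append.
        growth_points.append((len(A), after))

# Far fewer reallocations than appends.
print(growth_points)
```

Over 100 appends, only about a dozen reallocations occur, which is why the amortized cost per append stays O(1).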

Upvotes: 2
