Efficiently Reshape 3D Numpy Array into 1D List Plus Coordinate Vector

Question

I've got a large nested array a (256x256x256) that I need to restructure into a list with elements like this:

[ (i,j,k), a[i,j,k] ]

I'm currently doing this as follows:

aflat = a.flatten().tolist()
coords = list(itertools.product(range(256), repeat=3))
thelist = [list(x) for x in zip(coords, aflat)]

This works, but it's rather slow.

I can probably save a second or so by removing the runtime generation of the coordinate vectors and reading those from a file instead. However, the main slowdown seems to be in the last line, which is clocking in at over 6 seconds.

Is there a faster way to generate the data structure I need in Python?

sfstewman · Accepted Answer

As @P-i commented, the major problem is that the code creates a ton of lists, and Python spends a lot of time doing memory management. To eliminate this, you can use numpy arrays to preallocate the data, and use its repeat and tile functions to generate the i,j,k values:

# order='F' is important here so column-wise assignment can
# occur with a stride of 1.  Switching the order results
# in a significant performance hit.
coords = numpy.zeros([a.size,4],'d',order='F')

NI, NJ, NK = a.shape

# build columns for (i,j,k) tuples using repeat and tile
coords[:,0] = numpy.repeat(range(NI),NJ*NK)
coords[:,1] = numpy.tile(numpy.repeat(range(NJ),NK), NI)
coords[:,2] = numpy.tile(range(NK), NI*NJ)
coords[:,3] = a.flatten()

This results in an array where each row is (i,j,k,value). It does assume that your original array is in row-major ordering (C-ordered arrays in numpy).

In my timings, based on ten iterations in Python 3.5 on a 2013 MacBook Pro, it took about 20 seconds per transformation to run the OP's transformation and only about 8 seconds per transformation using this method.

The output format really has to be a list, the array can be converted into a list in the final step. However, this increased the transformation time to 13 seconds per transformation in my testing.

Efficiently Reshape 3D Numpy Array into 1D List Plus Coordinate Vector

Answers (2)

Related Questions