Numpy - Stacked memory view of two 1D arrays

Question

I know that I can do the following:

import numpy as np
c = np.random.randn(20, 2)
a = c[:, 0]
b = c[:, 1]

Here, a and b are views of c's first and second columns, respectively. Modifying a or b changes c, and vice versa.
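As a quick check of that behaviour, the sketch below writes through each name and uses np.shares_memory to confirm the overlap. The specific values written (100.0 and -5.0) are arbitrary and only there for illustration.

```python
import numpy as np

c = np.random.randn(20, 2)
a = c[:, 0]  # basic slicing returns a view, not a copy
b = c[:, 1]

# Writing through the view changes the parent array...
a[0] = 100.0
# ...and writing to the parent is visible through the view.
c[1, 1] = -5.0

shares_a = np.shares_memory(a, c)
shares_b = np.shares_memory(b, c)
print(c[0, 0], b[1], shares_a, shares_b)
```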

However, what I want to achieve is exactly the opposite. I want to create a 2D memory view in which each column (or row) points to the memory of a different 1D array. Assuming I already have two 1D arrays, is it possible to create a 2D view of them in which each row or column points to one of them?

I can create c from a and b in the following way:

c = np.c_[a, b]

However, this copies a's and b's memory into c. Can I somehow create c as a 'view' of [a b], so that modifying an element of c is reflected in the respective 1D array a or b?
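The copy can be confirmed directly. In this small sketch the arrays a and b are made-up example data; after building c with np.c_, a write to c leaves a untouched and np.shares_memory reports no overlap.

```python
import numpy as np

# Two independent 1D arrays (example data, not from the question).
a = np.arange(5.0)
b = np.arange(5.0, 10.0)

c = np.c_[a, b]  # concatenates into a new (5, 2) array

c[0, 0] = -1.0   # modify the stacked array only

print(a[0])                     # a is unchanged
print(np.shares_memory(c, a))   # c has its own buffer
```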

hpaulj · Accepted Answer

I don't think it is possible.

In your first example, the values of the a and b views are interwoven, as can be seen from this variation:

In [51]: c=np.arange(10).reshape(5,2)
In [52]: a, b = c[:,0], c[:,1]
In [53]: a
Out[53]: array([0, 2, 4, 6, 8])
In [54]: c.flatten()
Out[54]: array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])

The data buffers of c and a start at the same memory address; b starts 4 bytes into that buffer, i.e. one element further along (the integers here are 4 bytes wide).

In [55]: c.__array_interface__
Out[55]: 
{'strides': None,
 'data': (172552624, False),...}

In [56]: a.__array_interface__
Out[56]: 
{'strides': (8,),
 'data': (172552624, False),...}

In [57]: b.__array_interface__
Out[57]: 
{'strides': (8,),
 'data': (172552628, False),...}

Even if the a,b split were by rows, b would start just further along in the same shared data buffer.

The .flags attribute shows that c is C-contiguous while b is not. Even so, b's values are accessed with a constant stride within that shared data buffer.
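The same relationships can be checked without reading raw addresses by eye. The sketch below expresses everything in terms of c.itemsize, since the 4-byte offset and 8-byte stride shown above depend on the platform's default integer size (on many 64-bit systems they would be 8 and 16). The helper name addr is my own, not a NumPy function.

```python
import numpy as np

c = np.arange(10).reshape(5, 2)
a, b = c[:, 0], c[:, 1]

def addr(x):
    # Address of the first element, taken from the array interface dict.
    return x.__array_interface__['data'][0]

itemsize = c.itemsize
offset_b = addr(b) - addr(c)

print(addr(a) == addr(c))      # a starts where c starts
print(offset_b == itemsize)    # b starts one element later
print(c.strides, a.strides, b.strides)
print(c.flags['C_CONTIGUOUS'], b.flags['C_CONTIGUOUS'])
```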

When a and b are created separately, their data buffers are entirely separate. The numpy striding mechanism cannot step back and forth between these two data buffers. A 2d composite of a and b has to work with its own data buffer.
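One way to live with this constraint, if the code that owns a and b can be changed, is to reverse the order of construction: allocate the 2D buffer, copy the existing data in once, and then rebind a and b as views of its columns. This is only a sketch of that idea with made-up data, and it assumes nothing else still holds references to the original 1D arrays, since those old objects would not see later changes.

```python
import numpy as np

# Existing, separately allocated 1D arrays (example data).
a = np.arange(5.0)
b = np.arange(5.0, 10.0)

# Allocate one shared buffer and copy the data in once.
c = np.empty((a.size, 2), dtype=np.result_type(a, b))
c[:, 0] = a
c[:, 1] = b

# Rebind the names to views of c; the original arrays are dropped.
a = c[:, 0]
b = c[:, 1]

c[2, 1] = 99.0   # visible through b
a[0] = -1.0      # visible through c
print(b[2], c[0, 0])
```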

I can imagine writing a class that ends up looking like what you want. The index_tricks module that defines np.c_ might give you ideas (e.g. a class with a custom __getitem__ method). But it wouldn't have the speed advantages of a regular 2d array, and it might be hard to implement all of the ndarray functionality.
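A minimal sketch of such a class might look like the following. ColumnStack and its to_array method are hypothetical names invented for this illustration, not part of NumPy, and the class only supports c[row, col] indexing with an integer column; everything else an ndarray offers is missing.

```python
import numpy as np

class ColumnStack:
    """Hypothetical wrapper presenting separate 1D arrays as columns."""

    def __init__(self, *columns):
        n = len(columns[0])
        if any(col.ndim != 1 or len(col) != n for col in columns):
            raise ValueError("all columns must be 1D and of equal length")
        self.columns = list(columns)  # references, not copies

    @property
    def shape(self):
        return (len(self.columns[0]), len(self.columns))

    def __getitem__(self, key):
        row, col = key  # col must be an int; row may be an int or a slice
        return self.columns[col][row]

    def __setitem__(self, key, value):
        row, col = key
        self.columns[col][row] = value  # writes into the original 1D array

    def to_array(self):
        # Materialises a real 2D array; this is a copy, not a view.
        return np.column_stack(self.columns)

a = np.zeros(3)
b = np.ones(3)
c = ColumnStack(a, b)
c[1, 0] = 7.0   # lands in a
c[:, 1] = 5.0   # lands in b
print(a, b, c.shape)
```

Every element access goes through Python-level dispatch here, which is where the speed penalty mentioned above comes from.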
