MATLAB find mean of column in matrix using two different indices

Question

I have a 22007x3 matrix with data in column 3 and two separate indices in columns 1 and 2.

eg.

I need to find the mean of the values in column 3 when the values in column 1 are the same AND the values in column 2 are the same, to end up with something like:

ans = 

    1   3   4.6667
    1   16  3.6667
    2   4   2
    2   11  2.3333

Please bear in mind that in my data, the number of times each pair of values in columns 1 and 2 occurs can differ.

Two options I've tried already are the meshgrid/accumarray option, using two distinct unique functions and a 3D array:

[U, ix, iu] = unique(x(:, 1));
[U2,ix2,iu2] = unique(x(:,2));
[c, r, j] = meshgrid((1:size(x(:, 1), 2)), iu, iu2);
totals = accumarray([r(:), c(:), j(:)], x(:), [], @nanmean);

which gives me this:

??? Maximum variable size allowed by the program is exceeded.

Error in ==> meshgrid at 60
    xx = xx(ones(ny,1),:,ones(nz,1));

and the loop option,

for i=1:size(x,1)
    if x(i,2)== x(i+1,2);
        totals(i,:)=accumarray(x(:,1),x(:,3),[],@nanmean);
    end
end

which is obviously so very, very wrong, not least because of the x(i+1,2) bit.

I'm also considering creating separate matrices depending on how many times a value in column 1 occurs, but that would be long and inefficient, so I'm loath to go down that road.

Oleg · Accepted Answer

Group on the first two columns with unique(...,'rows'), then accumulate only the third column. It is generally best to accumulate only the values that actually need accumulating and to leave the index columns (here the first two) out of the accumulation; you can reattach them afterwards from unX:

[unX,~,subs] = unique(x(:,1:2),'rows');
out          = [unX accumarray(subs,x(:,3),[],@nanmean)];

out =
            1            3       4.6667
            1           16       3.6667
            2            4            2
            2           11       2.3333
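
For anyone who wants to try this without the original data, here is a minimal self-contained sketch. The matrix x below is hypothetical (the question's sample data is not reproduced here); its values were picked only so that the group means match the output shown above. It also swaps nanmean for an anonymous function, on the assumption that you may not have the toolbox that provides nanmean; if you do, @nanmean works as in the answer.

```matlab
% Hypothetical sample data (NOT from the original question), chosen so that
% the per-group means match the output shown above.
x = [1  3 4
     1  3 5
     1  3 5
     1 16 3
     1 16 4
     1 16 4
     2  4 2
     2 11 2
     2 11 2
     2 11 3];

% unX holds one row per distinct (column 1, column 2) pair;
% subs maps every row of x to the row of unX it belongs to.
[unX, ~, subs] = unique(x(:,1:2), 'rows');

% NaN-ignoring mean written by hand (assumption: avoids depending on nanmean).
nanSafeMean = @(v) mean(v(~isnan(v)));

% Accumulate only column 3, then reattach the index columns from unX.
out = [unX, accumarray(subs, x(:,3), [], nanSafeMean)];
disp(out)
```

Because the grouping index subs has only as many distinct values as there are distinct pairs, accumarray builds a short column vector rather than a full grid over every combination of column 1 and column 2 values, which is what made the meshgrid attempt run out of memory.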
