Standard deviation from center of mass along Numpy array axis

Question

I am trying to find a well-performing way to calculate the standard deviation from the center of mass/gravity along an axis of a Numpy array.

In formula this is (sorry for the misalignment):

$\mu_j = \frac{\sum_i{i A_{ij}}}{\sum_i{A_{ij}}} \newline \newline \text{var}_j = \frac{\sum_i{i^2 A_{ij}}}{\sum_i{A_{ij}}} - \mu_j^2 \newline \newline \text{std}_j = \sqrt{\text{var}_j}$

The best I could come up with is this:

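import numpy as np

def weighted_com(A, axis, weights):
    # np.average normalizes by weights.sum(); rescale so that the result is
    # normalized by the sum of the values in A along `axis` instead.
    average = np.average(A, axis=axis, weights=weights)
    return average * weights.sum() / A.sum(axis=axis).astype(float)

def weighted_std(A, axis):
    # The weights are the index positions i = 0, 1, ..., n-1 along `axis`.
    weights = np.arange(A.shape[axis])
    w1com2 = weighted_com(A, axis, weights)**2   # mu_j**2
    w2com1 = weighted_com(A, axis, weights**2)   # sum_i(i**2 * A_ij) / sum_i(A_ij)
    return np.sqrt(w2com1 - w1com2)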

In weighted_com, I need to correct the normalization from sum of weights to sum of values (which is an ugly workaround, I guess). weighted_std is probably fine.
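
Written out, with $w_i$ standing for the entries of weights (a symbol introduced here only for illustration), the correction works because np.average normalizes by the sum of the weights:

$\text{average}_j = \frac{\sum_i{w_i A_{ij}}}{\sum_i{w_i}} \newline \newline \text{average}_j \cdot \frac{\sum_i{w_i}}{\sum_i{A_{ij}}} = \frac{\sum_i{w_i A_{ij}}}{\sum_i{A_{ij}}}$

With $w_i = i$ the right-hand side is $\mu_j$, and with $w_i = i^2$ it is the first term of $\text{var}_j$.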

To avoid the XY problem, I am asking for what I actually want (a better weighted_std) rather than for a better version of my weighted_com.

The .astype(float) is a safety measure: I'll apply this to histograms containing ints, which caused problems due to integer division under Python 2 when from __future__ import division is not active.
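
For comparison, here is a minimal sketch that evaluates the formulas directly instead of going through np.average, so no normalization correction is needed. The name weighted_std_direct is hypothetical. Converting to float up front and clipping small negative variances are assumptions of this sketch, not part of the code above.

```python
import numpy as np

def weighted_std_direct(A, axis=0):
    """Std of the index positions along `axis`, weighted by the values of A."""
    A = np.asarray(A, dtype=float)    # float up front: no integer division on int histograms
    A = np.moveaxis(A, axis, -1)      # move the axis of interest to the end
    idx = np.arange(A.shape[-1])      # positions i = 0, 1, ..., n-1
    total = A.sum(axis=-1)            # sum_i A_ij
    mean = (A @ idx) / total          # mu_j
    mean_sq = (A @ idx**2) / total    # sum_i i^2 A_ij / sum_i A_ij
    var = mean_sq - mean**2           # var_j
    # Assumption: clip tiny negative values caused by floating-point rounding.
    return np.sqrt(np.clip(var, 0, None))

A = np.array([[1, 0],
              [0, 0],
              [1, 4]])
print(weighted_std_direct(A, axis=0))
```

In the example, the first column has equal mass at positions 0 and 2, and the second column has all its mass at position 2. Slices whose values sum to zero produce nan, since the center of mass is undefined there.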
