norm fit producing incorrect fit

Question

Why does the following produce an incorrect output for muf and stdf?

import numpy as np
from scipy.stats import norm
x=np.linspace(-50,50,100)
sig=10
mu=0
y=1/np.sqrt(2*sig*sig*np.pi)*np.exp(-(x-mu)*(x-mu)/(2*sig*sig))
muf, stdf = norm.fit(y)
print(muf, stdf)

This prints 0.00989999568634 0.0134634293279

Thanks.

ImportanceOfBeingErnest · Accepted Answer

The documentation of scipy.stats.norm says for its fit function

fit(data, loc=0, scale=1) Parameter estimates for generic data.

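To me this is hard to understand, but I'm pretty sure one cannot expect this function to return a fit in the usual sense. It treats data as a sample of observations drawn from the distribution and estimates loc and scale from them; it does not fit a curve to (x, y) points. Here the values in y are taken as the sample, so muf and stdf are simply the mean and standard deviation of the y values.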
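A quick way to check this reading, as a minimal sketch: norm.fit applied to y should reproduce the plain mean and standard deviation of y, while applied to random draws from the distribution it should return values near mu and sig. The samples array, its size, and the seed are my own illustration and not part of the question.

```python
import numpy as np
from scipy.stats import norm

x = np.linspace(-50, 50, 100)
sig = 10
mu = 0
y = 1/np.sqrt(2*sig*sig*np.pi)*np.exp(-(x-mu)*(x-mu)/(2*sig*sig))

# norm.fit treats y as a sample, so it returns the mean and std of y itself
muf, stdf = norm.fit(y)
print(np.isclose(muf, np.mean(y)), np.isclose(stdf, np.std(y)))

# Given actual draws from the distribution, it estimates mu and sig
# (samples, its size and the seed are illustrative assumptions)
rng = np.random.RandomState(0)
samples = rng.normal(loc=mu, scale=sig, size=10000)
mus, stds = norm.fit(samples)
print(mus, stds)
```

So norm.fit is the right tool when you have raw observations, not when you have the curve values themselves.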

However, fitting a Gaussian is rather straightforward:

from __future__ import division
import numpy as np

x=np.linspace(-50,50,100)
sig=10
mu=0
y=1/np.sqrt(2*sig*sig*np.pi)*np.exp(-(x-mu)*(x-mu)/(2*sig*sig))

def gaussian_fit(xdata,ydata):
    mu = np.sum(xdata*ydata)/np.sum(ydata)
    sigma = np.sqrt(np.abs(np.sum((xdata-mu)**2*ydata)/np.sum(ydata)))
    return mu, sigma

print(gaussian_fit(x,y))

This prints (-7.474196315587989e-16, 9.9999422983567516) which is sufficiently close to the expected values of (0, 10).
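The moment-based estimate relies on the data covering the whole peak. If that is not the case, one alternative is a least-squares fit of the model function with scipy.optimize.curve_fit. This is only a sketch and a suggestion of mine, not part of the method above; the helper name gaussian and the starting guess p0 are assumptions chosen for illustration.

```python
import numpy as np
from scipy.optimize import curve_fit

def gaussian(x, mu, sigma):
    # Normalized Gaussian, same form as y in the question
    return 1/np.sqrt(2*np.pi*sigma**2)*np.exp(-(x-mu)**2/(2*sigma**2))

x = np.linspace(-50, 50, 100)
y = gaussian(x, 0, 10)

# p0 is a rough starting guess (an assumption; pick one suited to your data)
popt, pcov = curve_fit(gaussian, x, y, p0=[1.0, 8.0])
mu_fit, sigma_fit = popt

# The model depends only on sigma**2, so report the absolute value
print(mu_fit, abs(sigma_fit))
```

Unlike the sums above, this approach needs a reasonable initial guess, but it does not require the tails of the peak to be present in the data.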
