PandaBearSoup

Reputation: 699

Multivariate regression not getting same coefficients as sklearn

I am computing coefficients like this:

import numpy as np

def estimate_multivariate(data, target):
    # ordinary least squares via the normal equations: beta = (x'x)^-1 x'y
    x = np.array(data)
    y = np.array(target)
    inv = np.linalg.inv(np.dot(x.T, x))
    beta = np.dot(np.dot(inv, x.T), y)
    return beta

and get these results:

[[ 103.56793536]
 [  63.93186848]
 [-272.06215991]
 [ 500.43324361]
 [ 327.45075839]]

However, if I create the model with sklearn.linear_model I get these results:

[ 118.45775015   64.56441108 -256.20123986  500.43324362  327.45075841]

This only happens when I use

from sklearn import preprocessing

poly = preprocessing.PolynomialFeatures(degree=2)
x = poly.fit_transform(x)

with a degree greater than 1. When I use the original data the coefficients of both methods are the same. What could account for this? Is there some truncation somewhere?

Upvotes: 0

Views: 263

Answers (1)

ogrisel

Reputation: 40159

Just to check: which model from sklearn.linear_model did you use? LinearRegression? All the other regression models in that module are penalized, which could explain the discrepancy.

Assuming this is using LinearRegression, you should do one of the following (see the sketch after this list):

  • make sure that you have a column in your data array with constant value 1 and treat the beta of that column as the intercept_ of the linear model,

  • or disable intercept fitting for the linear model: LinearRegression(fit_intercept=False).fit(data, target).coef_
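
For example, a minimal sketch of the comparison (the toy data and variable names here are only illustrative, not from your code):

import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.preprocessing import PolynomialFeatures

# toy data, only to illustrate the comparison
rng = np.random.RandomState(0)
x = rng.rand(50, 2)
y = 3 * x[:, 0] - 2 * x[:, 1] ** 2 + 1

# degree=2 already prepends a constant column of ones, so the first
# coefficient plays the role of intercept_
x_poly = PolynomialFeatures(degree=2).fit_transform(x)

# closed-form estimate, as in the question
beta = np.linalg.inv(np.dot(x_poly.T, x_poly)).dot(x_poly.T).dot(y)

# disable intercept fitting so coef_ lines up with beta entry by entry
coef = LinearRegression(fit_intercept=False).fit(x_poly, y).coef_

print(np.allclose(beta, coef))  # True on this well-conditioned toy data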

Assuming you have also taken care of that, keep in mind that extracting polynomial features significantly increases the number of features, and if your number of samples is too small, the empirical covariance matrix will be ill-conditioned and calling np.linalg.inv on it will be numerically unstable. For reference, LinearRegression uses a least squares solver instead of the closed-form formula involving np.linalg.inv.
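
If you want to keep the manual computation, one safer variant is to avoid the explicit inverse and call a least squares routine such as np.linalg.lstsq (just a sketch; the function name is made up here):

import numpy as np

def estimate_multivariate_lstsq(data, target):
    # solve min ||x beta - y||_2 directly; this is much better behaved
    # than forming (x'x)^-1 when x'x is ill-conditioned
    x = np.asarray(data, dtype=float)
    y = np.asarray(target, dtype=float)
    beta, residuals, rank, singular_values = np.linalg.lstsq(x, y, rcond=None)
    return beta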

When n_features >> n_samples you should use a penalized linear regression model such as sklearn.linear_model.Ridge instead of ordinary least squares.
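
A minimal Ridge sketch, reusing x_poly and y from the example above (alpha=1.0 is only a placeholder and should be tuned, e.g. with cross-validation):

from sklearn.linear_model import Ridge

# l2-penalized least squares; the penalty keeps the problem well posed
# even when the polynomial features outnumber the samples
ridge = Ridge(alpha=1.0, fit_intercept=False).fit(x_poly, y)
print(ridge.coef_)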

Upvotes: 2
