Fit a Normalizer with one array, then transform another in Python with sklearn

Question

I'm not sure if I'm doing something wrong, or if this is not the correct way to do this.

I'm encoding variables in a dataset for a model, and I'm using Normalizer() from sklearn.preprocessing to normalize one of my numerical variables.

My dataset is split in two: one subset for training and one for inference. My goal is to normalize this numerical variable (let's call it column x) in the training subset, and then use the same normalization parameters to normalize that variable in the inference subset. The two subsets don't have the same number of entries. What I'm doing is:

nr = Normalizer()
nr.fit([df1.x])
new_col = nr.transform(df1.x)

The problem is that when I try to use the same normalizer parameters on column x in the inference subset, which has a different number of rows:

new_col1 = nr.transform(df2.x)

I get:

X has 10 features, but Normalizer is expecting 697 features as input.

I'm not sure if it's a reshape problem or if Normalizer() shouldn't be used this way, so any advice would be more than welcome.
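
For what it's worth, here is a minimal sketch of what seems to be going on, under a few assumptions. Wrapping the column in a list, as in nr.fit([df1.x]), produces an array of shape (1, 697), which scikit-learn reads as one sample with 697 features; that would explain the feature-count error. Separately, Normalizer is stateless and rescales each row (sample) to unit norm, so it has no column parameters to carry over to another dataset. If the intent is to learn scaling parameters on the training column and reuse them, a column-wise scaler may be what is wanted. StandardScaler is swapped in below as an assumption about that intent (MinMaxScaler follows the same pattern), and train_x / infer_x are hypothetical stand-ins for df1.x and df2.x.

```python
import numpy as np
from sklearn.preprocessing import Normalizer, StandardScaler

# Hypothetical stand-ins for df1.x (training) and df2.x (inference);
# with pandas, df1.x.to_numpy() would give the equivalent 1-D array.
train_x = np.array([1.0, 2.0, 3.0, 4.0])
infer_x = np.array([2.0, 6.0])

# scikit-learn expects shape (n_samples, n_features):
# many rows, one column for a single variable.
train_col = train_x.reshape(-1, 1)
infer_col = infer_x.reshape(-1, 1)

# Normalizer rescales each ROW to unit norm and learns nothing in fit,
# so on a single column every nonzero value collapses to +1 or -1.
row_normalized = Normalizer().fit_transform(train_col)

# StandardScaler (an assumed replacement) learns the column mean and
# standard deviation from the training data only...
scaler = StandardScaler()
train_scaled = scaler.fit_transform(train_col)

# ...and reuses them on data with any number of rows.
infer_scaled = scaler.transform(infer_col)
```

The reshape(-1, 1) step is what lets the two subsets have different row counts: the number of features stays at 1 in both, and only the number of samples differs.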
