Scaling down high dimensional pandas' data frame data using sklean

Question

I am trying to scale down values in pandas data frame. The problem is that I have 291 dimensions, so scale down the values one by one is time consuming if we are to do it as follows:

from sklearn.preprocessing import StandardScaler
sclaer = StandardScaler()
scaler = sclaer.fit(dataframe['dimension_1'])
dataframe['dimension_1'] = scaler.transform(dataframe['dimension_1'])

Problem: This is only for one dimension, so how we can do this please for the 291 dimension in one shot?

yudhiesh · Accepted Answer

You can pass in a list of the columns that you want to scale instead of individually scaling each column.

# convert the columns labelled 0 and 1 to boolean values 
df.replace({0: False, 1: True}, inplace=True)

# make a copy of dataframe
scaled_features = df.copy()

# take the numeric columns i.e. those which are not of type object or bool
col_names = df.dtypes[df.dtypes != 'object'][df.dtypes != 'bool'].index.to_list()
features = scaled_features[col_names]

# Use scaler of choice; here Standard scaler is used
scaler = StandardScaler().fit(features.values)
features = scaler.transform(features.values)

scaled_features[col_names] = features

Scaling down high dimensional pandas' data frame data using sklean

Answers (2)

Related Questions

Scaling down high dimensional pandas&#39; data frame data using sklean

Answers (2)

Related Questions

Scaling down high dimensional pandas' data frame data using sklean