Bert Carremans

Reputation: 1733

Keep other variables when executing get_dummies in Pandas

I have a DataFrame with an ID variable and another categorical variable. I want to create dummy variables out of the categorical variable with get_dummies.

dum = pd.get_dummies(df)

However, this makes the ID variable disappear, and I need it later on to merge with other data sets.

Is there a way to keep the other variables? I could not find anything in the documentation of get_dummies. Thanks!

Upvotes: 9

Views: 16108

Answers (2)

Tom

Reputation: 1063

You can also copy the original column into a new one before executing get_dummies. E.g.,

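# keep a copy of the categorical column, since get_dummies replaces the original
df['dum_orig'] = df['dum']
# encode only the 'dum' column; all other columns are left as they are
df = pd.get_dummies(df, columns=['dum'])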
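
As a minimal sketch of how this plays out: the sample frame and its values below are made up for illustration, and only the column names dum and dum_orig follow the snippet above. Passing columns= limits the encoding to the named column, so a string ID column should be left untouched, while the copy keeps the original categories.

```python
import pandas as pd

# Hypothetical sample data: a string ID plus one categorical column.
df = pd.DataFrame({"ID": ["a1", "b2", "c3"], "dum": ["x", "y", "x"]})

# Copy the categorical column so the original values survive the encoding.
df["dum_orig"] = df["dum"]

# Encode only 'dum'; 'ID' and 'dum_orig' are not touched.
encoded = pd.get_dummies(df, columns=["dum"])

print(encoded.columns.tolist())
```

Without columns=, get_dummies encodes every object or category column, which is probably why a string ID column seems to vanish: it gets turned into dummies as well.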

Upvotes: 11

Bert Carremans

Reputation: 1733

I found the answer. You can concatenate the dummies data set to the original data set as shown below, as long as you don't re-order the data in the meantime.

df = pd.concat([df, dum], axis=1) 
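
A self-contained sketch of the concatenation, assuming a toy frame with a hypothetical column named color (neither the data nor that name comes from the question). Note that pd.concat with axis=1 aligns rows by index label, so what really has to match between the two frames is the index.

```python
import pandas as pd

# Hypothetical sample data: a numeric ID plus one categorical column.
df = pd.DataFrame({"ID": [101, 102, 103], "color": ["red", "blue", "red"]})

# Build the dummies from the categorical column only.
dum = pd.get_dummies(df["color"], prefix="color")

# Join side by side; rows are matched on the shared index.
combined = pd.concat([df, dum], axis=1)

print(combined)
```

Since dum is derived directly from df, both share the same index and the rows line up. If either frame gets a new index in between (for example through reset_index), the alignment may no longer hold.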

Upvotes: 7
