How to transform string into binary records?

Question

I have the following dataset:

df = pd.read_csv('c:/1/Autism_Data.arff',na_values="?")

I need to convert the columns "gender", "jundice", and "austim" into binary 0/1 values. I would like the table to show these columns as 0 and 1.

modesitt · Accepted Answer

If you'd like to be brief you can use pd.Categorical. For example,

df['gender'] = pd.Categorical(df.gender).codes

You can extend this to the other desired columns. The codes are assigned in alphabetical order of the labels, so check the resulting mapping and remap the values if you need a different one. Note that missing values (NaN) get the code -1. Alternatively, if you would like some more control, you can use LabelEncoder.

from sklearn.preprocessing import LabelEncoder

le = LabelEncoder()
df['gender'] = le.fit_transform(df.gender)
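If you want to decide yourself which label becomes 0 and which becomes 1, rather than relying on alphabetical order, one option is to pass an explicit `categories` list to pd.Categorical and loop over the columns. The following is only a minimal sketch: the toy DataFrame and the label values ("f"/"m", "no"/"yes") are assumptions for illustration, so check them against the actual contents of your file first.

```python
import pandas as pd

# Toy frame standing in for the real data; the values are assumed for illustration.
df = pd.DataFrame({
    "gender": ["f", "m", "m", None],
    "jundice": ["no", "yes", "no", "no"],
    "austim": ["yes", "no", "no", "yes"],
})

# Explicit category order: the first label becomes 0, the second becomes 1.
orders = {
    "gender": ["f", "m"],
    "jundice": ["no", "yes"],
    "austim": ["no", "yes"],
}

for col, cats in orders.items():
    # Values not listed in cats (including missing values) get the code -1.
    df[col] = pd.Categorical(df[col], categories=cats).codes

print(df)
```

Because missing values come out as -1, you may want to drop or fill them before encoding if the columns must be strictly 0/1.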
