Manipulating series in a dataframe

Question

My dataframe has a list of comma separated values in one column. I want to find the list of distinct entries, create a new column for each distinct entry in the dataframe, then fill the new columns with 1 or 0 depending on whether the row has the city name. The idea is to use the new columns in building a logistic regression model.
As an example

Before

Name    City 
Jack    NewYork,Chicago,Seattle
Jill    Seattle, SanFrancisco
Ted     Chicago,SanFrancisco
Bill    NewYork,Seattle

After

Name    NewYork     Chicago     Seattle     SanFrancisco
Jack    1           1           1           0
Jill    0           0           1           1
Ted     0           1           0           1
Bill    1           0           1           0

Manipulating series in a dataframe

Answers (1)

Related Questions