how to split a string in one column into new columns for each character in pandas

Question

I have a pandas dataframe that looks like:

                flag
0        NNxxNxNNxNN
1        xxNNNNNNNNN
2        xxxNNxNNNNN
3        xxxxNxxxxxN
4        xxxxxxNxxxx
5        xxxxxxxNxNN

And I would like to split the string into a new column for each character, for example like this:

         col1 col2 col3 col4 col5 col6 col7 col8 col9 col10 col11
0        N    N    x    x    N    x    N    N    x    N    N
1        x    x    N    N    N    N    N    N    N    N    N
2        x    x    x    N    N    x    N    N    N    N    N
3        x    x    x    x    N    x    x    x    x    x    N
4        x    x    x    x    x    x    N    x    x    x    x
5        x    x    x    x    x    x    x    N    x    N    N

My dataframe has several million rows - is there an efficient way to do this?

BENY · Accepted Answer

Using tolist with pd.DataFrame

pd.DataFrame(df.flag.apply(list).tolist())
Out[905]: 
  0  1  2  3  4  5  6  7  8  9  10
0  N  N  x  x  N  x  N  N  x  N  N
1  x  x  N  N  N  N  N  N  N  N  N
2  x  x  x  N  N  x  N  N  N  N  N
3  x  x  x  x  N  x  x  x  x  x  N
4  x  x  x  x  x  x  N  x  x  x  x
5  x  x  x  x  x  x  x  N  x  N  N

And method from extractall

df.flag.str.extractall('(.)')[0].unstack()
Out[931]: 
match 0  1  2  3  4  5  6  7  8  9  10
0      N  N  x  x  N  x  N  N  x  N  N
1      x  x  N  N  N  N  N  N  N  N  N
2      x  x  x  N  N  x  N  N  N  N  N
3      x  x  x  x  N  x  x  x  x  x  N
4      x  x  x  x  x  x  N  x  x  x  x
5      x  x  x  x  x  x  x  N  x  N  N

how to split a string in one column into new columns for each character in pandas

Answers (2)

Related Questions