Expand all columns with lists into own columns

Question

Given the following dataframe:

    FrameLen FrameCapLen  IPHdrLen  ... TLSRecordLen  TLSAppData PacketTime
0  [117, 66]   [117, 66]  [20, 20]  ...  [46.0, nan]  [nan, nan]   0.000045
1  [117, 66]   [117, 66]  [20, 20]  ...  [46.0, nan]  [nan, nan]   0.000024
2  [117, 66]   [117, 66]  [20, 20]  ...  [46.0, nan]  [nan, nan]   0.000027
3  [117, 66]   [117, 66]  [20, 20]  ...  [46.0, nan]  [nan, nan]   0.000024
4  [117, 66]   [117, 66]  [20, 20]  ...  [46.0, nan]  [nan, nan]   0.000011

How can I expand each column that contains a list into respective new columns, so that FrameLen would become FrameLen_1, FrameLen_2, ..., etc.?

I know this can be done manually for each column manually (see below) but how could this be done automatically?

Example of manual operation (which looks inefficient?):

tags = df['FrameLen'].apply(pd.Series)
tags = column.rename(columns = lambda x : 'FrameLen_' + str(x))
pd.concat([df[:], tags[:]], axis=1)
del df['FrameLen']

Mayank Porwal · Accepted Answer

You can use a for loop with df.add_prefix to append column name:

Take below df for example:

In [93]: In [55]: df = pd.DataFrame({'FrameLen':[[117, 66], [117, 66], [117, 66]], 'FrameCapLen':[[117, 66],[117, 66],[117, 66]], 'IPHdrLen':[[20, 20],[20, 20],[20, 20]], 'PacketTime':[0.000045, 0.000024,
    ...:  0.000027]})    

In [94]: df 
Out[94]: 
    FrameLen FrameCapLen  IPHdrLen  PacketTime
0  [117, 66]   [117, 66]  [20, 20]    0.000045
1  [117, 66]   [117, 66]  [20, 20]    0.000024
2  [117, 66]   [117, 66]  [20, 20]    0.000027

In [90]: d = pd.DataFrame() ## empty dataframe

In [91]: for i,j in df.items(): 
    ...:     if df['{}'.format(i)].dtype == 'object': 
    ...:         temp = pd.DataFrame(df[i].values.tolist()).add_prefix('{}_'.format(i)) 
    ...:         d = pd.concat([d,temp], axis=1) 
    ...:     else: 
    ...:         d = pd.concat([d, df[i]], axis=1) 
    ...:   

In [92]: d 
Out[92]: 
   FrameLen_0  FrameLen_1  FrameCapLen_0  FrameCapLen_1  IPHdrLen_0  IPHdrLen_1  PacketTime
0         117          66            117             66          20          20    0.000045
1         117          66            117             66          20          20    0.000024
2         117          66            117             66          20          20    0.000027

Expand all columns with lists into own columns

Answers (2)

Related Questions