How to remove square brackets from dataframe

Question

I have seen many links related to my question:

How to remove extraneous square brackets from a nested list inside a dictionary?

but none of that worked

below is my example:

df1

column1    column2   column3    ..... upto 'n' number of columns

[data1]    data1     data1
NAN        data2     data2
[data2]    data3     [data3, data3, testing how are you guys hope you guys are doing :)]
[data3]    data3     [data4, dummy text to test to test test test] 
NAN        data4     [data5]

below is my tried code:

df1[column1] = df[column1].str[0]
# not working !
# want to give df1 instead of df1[columns] because there are lot of 
# columns

i want to remove only the bracket, not anything else and want to give only dataframe not along with columns because there are lot of columns !

expected output:

column1    column2   column3    ..... upto 'n' number of columns

data1      data1     data1
NAN        data2     data2
data2      data3     data3, data3, testing how are you guys hope you guys are doing :)
data3      data3     data4, dummy text to test to test test test
NAN        data4     data5

not_speshal · Accepted Answer

Try with apply, explode and groupby:

>>> df.apply(lambda x: x.explode().astype(str).groupby(level=0).agg(", ".join))
  column1 column2                                            column3
0   data1   data1                                              data1
1     nan   data2                                              data2
2   data2   data3  data3, data3, testing how are you guys hope yo...
3   data3   data3        data4, dummy text to test to test test test
4     nan   data4                                              data5

Use pandas.explode() to transform each list element to its own row, replicating index values.
Then groupby identical index values and aggregate using str.join().
Use apply to apply the same function to all columns of the DataFrame.

How to remove square brackets from dataframe

Answers (2)

Related Questions