remove duplicates in list in column in Pandas

Question

Pandas perhaps way out there question.

Have a dataframe like this

    Col1           Col2
['joe', 'joe']     ['joe']
['sam','bob']     ['sam'.'bob']
['mary','mary']   ['mary']

I want to use an apply function on Col1 to get the result in Col2. Meaning, I want the lists with duplicates in Col1 to no longer have those duplicates in Col2. Tried various functions with apply and set, no dice. Seems like it should be straightforward, but hold on to the laptop, it isn't. Or so it seems..

BENY · Accepted Answer

For get the col two

df['ColB'] = df['Col1'].explode().groupby(level=0).unique()

remove duplicates in list in column in Pandas

Answers (2)

Related Questions