Replace all None's in a Pandas data frame with a tuple of None's

Question

I am working on some code for an NLP application. A solution on Stack Overflow creates a dataframe from lists of unequal lengths. Taking the code from that solution, with tuples in the input:

import pandas as pd
import itertools

aa = [('aa1',4), ('aa2',3), ('aa3',2), ('aa4',2), ('aa5',1)]
bb = [('bb1',8), ('bb2',6), ('bb3',4), ('bb4',4)]
cc = [('cc1',3), ('cc2',2), ('cc3',1)]
nest = [aa, bb, cc]

df = pd.DataFrame((_ for _ in itertools.zip_longest(*nest)), columns=['aa', 'bb', 'cc'])
df

you get a dataframe in which the shorter columns are padded with None.

A subsequent step requires all elements in the data frame to be tuples.

I have tried this:

df.replace({None : (None,None)})

While it seems to run without error, it does not carry out any replacement. Any ideas how to accomplish this?

Stefan Falk · Accepted Answer

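One way to do it would be to use pandas.DataFrame.apply() together with pandas.Series.map(). Note that this returns a new dataframe rather than modifying df in place, so assign the result back:

df = df.apply(lambda ds: ds.map(lambda x: x if x is not None else (None, None)))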
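For completeness, here is a minimal self-contained sketch of the apply/map approach using the data from the question. The names PAD, fixed and padded are made up for illustration. It also shows a possible alternative that is not part of the answer above: since the None values come from itertools.zip_longest, its fillvalue argument can supply the tuple up front, assuming you control how the dataframe is built.

```python
import itertools

import pandas as pd

aa = [('aa1', 4), ('aa2', 3), ('aa3', 2), ('aa4', 2), ('aa5', 1)]
bb = [('bb1', 8), ('bb2', 6), ('bb3', 4), ('bb4', 4)]
cc = [('cc1', 3), ('cc2', 2), ('cc3', 1)]
nest = [aa, bb, cc]
columns = ['aa', 'bb', 'cc']

# Construction from the question: shorter lists are padded with None.
df = pd.DataFrame(list(itertools.zip_longest(*nest)), columns=columns)

# PAD is a hypothetical name for the replacement value.
PAD = (None, None)

# Approach from the answer: map every column element-wise and keep the result,
# because apply() returns a new dataframe.
fixed = df.apply(lambda col: col.map(lambda x: x if x is not None else PAD))

# Alternative (an assumption, only applicable if you build the dataframe yourself):
# let zip_longest pad with the tuple directly.
padded = pd.DataFrame(
    list(itertools.zip_longest(*nest, fillvalue=PAD)), columns=columns
)

print(fixed)
```

Both fixed and padded should then contain only tuples, so a later step that unpacks each cell into two values does not need a special case for None.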
