formicaman

Reputation: 1357

Removing nulls from Pyspark Dataframe in individual columns

I have a pyspark dataframe like this:

    +--------------------+--------------------+
    |                name|               value|
    +--------------------+--------------------+
    |                null|                null|
    |                null|                null|
    |                null|                null|
    |                null|                null|
    |                null|                null|
    |                null|                null|
    |                null|                null|
    |                null|                null|
    |                null|                null|
    |                null|                null|
    |                  id|                null|
    |                name|                null|
    |                 age|                null|
    |                food|                null|
    |                null|                   1|
    |                null|                 Joe|
    |                null|                  47|
    |                null|               pizza|
    +--------------------+--------------------+

I want to remove the null values from each individual column so that the non-null data lines up.

The desired output is:

    +--------------------+--------------------+
    |                name|               value|
    +--------------------+--------------------+
    |                  id|                   1|
    |                name|                 Joe|
    |                 age|                  47|
    |                food|               pizza|
    +--------------------+--------------------+

I have tried removing nulls with something like df.dropna(how='any') or df.dropna(how='all'), and also by separating out the columns and dropping the nulls from each, but then it becomes difficult to join them back together (roughly as in the sketch below).
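Roughly, that separate-and-rejoin attempt looks like this (rn is a synthetic row-number key built over monotonically_increasing_id, added just to line the rows back up, and it is exactly what makes the rejoin fragile):

    from pyspark.sql import functions as F
    from pyspark.sql.window import Window

    # drop nulls from each column independently, then number the surviving
    # rows so they can be matched up positionally again
    w = Window.orderBy(F.monotonically_increasing_id())
    names = df.select("name").dropna().withColumn("rn", F.row_number().over(w))
    values = df.select("value").dropna().withColumn("rn", F.row_number().over(w))

    # rejoin on the synthetic key and discard it
    rejoined = names.join(values, "rn").drop("rn")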

Upvotes: 1

Views: 204

Answers (1)

Som

Reputation: 6338

Try this. It is written in Scala, but it can be ported to PySpark with minimal change (a rough PySpark port is sketched after the output below).

    import org.apache.spark.sql.functions._
    import spark.implicits._ // for the $"..." column syntax

    // collect_list drops nulls, so each column collapses to its non-null values;
    // map_from_arrays zips them positionally and explode_outer re-expands to rows
    df.select(map_from_arrays(collect_list("name"), collect_list("value")).as("map"))
      .select(explode_outer($"map").as(Seq("name", "value")))
      .show(false)

    /**
      * +----+-----+
      * |name|value|
      * +----+-----+
      * |id  |1    |
      * |name|Joe  |
      * |age |47   |
      * |food|pizza|
      * +----+-----+
      */
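Since the question is in PySpark, here is a rough port of the same idea (a sketch, assuming Spark 2.4+ for map_from_arrays; note that Spark does not guarantee collect_list preserves row order across partitions, so on shuffled data the pairing can get scrambled):

    from pyspark.sql import functions as F

    # collect the non-null values of each column into arrays, zip them
    # into a map, then explode the map back into one row per entry
    result = (df
        .select(F.map_from_arrays(F.collect_list("name"),
                                  F.collect_list("value")).alias("map"))
        .select(F.explode_outer("map").alias("name", "value")))

    result.show(truncate=False)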

Upvotes: 1
