Converting one column spark dataframe into Single String delimited by pipline character in Python

Question

I have a spark data frame, which contains one column of information. It looks like:

Name
----------
Bob


----------
Dan

I want to convert this into a single string, delimited by pipeline characters.

"Bob|Dan"

How would I go about doing so in Python (pyspark)? Currently, I'm creating the dataframe via

df = sqlContext.sql("Select name from db")

If you could help lead me in a certain direction, I'd appreciate it.

Ezer K · Accepted Answer

Does this help?

df = sqlContext.createDataFrame([{'name':'Bob'},{'name':'Dan'}])

'|'.join([str(x.asDict().values()[0])  for x in df.collect()])

Answers (2)