Load a variable into a dataframe

Question

In PySpark, I am trying to load a dataframe from a string variable.

My variable is a multi line text..

string_data = """
 Name|age|city
 david|23|London
 krish|24|Bali
 john|56|Goa
"""

I wanted to load this data into a dataframe in PySpark. Thought of using datasets but they are not available in PySpark.

Using Pandas, I used to write like this:

string2 = StringIO(string_data)

df = pd.read_csv(string2,sep='|')

Answers (1)