user3591675

Reputation: 399

Installing seaborn on PySpark

I am using Apache Spark (PySpark) from a Jupyter notebook. In one of the machine learning tutorials I followed, the instructors used seaborn together with PySpark. How can I install and use third-party libraries like seaborn with PySpark?

Upvotes: 0

Views: 1136

Answers (1)

Jack_The_Ripper

Reputation: 703

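Generally, for plotting, you need to move all the data points you want to draw to the master node (the driver, where your notebook runs), using functions like collect(), before you can plot. Plotting is not possible while the data is still distributed in memory across the cluster. Since the plotting itself happens on the driver, seaborn only has to be installed in the Python environment the driver uses (for example with pip install seaborn).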
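A minimal sketch of that collect-then-plot pattern, assuming seaborn and pandas are installed in the driver's Python environment. The helper names `to_driver` and `scatter_on_driver`, the `max_rows` cap, and the column names in the usage line are all hypothetical and only there for illustration; `toPandas()` is used here in place of `collect()` because seaborn works most naturally with a pandas DataFrame.

```python
# Sketch only: the function names and `max_rows` are hypothetical helpers,
# not part of PySpark or seaborn.

def to_driver(spark_df, max_rows=10_000):
    """Bring at most `max_rows` rows of a Spark DataFrame to the driver."""
    # limit() bounds how much data is transferred; toPandas() then collects
    # the remaining rows into driver memory as a pandas DataFrame.
    return spark_df.limit(max_rows).toPandas()


def scatter_on_driver(pdf, x, y):
    """Draw a seaborn scatter plot from data already collected on the driver."""
    # Imported here because seaborn is only needed where the plot is drawn.
    import seaborn as sns
    return sns.scatterplot(data=pdf, x=x, y=y)
```

In a notebook this might be used as `scatter_on_driver(to_driver(df), "col_a", "col_b")`, where `df` is your Spark DataFrame and the column names are placeholders. Capping the row count is a design choice to keep the driver from running out of memory; aggregating or sampling in Spark before collecting would serve the same purpose.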

Upvotes: 4
