Reputation: 423
Is there any obvious performance degradation or drawback when deploying a Spark Streaming cluster in a virtualized environment such as Xen or KVM? If so, what is the main cause?
Upvotes: 0
Views: 835
Reputation: 27455
The usual caveats about virtualization apply, but there is nothing specific to Spark or Spark Streaming.
I don't know of an article that directly addresses this question. However, the Spark petabyte sort benchmark was run on EC2, which is itself a virtualized environment, and the write-up pays close attention to performance: https://databricks.com/blog/2014/10/10/spark-petabyte-sort.html
Upvotes: 1