Spark: How to combine multiple window aggregations performed on the same sliding window

Question

I am processing a time-series dataset and need to calculate stddev, mean, etc. over a sliding window of (-100, +100) rows.
I observed that the windowing is applied separately for each of these calculations, even though the sliding window is the same for all of them.
Is there a way to combine these calculations so that there is a single window and all the required fields are derived from it?

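  import org.apache.spark.sql.expressions.Window
  import org.apache.spark.sql.functions.{col, mean, stddev_pop}

  // One window spec, reused by every aggregate below
  val w = Window.partitionBy("raw_data_field_id").orderBy("date_time_epoch").rowsBetween(-100, 100)
  val rawdatax = rawdata
    .withColumn("valueSqrtStdDev", stddev_pop(col("valueSqrt")).over(w))
    .withColumn("valueSqrtMean", mean(col("valueSqrt")).over(w))
    ....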

Answers (1)
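
One possible approach, offered as a sketch rather than a confirmed fix: put every aggregate over the shared window spec into a single `select`, then call `explain()` to see how many Window operators the physical plan actually contains. As far as I understand, Spark groups window expressions that share the same partitioning and ordering into one Window operator, but that should be verified on your Spark version. The object name `SharedWindowSketch`, the helper `withWindowStats`, and the sample rows are hypothetical; the column names and window bounds come from the question.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}
import org.apache.spark.sql.expressions.Window
import org.apache.spark.sql.functions.{col, mean, stddev_pop}

// Hypothetical wrapper object, used only for this illustration.
object SharedWindowSketch {
  // Same window definition as in the question.
  private val w = Window
    .partitionBy("raw_data_field_id")
    .orderBy("date_time_epoch")
    .rowsBetween(-100, 100)

  // Adds all window aggregates in one select, so they share one projection.
  def withWindowStats(rawdata: DataFrame): DataFrame =
    rawdata.select(
      col("*"),
      stddev_pop(col("valueSqrt")).over(w).as("valueSqrtStdDev"),
      mean(col("valueSqrt")).over(w).as("valueSqrtMean")
    )

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[*]")
      .appName("shared-window-sketch")
      .getOrCreate()
    import spark.implicits._

    // Made-up sample rows; replace with the real `rawdata` DataFrame.
    val rawdata = Seq((1, 1L, 1.0), (1, 2L, 2.0), (1, 3L, 3.0), (2, 1L, 10.0))
      .toDF("raw_data_field_id", "date_time_epoch", "valueSqrt")

    val result = withWindowStats(rawdata)
    result.explain() // inspect how many Window operators appear in the plan
    result.show()
    spark.stop()
  }
}
```

If `explain()` on the original chained `withColumn` version already shows a single Window operator, the optimizer is merging the calculations and no rewrite is needed; the single `select` mainly makes the intent explicit.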