Optimize tail-recursion in Clojure: exponential moving average

Question

I'm new to Clojure and trying to implement an exponential moving average function using tail recursion. After battling stack overflows with lazy-seq and concat, I arrived at the following implementation, which works but is very slow:

(defn ema3 [c a]
  (loop [ct (rest c) res [(first c)]]
    (if (= (count ct) 0)
      res
      (recur
        (rest ct)
        (into ; NOT lazy-seq or concat
          res
          [(+ (* a (first ct)) (* (- 1 a) (last res)))])))))

For a 10,000-item collection, the Clojure version takes around 1300 ms, whereas a Python pandas call such as

s.ewm(alpha=0.3, adjust=True).mean()

takes only 700 µs. How can I reduce that performance gap? Thank you.

Taylor Wood · Accepted Answer

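If res is a vector (which it is in your example), then using peek instead of last yields much better performance, because last walks the whole collection in linear time while peek reads the end of a vector directly: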

(defn ema3 [c a]
  (loop [ct (rest c) res [(first c)]]
    (if (= (count ct) 0)
      res
      (recur
        (rest ct)
        (into
          res
          [(+ (* a (first ct)) (* (- 1 a) (peek res)))])))))

Your example on my computer:

(time (ema3 (range 10000) 0.3))
"Elapsed time: 990.417668 msecs"

Using peek:

(time (ema3 (range 10000) 0.3))
"Elapsed time: 9.736761 msecs"

Here's a version using reduce that's even faster on my computer:

(defn ema3 [c a]
  (reduce (fn [res ct]
            (conj
              res
              (+ (* a ct)
                 (* (- 1 a) (peek res)))))
          [(first c)]
          (rest c)))
;; "Elapsed time: 0.98824 msecs"
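
The same fold can also be expressed with reductions, which returns every intermediate accumulator value, so only the previous average has to be carried along rather than the whole result vector. This is a minimal sketch, not a benchmarked replacement; the name ema-reductions is hypothetical, and vec is used only to realize the lazy sequence into a vector like the versions above:

```clojure
;; Hypothetical variant: the accumulator is just the previous EMA value.
;; reductions yields the seed followed by each intermediate result.
(defn ema-reductions [c a]
  (vec
    (reductions (fn [prev x]
                  (+ (* a x)
                     (* (- 1 a) prev)))
                (first c)   ; seed: the first sample, as in ema3
                (rest c))))

(ema-reductions [1 2 3] 0.5)
```

Whether this is faster than the reduce version would need to be measured on the target machine.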

Take these timings with a grain of salt. Use something like criterium for more thorough benchmarking. You might be able to squeeze out more gains using mutability/transients.
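
As a rough illustration of the transients idea, here is a minimal sketch under a few assumptions. The name ema-transient is hypothetical. The previous value is carried in its own loop binding because transient vectors do not support peek. Returning an empty vector for empty input is a choice made here; the versions above return [nil] in that case.

```clojure
;; Hypothetical transient-based variant of ema3.
;; res is mutated in place with conj! and frozen with persistent! at the end.
(defn ema-transient [c a]
  (if (empty? c)
    []                                  ; assumption: empty in, empty out
    (loop [xs   (rest c)
           prev (first c)               ; last EMA value computed so far
           res  (transient [(first c)])]
      (if-let [s (seq xs)]
        (let [y (+ (* a (first s))
                   (* (- 1 a) prev))]
          (recur (rest s) y (conj! res y)))
        (persistent! res)))))

(ema-transient [1 2 3] 0.5)
```

Any speedup over the reduce version should be confirmed with a proper benchmark rather than assumed.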
