Scala: expressions vs method/function comparison

Question

My post title terminology probably isn't correct. But basically, I can do a self-join that only compares ids with those observed (48-72) hours afterward. I do that with this code:

df.join(df.select(col("id") as "id_b",
                  col("timestamp") as "timestamp_b"),
           (to_timestamp(timestamp).cast(long) + 172800) < (to_timestamp(timestamp_b).cast(long)) && 
           (to_timestamp(timestamp).cast(long)) < (to_timestamp(timestamp).cast(long) + 259200) &&

I've read on several SO posts that using the expr() is a "better" way of doing the above. Reasons were not given but I guess it's supposed to be more legible? Anyway, for the sake of knowledge, how would I convert the above into it's expr() equivalent? I know how to translate between the two approaches for simpler logic. The above case is beyond me due to the need to convert the timestamp to a long and I'm not sure of the syntax for that (or if it's even possible).

Also, is there a way to break up an expr() into multiple lines? I tried: expr(""" """ but that gives an EOL error.

My timestamps look like:

2020-07-27 02:34:52.3452
2020-10-21 13:23:55.2355
etc.

and are stored in a Spark DataFrame like so:

 |-- id: string (nullable = true)
 |-- timestamp: string (nullable = true)

Scala: expressions vs method/function comparison

Answers (1)

Related Questions