Reputation: 148
I am using Spark 2.1.0 on Unix and found a weird issue where unix_timestamp changes the hour for one particular timestamp. I created a DataFrame as below.
The first record in df2 has "20170312020200" as a String, which I later cast to a timestamp in df3. The hour should be 02, but it comes out as 03 in df3. The second record converts from string to timestamp without any issue.
This doesn't happen when I run the app locally from IntelliJ, but it does happen when we run the app with spark-submit.
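A minimal sketch of the kind of code described (the column name and exact steps here are assumptions, not the original snippet):

import org.apache.spark.sql.functions.unix_timestamp
import spark.implicits._   // assuming a SparkSession named `spark`, as in spark-shell

// Hypothetical reconstruction of the steps described above; names are assumptions.
val df2 = Seq("20170312020200", "20170312050200").toDF("datee")
val df3 = df2.withColumn("datee",
  unix_timestamp($"datee", "yyyyMMddHHmmss").cast("timestamp"))
df3.show(false)  // on the affected cluster the first row shows 03:02:00 instead of 02:02:00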
Upvotes: 0
Views: 10911
Reputation: 11469
I am using Spark 2, and you can see the following results. Your issue is not related to unix_timestamp or the Spark version; please check your data.
import org.apache.spark.sql.functions.unix_timestamp

// Build a DataFrame holding the raw 14-digit strings
val df2 = sc.parallelize(Seq(
  (10, "date", "20170312020200"), (10, "date", "20170312050200"))
).toDF("id ", "somthing ", "datee")
df2.show()

// Parse the string with the yyyyMMddHHmmss pattern and cast it to a timestamp
val df3 = df2.withColumn("datee", unix_timestamp($"datee", "yyyyMMddHHmmss").cast("timestamp"))
df3.show()
+---+---------+--------------+
|id |somthing | datee|
+---+---------+--------------+
| 10| date|20170312020200|
| 10| date|20170312050200|
+---+---------+--------------+
+---+---------+-------------------+
|id |somthing | datee|
+---+---------+-------------------+
| 10| date|2017-03-12 02:02:00|
| 10| date|2017-03-12 05:02:00|
+---+---------+-------------------+
import org.apache.spark.sql.functions.unix_timestamp
df2: org.apache.spark.sql.DataFrame = [id : int, somthing : string ... 1 more field]
df3: org.apache.spark.sql.DataFrame = [id : int, somthing : string ... 1 more field]
Upvotes: 1
Reputation: 18434
March 12, 2017 2:02 AM is not a valid time in a lot of time zones. That was when daylight saving time kicked in, and the clock skipped from 1:59:59 to 3:00:00 in the US.
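A minimal sketch of that gap using plain java.time (America/New_York is just one example of a US zone that observed the jump):

import java.time.{LocalDateTime, ZoneId}

// In a zone that sprang forward on 2017-03-12, 02:02 local time does not exist,
// so resolving it against the zone pushes it forward to the end of the gap: 03:02.
val zone  = ZoneId.of("America/New_York")
val inGap = LocalDateTime.of(2017, 3, 12, 2, 2, 0)
println(inGap.atZone(zone))    // 2017-03-12T03:02-04:00[America/New_York]

// A time outside the gap is left unchanged.
val outside = LocalDateTime.of(2017, 3, 12, 5, 2, 0)
println(outside.atZone(zone))  // 2017-03-12T05:02-04:00[America/New_York]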
My guess is that your local machine and the Spark cluster have different system time zone settings.
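One way to check is to print the JVM default zone in both environments, since in Spark 2.1 unix_timestamp parses against the JVM default time zone. A sketch (the UTC choice in the submit options is just an example):

import java.util.TimeZone

// Shows which zone the driver JVM is using; run the same check on the executors.
println(TimeZone.getDefault.getID)

If the zones differ, you can pin them at submit time by passing -Duser.timezone=UTC through spark.driver.extraJavaOptions and spark.executor.extraJavaOptions so the local run and the cluster run resolve timestamps the same way.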
Upvotes: 4