Zhijun Liu

Reputation: 61

Spark SQL result different from Hive SQL result

I ran the SQL statement select cityroaddis from trip_db.tripTable where tripid='a0001' and day>'2020-09-09' in both the Hive shell and the Spark shell, but got completely different results.

The two results:

Hive:  cityroaddis = 0.0
Spark: cityroaddis = null


Has anybody had such a problem before?

Upvotes: 2

Views: 1990

Answers (1)

Zhijun Liu

Reputation: 61

The problem was solved after I added these two configurations.

spark.sql.hive.convertMetastoreOrc=false
spark.sql.hive.convertMetastoreParquet=false
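For example, if you start Spark from the command line, the two settings can be passed as --conf flags (a sketch assuming a standard spark-shell invocation; the same keys work with spark-submit or in spark-defaults.conf):

```shell
# Disable Spark's built-in Parquet/ORC readers for Hive metastore tables,
# so Spark falls back to the Hive SerDe and its results match Hive's.
spark-shell \
  --conf spark.sql.hive.convertMetastoreOrc=false \
  --conf spark.sql.hive.convertMetastoreParquet=false
```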

spark.sql.hive.convertMetastoreParquet: when reading from and writing to Hive metastore Parquet tables, Spark SQL tries to use its own Parquet support instead of the Hive SerDe for better performance. This behavior is turned on by default.

spark.sql.hive.convertMetastoreOrc: the analogous setting for ORC, controlling whether Spark uses its built-in ORC reader/writer instead of the Hive SerDe when accessing Hive metastore ORC tables. Setting both to false forces Spark to read the tables through the Hive SerDe, which is why its results then agree with Hive's.

Upvotes: 4
