Select count(*) issue with hive and spark

Question

I get the correct count after I run the ANALYZE statement. But my problem is, it needs to be run every time of the count is updated. Technically I should be able to update the count for the same partition.

But it returns the same count if I don't execute the ANALYZE statement.

This is the query I execute for the count to be updated.

ANALYZE TABLE bi_events_identification_carrier_sam PARTITION(year, month, day) COMPUTE STATISTICS;

And executing is not convenient at all. any ideas?

Select count(*) issue with hive and spark

Answers (1)

Related Questions