BigQuery clustering on geography and other columns

Question

I've got a couple hundred million records across the US clustered on a state/county FIPS code column (3K separate values) followed by a Geography column. Until I added the clustering, spatial joins were timing out after 8 hours, while now they run in a couple minutes. Now I read that queries are supposed to include all clustered fields in order, or the clustering won't provide any benefit. My joins with ST_INTERSECTS only make use of geometries and don't include FIPS code. Can anyone explain why I'm seeing a clustering benefit, even though my queries are not using the clustered fields in clustered field order? Could it be that a geography column added to a BigQuery table's clustered columns can be inserted in any order, and that queries only need to use the order of the non-geography columns to reap benefits?

BigQuery clustering on geography and other columns

Answers (1)

Related Questions