Reputation: 153
I have an image that performs a query on BigQuery and then downloads the result (about 8GB) with the following code:
query_job = client.query(my_sql_query)
query_result = query_job.result()
data_table = query_result.to_arrow(progress_bar_type='tqdm')
When I run the code locally the download speed is OK (100 mbs, basically my computer actual bandwidth). The problem is that when I run It on GKE cluster the download is very slow (With tqdm I can see that it will be about 2 hours).
I checked the bandwidth of my pod with the python package speedtest-cli and it is near 500 mbs.
Upvotes: 2
Views: 714
Reputation: 153
The problem was that I wasn't installed the package google-cloud-bigquery-storage inside the pod so I was using the google-cloud-bigquery package to perform the download.
Upvotes: 3