user374374

Reputation: 343

Loading data to BigQuery using python API vs bq load

I have a new requirement: a GCS bucket holds 130+ files, and these files need to be loaded into different BigQuery tables on a daily basis.

After researching, I found two options.

1) Use the "bq load" command (from a shell script or Python script)

2) Write a Python script that uses the BigQuery API to load the data

Which option is best? If I go with the Python API, I need to use App Engine to schedule it.

Is there any better option than these?

Thanks,

Upvotes: 1

Views: 997

Answers (2)

Cristian Avendaño

Reputation: 477

There are a few differences:

a) bq load:

- You can run into issues when using special characters such as ^ and | as delimiters.

- You don't need a service account (you can use a user account).

- You can't use it in Google Cloud Functions.

b) API:

- You don't have the special-character trouble.

- You can use it in Google Cloud Functions.

- If you write a Python script, you can schedule it as a Scheduled Task (on Windows).
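If the delimiter trouble with bq load comes from shell quoting (an assumption about where it originates), one way around it is to call bq from Python with the arguments passed as a list instead of a single shell string. A minimal sketch, assuming the bq CLI is installed and authenticated; the dataset, table, and bucket names are hypothetical:

```python
import subprocess


def build_bq_load_cmd(table, uri, delimiter=",", skip_rows=1):
    """Build the argument list for a `bq load` call on a CSV file in GCS."""
    return [
        "bq", "load",
        "--source_format=CSV",
        f"--field_delimiter={delimiter}",
        f"--skip_leading_rows={skip_rows}",
        "--autodetect",  # let BigQuery infer the schema
        table,
        uri,
    ]


def run_bq_load(table, uri, delimiter=","):
    # No shell is involved, so characters like ^ and | reach bq unchanged.
    subprocess.run(build_bq_load_cmd(table, uri, delimiter), check=True)


if __name__ == "__main__":
    # Hypothetical names, for illustration only; this just prints the command.
    print(build_bq_load_cmd("my_dataset.sales", "gs://my-bucket/sales.csv", "|"))
```

Calling run_bq_load once per file (for example in a loop over the 130+ files) would issue one load per table.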

Upvotes: 0

Tim Swena

Reputation: 14786

However you do it, you'll be creating load jobs. So from the BigQuery side of things, it doesn't really matter which option you choose.
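To illustrate, a load job submitted through the Python client might look like the sketch below. It assumes the google-cloud-bigquery package is installed and default credentials are available. The project, dataset, and bucket names, the rule that maps a file name to a table name, and the choice to overwrite each table daily are all hypothetical.

```python
import posixpath


def table_id_for(uri, project, dataset):
    """Derive a table ID from a GCS URI: .../sales.csv -> project.dataset.sales."""
    stem = posixpath.splitext(posixpath.basename(uri))[0]
    return f"{project}.{dataset}.{stem}"


def load_files(uris, project, dataset, delimiter=","):
    # Imported here so the helper above works without the client library.
    from google.cloud import bigquery

    client = bigquery.Client(project=project)
    job_config = bigquery.LoadJobConfig(
        source_format=bigquery.SourceFormat.CSV,
        field_delimiter=delimiter,
        skip_leading_rows=1,
        autodetect=True,
        # Assumption: each daily file replaces the table contents.
        write_disposition=bigquery.WriteDisposition.WRITE_TRUNCATE,
    )
    # Start every load job first, then wait, so the jobs run concurrently.
    jobs = [
        client.load_table_from_uri(
            uri, table_id_for(uri, project, dataset), job_config=job_config
        )
        for uri in uris
    ]
    for job in jobs:
        job.result()  # Blocks until the job finishes; raises on failure.
```

A call such as load_files(["gs://my-bucket/daily/sales.csv"], "my-project", "my_dataset", delimiter="|") would start one load job per file, the same kind of job that bq load creates.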

As far as scheduling goes, you do have some options on Google Cloud Platform:

Upvotes: 1
