Boto doubles all get request cycles by design

Question

To get an object out of S3 using Boto, you have to call something like (at least this is the only documented way I can find):

key = bucket.get_key(some_id)
data = key.get_contents_as_string()

However, get_key does a HEAD, then get_contents_as_string does a GET. This is painfully inefficient. I know the things I'm requesting from S3 will almost always exist (eventual consistency aside, my ratio of there to almost there is enormous; millions:1). Is there some way I can bypass the apparent boto requirement of calling get_key and just jump straight to "get me the data associated with this id"?

Derek Litz · Accepted Answer

The API shows the class used to instantiate a Key object directly.

key = Key(bucket=bucket, name=my_key_id)
data = key.get_contents_as_string()

In my usage of boto so far I've found the docs a useful introduction but very incomplete compared to the generated API documentation.

Boto doubles all get request cycles by design

Answers (2)

Related Questions