Derrops

Reputation: 8117

When would you want to make S3 object keys similar?

S3 uses the object key when partitioning data, and the guidance is to introduce some randomness into your keys to distribute workloads across multiple partitions. My question is: are there any scenarios in which you would want to have similar keys? And if not, why would AWS use the key to partition your data instead of randomly partitioning the data itself?

I ask this because it strikes me as an odd design: it makes it easy for developers to make partitioning mistakes when they generate keys that follow a pattern, and it discourages developers from creating keys in a logical manner, since doing so would inevitably produce a pattern and cause the data to be partitioned poorly.
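For concreteness, the kind of "logical" key scheme the question describes might look like the following minimal sketch (the key layout and names are hypothetical, purely for illustration):

```python
from datetime import datetime, timezone

def logical_key(user_id: str, log_name: str) -> str:
    """A 'logical', human-browsable key. Because the date comes first,
    every key written on the same day shares one prefix: exactly the
    kind of pattern the question is concerned about."""
    day = datetime.now(timezone.utc).strftime("%Y/%m/%d")
    return f"{day}/{user_id}/{log_name}"

print(logical_key("user-42", "app.log"))  # e.g. 2018/05/14/user-42/app.log
```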

Upvotes: 0

Views: 200

Answers (2)

Michael - sqlbot

Reputation: 179054

S3 uses the object key when partitioning data

Wait. Your question seems premised on this assumption, but it isn't correct.

S3 does not use the object key to partition the data. That would indeed, as you suggest, be a very "odd design" (or worse).

S3 uses the object key to partition the index of objects in the bucket. Otherwise, the index would be stored in an order that would not support enumerating the object keys in sorted order, which would also eliminate the ability to list objects by prefix or to identify common prefixes using delimiters. The alternative would be a secondary index, which would just compound the potential scaling issue and move the same problem down one level.

The case for similar keys is when you want to find objects with a common prefix (in the same "folder") on demand. Storing log files is an easy example: yyyy/mm/dd/.... Note that when various services store log files in buckets for you (S3 logs, CloudFront, ELB), the object keys are sequential like this, because the date and time are part of the object key.
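A minimal sketch of listing by common prefix with boto3 (the bucket name and date-based key layout are assumptions for illustration):

```python
import boto3

s3 = boto3.client("s3")

# List all log objects for a given day by prefix; the yyyy/mm/dd/...
# key layout keeps related objects under one common prefix.
response = s3.list_objects_v2(
    Bucket="example-log-bucket",  # hypothetical bucket name
    Prefix="2018/05/14/",
)
for obj in response.get("Contents", []):
    print(obj["Key"])

# Delimiter="/" groups keys by the next path segment, returning the
# common prefixes ("subfolders") instead of every individual object.
response = s3.list_objects_v2(
    Bucket="example-log-bucket",
    Prefix="2018/05/",
    Delimiter="/",
)
for cp in response.get("CommonPrefixes", []):
    print(cp["Prefix"])
```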

When S3 does a partition split, only the index is split. The data is already durably stored and doesn't move. The potential performance considerations are related to the performance of the index, not that of the actual storage of the object data.

Upvotes: 1

John Rotenstein

Reputation: 269282

You appear to be referring to Request Rate and Performance Considerations - Amazon Simple Storage Service, which states:

The Amazon S3 best practice guidelines in this topic apply only if you are routinely processing 100 or more requests per second. If your typical workload involves only occasional bursts of 100 requests per second and fewer than 800 requests per second, you don't need to follow these guidelines.

This is unlikely to affect most applications, but if applications do have such high traffic, then spreading requests across the keyname space can improve performance.
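One common way to spread requests across the keyname space, in the spirit of those guidelines, is to prepend a short hash to each key. A minimal sketch (the key format is an assumption, not the only approach):

```python
import hashlib

def spread_key(object_id: str) -> str:
    """Prefix the key with the first four hex characters of its SHA-1
    hash so heavy request traffic fans out across the keyname space
    rather than concentrating on one narrow index partition."""
    prefix = hashlib.sha1(object_id.encode("utf-8")).hexdigest()[:4]
    return f"{prefix}-{object_id}"

# Sequential IDs land under well-distributed prefixes:
for i in range(3):
    print(spread_key(f"image-{i:06d}.jpg"))
```

The trade-off, as the question notes, is that hashed keys lose the convenient prefix-based browsing that date-ordered keys provide.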

AWS has not explained why they have designed Amazon S3 in this manner.

Upvotes: 1
