Reputation: 11
I have built a file-processing pipeline in which a file, once added to an S3 bucket, triggers 5 to 6 different Lambdas. Each Lambda downloads the file and does some processing on it.
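For context, each Lambda handler currently looks roughly like this (a minimal sketch assuming boto3; the per-function processing step is a placeholder):

```python
import os
import boto3

s3 = boto3.client("s3")

def handler(event, context):
    # Each S3 event record names the bucket and object key that triggered us.
    record = event["Records"][0]
    bucket = record["s3"]["bucket"]["name"]
    key = record["s3"]["object"]["key"]

    # Download the object to Lambda's local /tmp storage. This GET is what
    # each of the 5-6 functions repeats for the same file.
    local_path = os.path.join("/tmp", os.path.basename(key))
    s3.download_file(bucket, key, local_path)

    process(local_path)  # placeholder for this function's processing step

def process(path):
    ...
```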
Here is the problem: downloading the file in each Lambda accounts for 50% of the total S3 cost incurred. Is there any way I can store the file in a cache, download it from there into the Lambdas, and delete it from the cache once processing has completed?
Some pointers: the processes must run simultaneously and cannot be combined into a single Lambda.
The Lambdas are in the same region as the S3 bucket. In the previous month alone, we had a total of 250 million GET requests on objects in the bucket.
Upvotes: 0
Views: 51
Reputation: 681
I would suggest a cleaner approach: segregate the functionality and use the Step Functions offering from AWS to orchestrate the flow of execution. You can leverage different states during this orchestration, and if you want to use the output of a child Lambda, the next Lambda can receive it as well.
Please explore this option rather than waiting for a child Lambda's response inside another Lambda.
Official Documentation: https://aws.amazon.com/step-functions/?step-functions.sort-by=item.additionalFields.postDateTime&step-functions.sort-order=desc
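As an illustration, a Parallel state runs all of its branches concurrently, which matches your "must run simultaneously" requirement. Here is a minimal sketch using boto3 (the function ARNs, account ID, region, role, and state machine name are all placeholders):

```python
import json
import boto3

sfn = boto3.client("stepfunctions")

# A Parallel state starts every branch at the same time; each branch here
# is a one-state machine that invokes one of the processing Lambdas.
definition = {
    "StartAt": "FanOut",
    "States": {
        "FanOut": {
            "Type": "Parallel",
            "End": True,
            "Branches": [
                {
                    "StartAt": f"Process{i}",
                    "States": {
                        f"Process{i}": {
                            "Type": "Task",
                            # Placeholder Lambda ARNs for the 6 processors.
                            "Resource": f"arn:aws:lambda:us-east-1:123456789012:function:process-{i}",
                            "End": True,
                        }
                    },
                }
                for i in range(1, 7)
            ],
        }
    },
}

sfn.create_state_machine(
    name="file-processing-pipeline",  # placeholder name
    definition=json.dumps(definition),
    roleArn="arn:aws:iam::123456789012:role/sfn-role",  # placeholder role
)
```

The Parallel state collects each branch's result into a single output array, so a downstream state can consume the child Lambdas' outputs directly.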
Thanks!!
Upvotes: 0