Identify new objects in Amazon S3 at regular intervals

Question

I have logs that are added to s3 bucket from various sources. I want to be able to read those logs base on interval, for example every 5 mins. However, I don't want to scan all objects again, instead I will just need to get all of the new objects added since the last time my process ran. (In this case 5 mins ago)

For now, I solved this using s3 event. When there is a new file added to s3 it triggers lambda and saves the object name on dynamodb. Then, a cron job reads all the contents of that table in dynamodb, process it and deletes right after.

I feel like its an overhead. I just want call it directly from s3 using some sort of delta. I was wondering if this is supported.

Identify new objects in Amazon S3 at regular intervals

Answers (1)

Related Questions