John Smith

Reputation: 495

MongoDB Randomly Aggregate Documents (Unique Results)

I've read that one can use db.collection.aggregate with $sample to get random documents from a collection. But I've also read that $sample is NOT 100% reliable (it can return duplicates), so I wrote this query:

db.blog.aggregate([
   { "$sample": { "size": 100 } },
   { "$group": { "_id": "$post_id", "post": { "$push": "$$ROOT" } } }
])

Yes, I am attempting to group, but the issue is that in a loop it becomes more complicated than it should be, i.e. when iterating over the results from MongoDB.
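For example (a minimal sketch in the mongo shell, reusing the pipeline above), each result wraps the actual document inside a post array, so the loop has to unwrap it first:

db.blog.aggregate([
   { "$sample": { "size": 100 } },
   { "$group": { "_id": "$post_id", "post": { "$push": "$$ROOT" } } }
]).forEach(function (doc) {
   // the actual blog document is nested one level down
   var post = doc.post[0];
   printjson(post);
});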

Any suggestions are appreciated, thanks in advance.

EDIT: I want to know whether grouping is necessary to get unique results, or whether there is a better way of doing this. It does NOT make sense to have to use $group just for aggregate to return several random documents from MongoDB that are unique and not duplicates.

YES, I did set a unique index on the ID in the actual collection.

Upvotes: 1

Views: 1879

Answers (2)

Rajat Goel

Reputation: 2305

If you have a unique index on the post_id field, then there is no need for a $group operation after sampling.

Refer: https://docs.mongodb.com/manual/core/read-isolation-consistency-recency/#faq-developers-isolate-cursors
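In that case (a minimal sketch in the mongo shell, assuming a blog collection with a unique index on post_id) the pipeline reduces to the $sample stage alone, and each result is already a plain document:

db.blog.aggregate([
   { "$sample": { "size": 100 } }
]).forEach(function (post) {
   // each result is an ordinary blog document, no unwrapping needed
   printjson(post.post_id);
});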

Upvotes: 4

Tom Slabbaert

Reputation: 22296

OK, let's begin by clarifying the $sample uniqueness issue, as it's not as straightforward as you might think.

First, let's look at the $sample conditions as specified in the docs:

  1. $sample is the first stage of the pipeline

  2. N is less than 5% of the total documents in the collection

  3. The collection contains more than 100 documents

If any of these conditions are not met, Mongo will perform a collection scan followed by a random sort to select the N documents (in this case no duplicates can occur).

Assuming these conditions ARE met, duplicate ids can occur because of the lack of cursor isolation: if update/delete operations run on the collection while the pseudo-random cursor is being read, the same document may be returned more than once.

So assuming you're in this final case and your collection is being updated while you're querying it, grouping is your best shot if you want to be 100% sure that no duplicates are returned. (With that said, grouping 100 documents is a small enough overhead not to worry about.)
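For completeness, here is a sketch of such a de-duplicating pipeline (using $first and $replaceRoot, the latter available from MongoDB 3.4; not necessarily the only way to write it) that also avoids the nested post array when looping:

db.blog.aggregate([
   { "$sample": { "size": 100 } },
   // keep a single document per post_id in case $sample returned a duplicate
   { "$group": { "_id": "$post_id", "post": { "$first": "$$ROOT" } } },
   // promote the stored document back to the top level so each result
   // looks like an ordinary blog document again
   { "$replaceRoot": { "newRoot": "$post" } }
])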

Upvotes: 2
