Reputation: 679
I have over 10 million records in my mongo collection that I want to move to some other database.
There are two ways I could achieve that:
Batching data with find
const batchSize = 1000;
const collection = mongo.client.collection('test');
const count = await collection.count();
let iter = 0;
while (iter * batchSize <= count) {
  const dataArr = await collection.find({})
    .sort({ _id: -1 })
    .limit(batchSize)
    .skip(iter * batchSize)
    .toArray();
  // ... write dataArr to the other database here ...
  iter += 1;
}
Using a mongo cursor
// Runs inside a generator (e.g. with co), so each yield awaits the promise.
const cursor = collection.find({}).sort({ _id: -1 });
const batchSize = 1000;
let done = 0;
while (yield cursor.hasNext()) {
  const ids = [];
  for (let i = 0; i < batchSize; i += 1) {
    if (yield cursor.hasNext()) {
      ids.push((yield cursor.next())._id);
    }
  }
  // ... copy the documents for this batch of ids here ...
  done += batchSize;
}
In the first method, I am making a single request for every 1000 documents whereas in the second one I am making 2 requests for every single document. Which is the better method in terms of speed and computation?
Upvotes: 1
Views: 3249
Reputation: 671
The first method is better because, as you said, you are making just one call per 1000 documents, which saves all the network round trips that fetching documents one by one would generate. The second method would spend far more time on the network since it fetches documents individually.
Some tips:
It is never a good idea to use skip in mongo queries because, according to the MongoDB documentation:
The cursor.skip() method requires the server to scan from the beginning of the input results set before beginning to return results. As the offset increases, cursor.skip() will become slower.
Set the batch size to something just below 16MB divided by the average size of your documents. MongoDB has a 16MB limit on the response size, so this minimizes the number of calls you make.
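For illustration, a rough sketch of that calculation, assuming a connected db handle for the source database (the collection name 'test' is taken from the question); avgObjSize is the average document size in bytes reported by the collStats command:

// Derive a batch size from the average document size (sketch, not production code).
const stats = await db.command({ collStats: 'test' });   // avgObjSize is in bytes
const maxResponseBytes = 16 * 1024 * 1024;                // MongoDB's 16MB response limit
// Leave some headroom so a batch stays comfortably under the limit.
const batchSize = Math.max(1, Math.floor((maxResponseBytes * 0.9) / stats.avgObjSize));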
Store the _id values at the interval boundaries and use those ids to create range conditions. Then you can remove the sort, limit and skip calls. This will make a huge impact on performance.
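For illustration, a minimal sketch of one common way to apply that idea (keyset pagination on _id), assuming the same collection handle as in the question. It keeps a cheap sort and limit on the indexed _id but drops skip entirely; the write to the target database is left as a placeholder:

// Keyset-style batching: remember the last _id seen and filter on it
// instead of skipping past already-read documents.
const batchSize = 1000;
let lastId = null;
while (true) {
  const query = lastId === null ? {} : { _id: { $gt: lastId } };
  const batch = await collection.find(query)
    .sort({ _id: 1 })      // walk the _id index in ascending order
    .limit(batchSize)
    .toArray();
  if (batch.length === 0) break;
  lastId = batch[batch.length - 1]._id;
  // ... insert this batch into the target database here ...
}

Because _id is always indexed, each batch is a bounded index scan no matter how deep into the collection it is, whereas skip has to walk past every previously read document.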
Upvotes: 6