mongodb indexes covering missing values

Question

I'd like to perform efficient operations of this form with mongodb:

db.getCollection('x').find({a:{$ne:null}})

My understanding is that an index on a will not include documents which are missing the field a. So queries of the form {a:{$ne:null}} need to scan for those documents (i.e. can't rely solely on the index to find all the matching documents).

I'm considering a mongo feature request (if one hasn't been submitted already) to allow indices to optionally include documents with missing values. I'm wondering:

In the current mongo release, is it possible to speed up the above query somehow? Note that simply always adding a value for that field is good answer, but that's not possible in my case.
Is this a sensible mongo feature request? I don't know much about how indices are implemented, but from what I do know it seems like this should be possible (even though it's not desirable for all indices - just for some, at the programmer's discretion).

I know there are a lot of questions here about indices and "null" (the null value versus a missing value, etc), but I spent a bit of time and couldn't find a direct answer to this question.

As a real example, I have a collection with ~80 million documents. About 1,000 of those documents are missing the field a. I'd like to be able to iterate over those documents that are missing a (in any order). One workaround is to make sure they're never missing a and just set it to -1 or some other particular value. That seems a bit silly to me - there should be a way to have mongo do that for me under the hood.

mongodb indexes covering missing values

Answers (1)

Related Questions