Why does LINQ not cache enumerations?

Question

My understanding is that LINQ does not execute everything immediately; it simply stores the information it needs to get at the data. So if you call Where, nothing actually happens to the list; you just get an IEnumerable that has the information it needs to produce the list.

One can 'collapse' this information to an actual list by calling ToList.
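A minimal sketch of that behaviour, assuming a counter is enough to show when the predicate runs. The names DeferredDemo, CountPredicateCalls and calls are illustrative and not part of LINQ:

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

public static class DeferredDemo
{
    // Returns the number of predicate calls observed after each step.
    public static int[] CountPredicateCalls()
    {
        int calls = 0; // illustrative counter

        // Building the query only stores the recipe; the predicate has not run.
        IEnumerable<int> query = Enumerable.Range(1, 3).Where(i => { calls++; return true; });
        int afterBuild = calls;

        // ToList enumerates the query once and stores the results.
        List<int> materialized = query.ToList();
        int afterToList = calls;

        // Reading the list does not run the predicate again.
        _ = materialized.Sum();
        int afterListRead = calls;

        return new[] { afterBuild, afterToList, afterListRead };
    }

    public static void Main()
    {
        Console.WriteLine(string.Join(", ", CountPredicateCalls())); // 0, 3, 3
    }
}
```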

Now I am wondering, why would the LINQ team implement it like this? It is pretty easy to add a List at each step (or a Dictionary) to cache the results that have already been calculated, so I guess there must be a good reason.

This can be checked by this code:

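using System;
using System.Linq;

// Where is lazy: this only builds the query, nothing is printed yet.
var list = Enumerable.Range(1, 10).Where(i => {
    Console.WriteLine("Enumerating: " + i);
    return true;
});

// All has to visit every element because the predicate never fails.
var list2 = list.All(i => {
    return true;
});

// Any also has to visit every element because the predicate never matches.
var list3 = list.Any(i => {
    return false;
});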

If there were a cache, each "Enumerating: i" line would be printed only once, because the second enumeration would get the items from the cache. Instead, every number is printed twice.

Edit: Additional question: why does LINQ not include a cache option, such as a .Cache() method that caches the result of the previous enumerable?

TomTom · Accepted Answer

Because it makes no sense, and if you thought about all the cases where it makes no sense, you would not ask. This is not so much a "does it sometimes make sense" question as an "are there side effects that make it bad" question. Next time you evaluate something like this, think about the negatives:

Memory consumption goes up as you HAVE to cache the results, even if not wanted.
On the next run, the results may be different because the incoming data may have changed. Your simplistic example (Enumerable.Range) has no issue with that, but a filtered list of customers may have been updated in the meantime.
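The second point can be illustrated with a small sketch. The customer names and the LiveQueryDemo class are hypothetical stand-ins for real customer data; the idea is only that a deferred query sees later changes to its source, while a buffered copy does not:

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

public static class LiveQueryDemo
{
    // Returns what the live query and an earlier snapshot contain after the source changes.
    public static (string Live, string Snapshot) Run()
    {
        // Hypothetical data: names stand in for customer records.
        var customers = new List<string> { "Alice", "Bob" };

        IEnumerable<string> startsWithA =
            customers.Where(name => name.StartsWith("A", StringComparison.Ordinal));

        // Explicit buffer taken before the source changes.
        List<string> snapshot = startsWithA.ToList();

        // The source is updated after the query was defined.
        customers.Add("Anna");

        // The deferred query is re-evaluated here and sees the new customer.
        return (string.Join(", ", startsWithA), string.Join(", ", snapshot));
    }

    public static void Main()
    {
        var result = Run();
        Console.WriteLine(result.Live);     // Alice, Anna
        Console.WriteLine(result.Snapshot); // Alice
    }
}
```

A built-in cache would make every query behave like the snapshot, which is stale as soon as the source changes.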

Stuff like that makes it very hard to sensibly take the choice away from the developer. Want a buffer? Make one (easily). But the side effects of a built-in cache would be bad.
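As a rough sketch of making such a buffer yourself, the question's example can opt in with ToList. The BufferDemo class and the buffer flag are illustrative assumptions, and ToList is only one way to buffer (ToArray would work similarly):

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

public static class BufferDemo
{
    // Returns how many times the Where predicate ran across one All and one Any call.
    public static int CountPredicateCalls(bool buffer)
    {
        int calls = 0; // illustrative counter
        IEnumerable<int> numbers = Enumerable.Range(1, 10).Where(i => { calls++; return true; });

        // Opt in to a buffer: ToList runs the query once and keeps the results in memory.
        if (buffer)
        {
            numbers = numbers.ToList();
        }

        _ = numbers.All(i => true);  // visits every element
        _ = numbers.Any(i => false); // visits every element, finds no match
        return calls;
    }

    public static void Main()
    {
        Console.WriteLine(CountPredicateCalls(buffer: false)); // 20
        Console.WriteLine(CountPredicateCalls(buffer: true));  // 10
    }
}
```

The caller decides whether the memory cost and the frozen results are acceptable, which is the choice the answer argues should stay with the developer.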
