How does the compiler/runtime determine a lambda expression's type?

Question

I was wondering how the the compiler/runtime determines a lambda expression's type?

For example, the following System.Linq Select extension method (not select query)

//var recordIds = new List(records.Select(r => r?.RecordId ?? 0));
//var recordIds = new List(records.Where(r => r != null).Select(r => r.RecordId));
var recordIds = new List(records.Select(r => r.RecordId));

is defined as

Enumerable.Select Method (IEnumerable, Func)

and so takes the lambda r => r.RecordId as a Func.

How is the lambda's type determined, and once it is, is it simply cast to that type?

Eric Lippert · Accepted Answer

I was wondering how the the compiler/runtime determines a lambda expression's type?

It is rather complicated. Implementing this feature was the "long pole" for the version of Visual Studio that shipped C# 3 -- so every day that I was over schedule on this was a day that VS would slip! -- and the feature has only gotten more complicated with the introduction of covariance and contravariance in C# 4.

As others have noted, consult the specification for exact details. You might also read the various articles I've written about it over the years.

I can give you a quick overview though. Suppose we have

records.Select(r => r.RecordId)

where records is of type IEnumerable and RecordId is of type int.

The first question is "does IEnumerable have any applicable method called Select? No, it does not. We therefore go to the extension method round.

The second question then is: "is there any static type with an accessible extension method that is applicable to this call?"

SomeType.Select(records, r => r.RecordId)

Let's suppose that Enumerable is the only such type. It has two versions of Select, and one of them takes a lambda that takes two parameters. We can automatically discard that one. That leaves us with, as you note:

static IEnumerable Select(IEnumerable, Func)

Third question: Can we deduce the type arguments corresponding to type parameters A and R?

In the first round of type inference we consider all the non-lambda arguments. We only have one. We deduce that A could be Record. However, IEnumerable is covariant, so we make a note that it could be a type more general than Record as well. It cannot be a type that is more specific than Record though.

Now we ask "are we done?" No. We still don't know R.

"Are there any inferences left to make?" Yes. We haven't checked the lambda yet.

"Are there any contradictions or additional facts to know about A?" Nope.

We therefore "fix" A to Record and move on. What do we know so far? We have:

static IEnumerable Select(IEnumerable, Func)

We then say OK, the argument must be (Record r) => r.RecordId. Can we infer the return type of this lambda knowing that? Plainly yes, it is int. So we put a note on R saying that it could be int.

Are we done? Yes. Is there anything else we can make a deduction about? No. Did we infer all the type parameters? Yes. So we "fix" R to int, and we're done.

Now we do a final round that checks to make sure that Select does not produce any errors; for example, if Select had a generic constraint that was violated by we would reject it at this point.

We've deduced that records.Select(r=>r.RecordId) means the same thing as Enumerable.Select(records, (Record r) => { return (int)r.RecordId; } ) and that's a legal call to that method, so we're done.

Things get rather more complicated when you introduce multiple bounds. See if you can figure out how something like a Join works where there are four type parameters to infer.

How does the compiler/runtime determine a lambda expression's type?

Answers (2)

Related Questions

How does the compiler/runtime determine a lambda expression&#39;s type?

Answers (2)

Related Questions

How does the compiler/runtime determine a lambda expression's type?