Iterating through IEnumerable<string> causing serious performance issue

Question

I have no idea what happened to the performance of my foreach loop when I tried to iterate through an IEnumerable<string>.

The following code causes a serious performance issue:

foreach (IEdge ed in edcol)
{
    IEnumerable<string> row =
        from r in dtRow.AsEnumerable()
        where (((r.Field<string>("F1") == ed.Vertex1.Name) &&
                (r.Field<string>("F2") == ed.Vertex2.Name))
            || ((r.Field<string>("F1") == ed.Vertex2.Name) &&
                (r.Field<string>("F2") == ed.Vertex1.Name)))
        select r.Field<string>("EdgeId");
    int co = row.Count();
    //foreach (string s in row)
    //{

    //}
    x++;
}

The outer foreach (IEdge ed in edcol) loop has about 11,000 iterations to complete. It runs in a fraction of a second if I remove the line

int co = row.Count();

from the code.

row.Count() returns at most 10 in every iteration.

If I uncomment the

//foreach (string s in row)
//{

//}

it takes about 10 minutes to complete the execution of the code.

Does the IEnumerable type really have such serious performance issues?
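
The timing difference described above is consistent with LINQ's deferred execution: building the query does no work, and each call to Count() or each foreach over it re-runs the filter against every source row. The following is a minimal sketch of that behaviour, not the code from the question. It assumes a plain in-memory sequence as a stand-in for dtRow.AsEnumerable(), and the class and method names are hypothetical.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

public static class DeferredExecutionDemo
{
    // Returns how many times the filter ran after the query was enumerated
    // `enumerations` times over `rowCount` stand-in rows.
    public static int CountPredicateCalls(int rowCount, int enumerations)
    {
        int calls = 0;

        // Assumption: a plain list standing in for dtRow.AsEnumerable().
        IEnumerable<int> rows = Enumerable.Range(0, rowCount).ToList();

        // Building the query runs nothing yet (deferred execution).
        IEnumerable<int> query = rows.Where(r => { calls++; return r % 2 == 0; });

        for (int i = 0; i < enumerations; i++)
        {
            // Count() walks the whole source and runs the filter on every row.
            query.Count();
        }

        return calls;
    }
}
```

With zero enumerations the filter never runs, which matches the loop finishing almost instantly once row.Count() is removed. Every enumeration costs one full pass over the source, so in the question's loop each of the roughly 11,000 iterations scans all of dtRow once for Count() and once more for the inner foreach.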

Jon Skeet · Accepted Answer

This answer is for the implicit question of "how do I make this much faster"? Apologies if that's not actually what you were after, but...

You can go through the rows once, grouping by the names. (I haven't done the ordering like Marc has - I'm just looking up twice when querying :)

var lookup = dtRow.AsEnumerable()
                  .ToLookup(r => new { F1 = r.Field<string>("F1"),
                                       F2 = r.Field<string>("F2") });

Then:

foreach (IEdge ed in edcol)
{
    // Need to check both ways round...
    var first = new { F1 = ed.Vertex1.Name, F2 = ed.Vertex2.Name };
    var second = new { F1 = ed.Vertex2.Name, F2 = ed.Vertex1.Name };
    var firstResult = lookup[first];
    var secondResult = lookup[second];

    // Due to the way Lookup works, this is quick - much quicker than
    // calling query.Count()
    var count = firstResult.Count() + secondResult.Count();

    var query = firstResult.Concat(secondResult);

    foreach (var row in query)
    {
        ...
    }
}
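
The ordering alternative mentioned above (normalizing each pair of names so that only one lookup is needed per edge) could look roughly like the sketch below. This is not Marc's actual code. It assumes rows are represented as simple (F1, F2, EdgeId) string tuples rather than DataRow objects, and the names EdgeLookupSketch, NormalizedKey and BuildLookup are hypothetical.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

public static class EdgeLookupSketch
{
    // Puts the two names in a fixed (ordinal) order so that (a, b) and (b, a)
    // produce the same key.
    public static (string, string) NormalizedKey(string a, string b) =>
        string.CompareOrdinal(a, b) <= 0 ? (a, b) : (b, a);

    // Builds the lookup in a single pass over the rows.
    // Assumption: `rows` is a stand-in for the DataTable rows, carrying
    // F1, F2 and EdgeId as strings.
    public static ILookup<(string, string), string> BuildLookup(
        IEnumerable<(string F1, string F2, string EdgeId)> rows) =>
        rows.ToLookup(r => NormalizedKey(r.F1, r.F2), r => r.EdgeId);
}
```

Inside the edge loop, a single lookup[EdgeLookupSketch.NormalizedKey(ed.Vertex1.Name, ed.Vertex2.Name)] would then return the edge IDs for both orientations. The trade-off is a string comparison per key in exchange for one lookup instead of two.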
