Fastest way of updating items of one list from another using StartsWith

Question

I have a scenario where I need to update a few items in one list based on data from another list. I have already gone through various questions here, but none helped.

Scenario

listA: Total Count around 88000

public class CDRs
{
    public string cld { get; set; }
    public string prefix2 { get; set; }
    public string country { get; set; }
    public string city { get; set; }
}

listB: Total Count 3000

public class RatesVM
{
    public string prefix { get; set; }
    public string Country { get; set; }
    public string City { get; set; }
}

In listB, several prefixes can match the cld field of a listA item.

For example, for listA.cld = "8801123232", the candidate prefixes in listB are:

880     BGD Proper
8801    BGD Mobile
88011   BGD Dhaka Mobile
88017   BGD Dhaka Mobile
88018   BGD Dhaka Mobile
88019   BGD Dhaka Mobile

I want the closest (longest) match, which in this case would be:

88011   BGD Dhaka Mobile

This is the approach I am following right now:

foreach (var x in listA)
{
    var tempObj = listB.FirstOrDefault(y => x.cld.StartsWith(y.prefix));
    if (tempObj != null)
    {
        x.prefix2 = tempObj.prefix;
        x.country = tempObj.Country;
        x.city = tempObj.City;
    }
    else
    {
        x.prefix2 = "InBound";
        x.country = "Unknown";
        x.city = "Unknown";
    }
}

It works, but it is slow: around 2-3 minutes for this case.

There are a few scenarios where listA will have around 1 million records, and I am worried it will take forever.

Many thanks in advance.

mjwills · Accepted Answer

I would suggest the code below. The key differences are orderedListB, which ensures that you get the most specific match possible (the longest prefixes are tried first), and a Dictionary that caches the result for each cld value already seen.

Dictionary<string, RatesVM> cache = new Dictionary<string, RatesVM>();
var orderedListB = listB.OrderByDescending(z => z.prefix.Length).ToList();

foreach (var x in listA)
{
    RatesVM cached;
    cache.TryGetValue(x.cld, out cached);
    var tempObj = cached ?? orderedListB.FirstOrDefault(z => x.cld.StartsWith(z.prefix));

    if (tempObj != null)
    {
        if (cached == null)
        {
            cache.Add(x.cld, tempObj);
        }

        x.prefix2 = tempObj.prefix;
        x.country = tempObj.Country;
        x.city = tempObj.City;
    }
    else
    {
        x.prefix2 = "InBound";
        x.country = "Unknown";
        x.city = "Unknown";
    }
}
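As a possible further step, the linear scan over orderedListB could be replaced by a dictionary keyed on prefix, probed with progressively shorter leading substrings of cld. The sketch below is only an illustration of that idea, not part of the code above: the PrefixIndex class and its FindLongest method are hypothetical names, and it assumes prefixes are compared character by character (ordinal), that empty prefixes can be ignored, and that the first entry wins when listB contains the same prefix twice.

```csharp
using System;
using System.Collections.Generic;

public class RatesVM
{
    public string prefix { get; set; }
    public string Country { get; set; }
    public string City { get; set; }
}

// Hypothetical helper (not from the answer above): indexes listB by prefix.
public sealed class PrefixIndex
{
    private readonly Dictionary<string, RatesVM> _byPrefix =
        new Dictionary<string, RatesVM>(StringComparer.Ordinal);
    private readonly int _maxLen;

    public PrefixIndex(IEnumerable<RatesVM> rates)
    {
        foreach (var r in rates)
        {
            // Assumption: skip empty prefixes; on duplicates, keep the first entry seen.
            if (string.IsNullOrEmpty(r.prefix) || _byPrefix.ContainsKey(r.prefix))
                continue;

            _byPrefix.Add(r.prefix, r);
            _maxLen = Math.Max(_maxLen, r.prefix.Length);
        }
    }

    // Returns the entry whose prefix is the longest one that cld starts with,
    // or null if no prefix matches.
    public RatesVM FindLongest(string cld)
    {
        if (string.IsNullOrEmpty(cld)) return null;

        // Try the longest possible candidate first, then shorten by one character.
        for (int len = Math.Min(_maxLen, cld.Length); len > 0; len--)
        {
            RatesVM hit;
            if (_byPrefix.TryGetValue(cld.Substring(0, len), out hit))
                return hit;
        }
        return null;
    }
}
```

With this in place, the index would be built once with new PrefixIndex(listB), and index.FindLongest(x.cld) would take the place of the FirstOrDefault call inside the loop. Each item then costs at most as many dictionary lookups as the longest prefix has characters, rather than a pass over all 3000 entries.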

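You may also want to consider using Parallel.ForEach rather than a plain foreach. Note that Dictionary is not safe for concurrent writes, so in that case the cache would need to become a ConcurrentDictionary (or be guarded by a lock).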
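A minimal sketch of what the parallel variant might look like is shown below. It is an illustration under stated assumptions rather than a tested drop-in: ParallelUpdater and Update are hypothetical names, the cache is swapped to ConcurrentDictionary, StringComparison.Ordinal is passed to StartsWith (an assumption that suits digit-only prefixes and avoids culture-sensitive comparison), and every element of listA is assumed to be a distinct object so that no two threads write to the same item.

```csharp
using System;
using System.Collections.Concurrent;
using System.Collections.Generic;
using System.Linq;
using System.Threading.Tasks;

public class CDRs
{
    public string cld { get; set; }
    public string prefix2 { get; set; }
    public string country { get; set; }
    public string city { get; set; }
}

public class RatesVM
{
    public string prefix { get; set; }
    public string Country { get; set; }
    public string City { get; set; }
}

// Hypothetical wrapper around the loop from the answer, run in parallel.
public static class ParallelUpdater
{
    public static void Update(List<CDRs> listA, List<RatesVM> listB)
    {
        // Longest prefixes first, so the first hit is the most specific one.
        var orderedListB = listB.OrderByDescending(z => z.prefix.Length).ToList();

        // Thread-safe cache keyed on cld; unlike the sequential version,
        // this also caches misses (null values).
        var cache = new ConcurrentDictionary<string, RatesVM>();

        Parallel.ForEach(listA, x =>
        {
            var tempObj = cache.GetOrAdd(
                x.cld,
                cld => orderedListB.FirstOrDefault(
                    z => cld.StartsWith(z.prefix, StringComparison.Ordinal)));

            if (tempObj != null)
            {
                x.prefix2 = tempObj.prefix;
                x.country = tempObj.Country;
                x.city = tempObj.City;
            }
            else
            {
                x.prefix2 = "InBound";
                x.country = "Unknown";
                x.city = "Unknown";
            }
        });
    }
}
```

Whether this is actually faster depends on the data and the machine, so it would be worth measuring against the sequential loop before adopting it.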
