What's the fastest way to find and delete duplicate nodes inside XML?

Question

The XML file has a structure like this:


    <Node>one</Node>
    <Node>two</Node>
    <Node>three</Node>
    <Node>three</Node>

Since the XML file has more than 30,000 nodes, I'm looking for the fastest way to find and delete duplicate nodes.

How would you do it?

Selman Genç · Accepted Answer

You could use a HashSet<string>:

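using System.Collections.Generic;
using System.Linq;
using System.Xml.Linq;

// Tracks every node value seen so far; Add returns false for a repeat.
var values = new HashSet<string>();
var xmlDocument = XDocument.Load("path");

// ToList() takes a snapshot so removing nodes does not disturb the enumeration.
foreach (var node in xmlDocument.Root.Elements("Node").ToList())
{
    if (!values.Add((string)node))
        node.Remove();
}

xmlDocument.Save("newpath");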

Another way is to implement an IEqualityComparer<XElement> for the XElement class and then use the Distinct method.
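
The comparer-based approach could look roughly like the sketch below. The names NodeValueComparer, XmlDeduplicator and RemoveDuplicateNodes are made up for illustration, and the sketch assumes that two nodes count as duplicates when their text values match and that all elements are named "Node", as in the HashSet example.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;
using System.Xml.Linq;

// Hypothetical comparer: two elements are equal when their text values match.
public sealed class NodeValueComparer : IEqualityComparer<XElement>
{
    public bool Equals(XElement x, XElement y)
    {
        if (ReferenceEquals(x, y)) return true;
        if (x is null || y is null) return false;
        return x.Value == y.Value;
    }

    // Must agree with Equals: equal values produce equal hash codes.
    public int GetHashCode(XElement obj) => obj.Value.GetHashCode();
}

public static class XmlDeduplicator
{
    // Returns a copy of the document in which only the first "Node"
    // element for each distinct value is kept. The input is not modified.
    public static XDocument RemoveDuplicateNodes(XDocument source)
    {
        var result = new XDocument(source);
        var root = result.Root;

        // Materialize the distinct elements before touching the tree.
        var distinct = root.Elements("Node")
                           .Distinct(new NodeValueComparer())
                           .ToList();

        // Detach every "Node" element, then re-attach the survivors.
        root.Elements("Node").Remove();
        root.Add(distinct);
        return result;
    }
}
```

Because the surviving nodes are re-added at the end of the root, this sketch only preserves the original layout when the root contains nothing but "Node" elements. Distinct also hashes every value internally, so it is unlikely to be faster than the HashSet loop; the main benefit is a reusable definition of what "duplicate" means.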
