Efficiency of node removal vba DOM

Question

I have a set of approximately 17,000 nodes. I want to select a random number, then remove the node with that random index number. At the moment what I have works. It is:

'some code to create a DOM as "xDoc",
'create a node object as "node",
'get total nodes as variable "total",
'get paths to a node as "xPath"

'Loop to reduce nodes to 1000
Do While total > 1000
    'some code to get a random number as variable "num"

    'initialize node object
    Set node = xDoc.SelectSingleNode(xPath & "[" & CStr(num) & "]")

    'remove the node
    node.ParentNode.RemoveChild node

    'keep the counter in step with the document so the loop can end
    total = total - 1

Loop

This takes too long: close to an hour and a half, and in the future my XML is going to grow exponentially. It runs and removes the nodes correctly, just sloooowwwwwwww. I was thinking there must be some way to build a node list from the random numbers, then select that final node list from the document and delete everything in it at once, which should be quicker. Does this make sense? Or maybe there’s a quicker, more efficient way than I’m thinking of?

I apologize my comment lines are not appearing as comments...I hope they don’t make things confusing.

Michael Kay · Accepted Answer

It's possible that most of the cost is in compiling the XPath expressions rather than actually deleting the nodes. Try an approach where you compile an XPath expression with a parameter once, and then execute it repeatedly with different parameters.
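One way to stop paying for an XPath evaluation on every pass from VBA is sketched below. Note that this is a different technique from the parameterised expression described above: it evaluates the path a single time, copies the matching nodes into an array, and picks the nodes to remove with a partial shuffle. It is only a sketch under assumptions. RemoveRandomNodes and keepCount are hypothetical names, xDoc is assumed to be an MSXML DOM document, and xPath is assumed to select every candidate node. Objects are declared As Object (late binding) so no particular MSXML reference is required.

```vba
' Sketch only: RemoveRandomNodes and keepCount are hypothetical names.
' Assumes xDoc is an MSXML DOM document and xPath selects all candidates.
Sub RemoveRandomNodes(xDoc As Object, xPath As String, keepCount As Long)
    Dim nodes As Object      ' node list returned by SelectNodes
    Dim n As Object
    Dim tmp As Object
    Dim pool() As Object
    Dim total As Long, i As Long, j As Long

    ' Evaluate the XPath once instead of once per removal
    Set nodes = xDoc.SelectNodes(xPath)
    total = nodes.Length
    If total <= keepCount Then Exit Sub

    ' Copy the node references into an array so each lookup is a plain index
    ReDim pool(0 To total - 1)
    i = 0
    For Each n In nodes
        Set pool(i) = n
        i = i + 1
    Next n

    ' Partial Fisher-Yates shuffle: each pass swaps a random not-yet-chosen
    ' node into slot i and removes it, so no node is picked twice
    Randomize
    For i = 0 To total - keepCount - 1
        j = i + Int(Rnd * (total - i))   ' random index in i .. total - 1
        Set tmp = pool(i)
        Set pool(i) = pool(j)
        Set pool(j) = tmp
        pool(i).ParentNode.RemoveChild pool(i)
    Next i
End Sub
```

With the variables from the question, the call would be something like: RemoveRandomNodes xDoc, xPath, 1000. Because the array holds direct references, removing a node does not shift the index of any other node, so there is no need to re-query the document after each removal.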

But I think a better approach is this, in XSLT 3.0:
