Count how many differences in 2 XML files

Question

Imagine one XML as:


<foo>
  <node1>Some value</node1>
  <node2>BB</node2>
  <node3>TTTTT</node3>
  <node4>XXXX</node4>
</foo>

and another XML as:


  Something Else
  XXXX
  TTTTT

The difference count here is 3:
a) node1 has a different value
b) node2 is missing from the 2nd XML
c) node5 is missing from the 1st XML

I've tried using the XMLDiff class but the result is too cumbersome for my needs.

Schema:
The root element is named "foo" and has a single level of child elements, each holding one value.

Question:
What's the simplest and fastest way to code this in C#?

Michael Kay · Accepted Answer

One way to do this might be to generate a list of XPath assertions from your first document, in the form:

/foo/node1 = "Some value"
/foo/node2 = "BB"
/foo/node3 = "TTTTT"
/foo/node4 = "XXXX"

and then apply these assertions to the second document and count how many of them are true (or, if you want the number of differences, how many are false). Because this won't catch data that is absent from the first document but present in the second, you might want to do the inverse as well. It's not perfect, of course: for example, it won't catch differences in element order. But you haven't actually defined what you mean by a significant difference, and you could adjust the XPath expressions to assert whatever you consider significant. For example, you could vary the last assertion to:

count(/foo/node4[. = "XXXX"]) = 1
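As a rough illustration of the assertion idea, here is a minimal Python sketch using only the standard library's xml.etree.ElementTree. It is not the asker's C# and not a definitive implementation. The names value_assertions, count_differences, DOC_A and DOC_B are hypothetical. The element names in the second document are inferred from the differences listed in the question, and the node5 value "ZZ" is a placeholder, since the question does not show it. The reverse pass checks only for existence, which is an assumption made here so that a changed value such as node1 is not counted twice.

```python
import xml.etree.ElementTree as ET

# Sample documents. DOC_B's element names are inferred from the question's
# list of differences; the node5 value "ZZ" is a placeholder (assumption).
DOC_A = """<foo>
  <node1>Some value</node1>
  <node2>BB</node2>
  <node3>TTTTT</node3>
  <node4>XXXX</node4>
</foo>"""

DOC_B = """<foo>
  <node1>Something Else</node1>
  <node4>XXXX</node4>
  <node3>TTTTT</node3>
  <node5>ZZ</node5>
</foo>"""


def value_assertions(root):
    """Yield one (relative path, expected text) pair per child of the root."""
    for child in root:
        yield f"./{child.tag}", (child.text or "").strip()


def count_differences(xml_a, xml_b):
    """Count failed assertions between two flat, single-level documents."""
    root_a = ET.fromstring(xml_a)
    root_b = ET.fromstring(xml_b)
    differences = 0

    # Forward pass: each 'path = value' assertion taken from A must hold in B.
    # A missing element and a changed value both make the assertion fail.
    for path, expected in value_assertions(root_a):
        holds = any((e.text or "").strip() == expected
                    for e in root_b.findall(path))
        if not holds:
            differences += 1

    # Reverse pass: count only elements of B that do not exist in A at all,
    # so a changed value is not counted a second time.
    for path, _ in value_assertions(root_b):
        if root_a.find(path) is None:
            differences += 1

    return differences


print(count_differences(DOC_A, DOC_B))  # prints 3
```

Like the XPath assertions, this ignores element order, and it assumes both documents share the same root and have only one level of children. Comparing the text in Python rather than inside an XPath predicate sidesteps quoting problems when a value contains quote characters.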

The simplest and fastest way to code this, of course, is not in C#, unless that happens to be the only programming language you know. Using XSLT or XQuery would be much better.
