best/most efficient way to test XSLT

Question

I am working with big and complex dictionary data (XML) which needs to be parsed by XSL and output XML.

what would be considered as a "best" way to test if XSL is processing all nodes from XML (input)?

please consider this simple example, i think it will represent nature of the problem:

input.xml



   
      
         some1
         text2
         more text1
      
   
   
      
         some2
         text2
         more text2
      
   
   text3
   
      text
      4

some tarnsformations.xsl

output.xml



   
      some1
      text2
      more text1
   
   
      some2
      text2
      more text2
   
   text3
   text
   4

In output.xml names of the tags have been changed as well as order of the content (comparing to input file). I need to compare if all text fields from Input are available in output. I think the best solution would be to creat test which will extract text from each tag and compare it string by string, outputing tags taht do not exist in output.xml to log file... ?

Mike Sokolov · Accepted Answer

I would recommend two kinds of tests: first a unit test on a smaller controlled set of data that is supposed to be a model for the data you find in your large dictionary. This could be considered a unit test for your xslt process. I usually would extract several representative pieces from the larger data set, and store these along with the test code. Then the test applies the transformation to the test data and makes assertions about the result, verifying that the transformation was successfully employed.

Then additionally you should build sanity checks in to your production system so that (for example), you make sure that the total number of nodes processed corresponds to what you expect. For example, in a dictionary with a large number of entries, you could run one step to count all the entries, and then another one to process them. Then at the end, see how many entries you processed and make sure the count is the same as what you expected. This is also useful since it provides a means of outputting a progress bar (% complete).

Anyway, that's what we do.

If the text in the output is the same as the text in the input, as in your example, Marcin, you can compare those fairly easily using xslt. If you process an xml file with an empty xslt stylesheet (just the node) then you will get back just the text, with no markup. I think xmllint can do this too. So just run that over both your input and output and compare using a simple text comparison (like diff).

best/most efficient way to test XSLT

Answers (2)

Related Questions