Using xsl:accumulator with xsl:try/xsl:catch

Question

I have a very large input document (thousands of Records) that has a structure something like this (Data represents many child elements):

I'm processing it with a stylesheet that performs a complex transform on each Record, which could run into many dynamic errors. In this application, if a few records have bad data, I would prefer not to halt the transform, but I would like to know about the errors so I can fix them later. I'm using xsl:try/xsl:catch to allow the processing to continue:



  

  

  
    
      
    
  

  
    
      
        
        
          Couldn't create good data for {@id} Code: {$err:code} {$err:description}

This works fine, but it's a pain to dig through the large input documents to find the few records that failed. What I'd like to do is write the source of the Record elements that fail to a new Input document using xsl:result-document. I'm trying to add an xsl:accumulator something like this:

However, I can't figure out what the predicate in the xsl:accumulator-rule should be, or if it's even possible to use this pattern. Can a single result document be created without restructuring the stylesheet?

NB: I'm aware of the following solution. It wasn't my first choice because it seems like it could have much higher memory requirements, though perhaps that isn't true. I could also write all the failed Records out to individual files, but I consider this dangerous because one source document might generate thousands of failures.


  
    
      
    
  
  
    
      
    
  
  
    
      
        
      
    
  



  
    
      
      
        Couldn't create good data for {@id} Code: {$err:code} {$err:description}

Michael Kay · Accepted Answer

It's an interesting approach.

The value of an accumulator must always be a pure function of an input node. There's no way of feeding in information from other activities, e.g. whether the processing of the node failed. It's not clear to me whether you can detect the "bad records" independently from the processing that you carry out on those records: if you can, that is, if you are essentially doing custom validation on the input, then this pattern might work quite well. (But in that case, I don't think you would be doing try/catch. Rather, your main processing function would first check the accumulator to see if the data is valid.)

Note that the spec for accumulators allows the computation of one accumulator to access other accumulators, but this is not currently implemented in Saxon.
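To make the "custom validation" variant concrete, here is a minimal, untested sketch for a non-streamed input. It assumes that a bad Record can be recognised from the input alone; the rule used here (a Record with no Data child) is purely hypothetical, as are the `Output` and `Result` element names and the `failed-records.xml` file name. The accumulator collects the bad Records, the Record template consults it instead of relying on try/catch, and a single result document is written at the end.

```xml
<xsl:stylesheet version="3.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform">

  <!-- Assumption: a Record is "bad" when it has no Data child.
       Substitute any check that depends only on the input node. -->
  <xsl:accumulator name="bad-records" as="element(Record)*" initial-value="()">
    <xsl:accumulator-rule match="Record[not(Data)]" select="$value, ."/>
  </xsl:accumulator>

  <!-- Make the accumulator applicable to the principal source document. -->
  <xsl:mode use-accumulators="bad-records"/>

  <xsl:template match="/">
    <Output>
      <xsl:apply-templates select="*/Record"/>
    </Output>
    <!-- One result document holding the source of every bad Record. -->
    <xsl:result-document href="failed-records.xml">
      <Input>
        <xsl:copy-of select="accumulator-after('bad-records')"/>
      </Input>
    </xsl:result-document>
  </xsl:template>

  <xsl:template match="Record">
    <!-- The Record is bad if the accumulator rule has just appended it. -->
    <xsl:if test="not(accumulator-before('bad-records')[last()] is .)">
      <!-- Placeholder for the real per-Record transform. -->
      <Result id="{@id}"/>
    </xsl:if>
  </xsl:template>

</xsl:stylesheet>
```

This only helps when validity can be decided up front. If a failure only shows up while the complex transform is running, the accumulator has no way of learning about it.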

I think the more usual way of tackling this is probably to write the results of successful processing and the reports of unsuccessful processing to the same result tree, and then split this in a subsequent transformation pass. Unfortunately XSLT 3.0 streaming capabilities don't have anything to offer in the area of multi-pass processing. For the splitting pass, however, xsl:fork might well be suitable.
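As a rough illustration of the splitting pass, the untested sketch below assumes the first pass wrapped each failure in a marker element, for example by emitting `<Failed><xsl:copy-of select="."/></Failed>` from the xsl:catch. The intermediate element names (`Results`, `Result`, `Failed`) and the `failed-records.xml` file name are hypothetical and not taken from the question. It uses xsl:fork so that the good results and the failed Records are separated in a single streamed read of the intermediate document, on a processor that supports streaming.

```xml
<xsl:stylesheet version="3.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform">

  <!-- Assumed intermediate format produced by the first pass:
       Results contains Result elements (successful output) and
       Failed elements, each wrapping a copy of the source Record. -->

  <xsl:mode streamable="yes"/>

  <xsl:template match="/Results">
    <xsl:fork>
      <!-- Prong 1: the successful results go to the principal output. -->
      <xsl:sequence>
        <Results>
          <xsl:copy-of select="Result"/>
        </Results>
      </xsl:sequence>
      <!-- Prong 2: the failed source Records go to one secondary document. -->
      <xsl:sequence>
        <xsl:result-document href="failed-records.xml">
          <Input>
            <xsl:copy-of select="Failed/Record"/>
          </Input>
        </xsl:result-document>
      </xsl:sequence>
    </xsl:fork>
  </xsl:template>

</xsl:stylesheet>
```

The first pass keeps its existing try/catch structure; only the content of the xsl:catch changes, and the failed Records end up in one file rather than thousands.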
