XSLT Streaming complex documents

Question

Most of the examples I see of XSLT 3.0 streaming are fairly simple, and take inputs of the form


 
   text 
   text
 
 ...

Assume you need to touch all the tags inside repeatingThing. In this case, streaming works well enough, do a copy-of inside your repeatingThing template, and you have reduced your memory footprint to 1/X (where X is the number of repeatingThing tags) of its original.

However, I deal with XML that is highly nested. Additionally, because of the nature of my stylesheet (JSON<->XML conversion), I need to touch all the tags in the XML source document. The copy-of approach won't work here, as the content is spread over many child nodes, and I'd be copying the entire XML into memory, just more explicitly.

I'm at a loss of how to use streaming to work in this case. A skeleton of such a "hierarchical" document is below:

Using Saxon-EE 9.8.0.12

Martin Honnen · Accepted Answer

That sample you have linked to is too long to allow me to judge it but at least some templates are written in a style that seems too verbose even if you don't want to use streaming, e.g.


    
    
        
            
        
    
    
        
            
        
    
    
        
            
        
    
    
        
            
        
    
    
        ElectionResults.LatLng

seems to be doable as


    
    
    
    
        ElectionResults.LatLng

and then for the child elements and attributes you know that they are simple types you would simply use the approach suggested in my comment e.g.


    {.}



    {.}

Of course this basically assumes the child elements are to be processed in the order they are present and you want all of them processed but the last restriction can be eased even with streaming if you use e.g. .

So at least where you simply want to map your known schema types to JSON and have spelled out a lot of different templates for the various elements I think that use of apply-templates instead of spelling out various child selections can help to make code streamable. For the types where you have the possible minOccurs=0 and maxOccurs=unbounded I think you can live with

instead of the apply-templates, that will of course "materialize" the adjacent sibling group of elements of the same name but as you seemed to have spelled out the explicit creation of arrays so far in dedicated templates where you need it you can just rewrite this dedicated templates and don't run the risk of using that approach in general for any element.

If you want to keep the verbose style with the explicit selection of various child elements in the same template then you could try how well Saxon does with the use of xsl:fork e.g.


    
    
     
      
        
            
        
      
     
     
      
        
            
        
      
     
     
      
        
            
        
      
     
     
      
        
            
        
      
     
    
    
        ElectionResults.LatLng

The call-template use you also have will not be possible with streaming in general. It seems also be used in this stylesheet to process XML elements in a different order than the input order, it seems to output any subelements declared in abstract types after the ones declared in extended types. That of course doesn't work well with the streaming approach of forwards only, node by node processing. So I guess there you have to decide whether you can't output the base subelements first in the JSON.

XSLT Streaming complex documents

Answers (1)

Related Questions