How to put sequential number for each node (on one level) in TSQL, using .nodes()?

Question

I need to expand an xml in SQL server as table. I do that using XQuery with nodes() and .query(). But I need that each node has a sequential number and I also need to filter nodes based on their internal structure and I don't figure out how to do that.

I need following result:

------------------------
| 1  |  |
------------------------
| 2  |  |
------------------------
...

I have following XML:

I need when Code has value empty () to skip this node.

I use the following code, but I can not figure out how to make sequential numbers or how to filter:

DECLARE @XMLInput XML = '
    
        Domain
        dom
    
    
        Wdth
        1
    
    
        Code
        TEST
    


    
        Domain
        dom
    
    
        Wdth
        1
    
    
        Code
        
    
';

SELECT 
    Child.query('declare default element namespace "http://mynamespace.com/ns/"; (.)') AS node
FROM
        @XMLInput.nodes('declare default element namespace "http://mynamespace.com/ns/"; (/node)') AS N(Child)

EDIT: Because there are unclear elements I clarify. I need to filter out entire node when there is a node with node with value "Code" and corresponding node which is empty. In this case I need whole to be removed - not visible.

Gottfried Lesigang · Accepted Answer

The first part is rather easy, use ROW_NUMBER() OVER(). As XMLs have an implicit sort order, we can use (SELECT NULL). Rows will appear according to their physical order within the XML:

With this code you will get your nodes numbered:

WITH XMLNAMESPACES(DEFAULT 'http://mynamespace.com/ns/')
SELECT ROW_NUMBER() OVER(ORDER BY (SELECT NULL)) AS NodeNr
      ,Child.query('.') AS node
FROM
    @XMLInput.nodes('/node') AS N(Child)

Your second part is not clear for me. Do you want to surpress the second node completely because there is


    Code

? Or do you want to surpress just this ?

The following would use the upper as derived table (a CTE) and then use .exist() on the column node. This method merely checks the existance of any node according to the XQuery expression. In this case I search for any with the text()="Code". From there we navigate one level up and search for a element where the text() is empty. If this exists, the function returns 1, so we need those without:

WITH XMLNAMESPACES(DEFAULT 'http://mynamespace.com/ns/')
,shredded AS
(
    SELECT ROW_NUMBER() OVER(ORDER BY (SELECT NULL)) AS NodeNr
            ,Child.query(N'.') AS node
    FROM
        @XMLInput.nodes(N'/node') AS N(Child) 
)
SELECT *
FROM shredded
WHERE shredded.node.exist(N'//Tag[text()="Code"]/../Value[empty(text())]')=0

UPDATE: Documentation of sort order persistance

As @MartinSmith has pointed out, there was no proof for this

Rows will appear according to their physical order within the XML

In the meanwhile I've found this:

Well, this is still no valid proof, that a SELECT on .nodes() will return the derived table in exactly the same order as within the XML under all circumstances. But - at least - it points to the fact, that the internal order is worth to be persisted.

My conclusio: The internal order is seen as inherent part of the XML document. That's why I'm pretty sure, that .nodes() will return a derived table in the same order. Adding ROW_NUMBER() OVER(ORDER BY (SELECT NULL)) should do nothing else than to add a running number to these rows.

How to put sequential number for each node (on one level) in TSQL, using .nodes()?

Answers (1)

UPDATE: Documentation of sort order persistance

Related Questions