Getting started with XPath

Question

I am self-studying XPath from Pro XML Development with Java. Just for practice I have constructed a sample XML document and some XPath expressions.
Below are a few XPath expressions along with their explanations and a few related questions. Please correct me if my explanations are wrong and answer the questions wherever applicable.

XML



    
        John
        Computer Technology
        6
        E
    

    
        Foo
        Industrial Electronics
        6
        E
    

    
        
            
                Dingle
                Grumpiness
                3
                E

Expression 1: /people/student[@scholarship='Yes']/name
Explanation: Will select the elements .. which are contained in such that has an attribute named scholarship with a value of Yes
Question: Will this also select the value John in it ????

Expression 2: /people/student[2]
Explanation: Will select the element .. which is at the 2nd position in the element
Question: Will it also select the child nodes within ?

Expression 3: /people/student/@scholarship
Explanation: Will select the attribute scholarship in the element student. If there were multiple then it would select multiple attributes

Expression 4: //name[ancestor::student]
Explanation: Will select all the .. elements
// means 'all-the-descendants'. In my context it means 'I don't care who the descendants are as long as my immediate ancestor is student'

Dimitre Novatchev · Accepted Answer

Expression 1: /people/student[@scholarship='Yes']/name Explanation: Will select the elements .. which are contained in such that has an attribute named scholarship with a value of Yes Question: Will this also select the value John in it ????

This expression selects any (all) name element that is a child of a student element (whose scholarship attribute has as string value the string "yes")and that is a child of the top element (named people) of the XML document. XPath doesn't select "values" -- it selects nodes. In this case the string "John" is the string value of the selected name element. The selected name element has a single child text node, whose string value is "John".

Expression 2: /people/student[2] Explanation: Will select the element .. which is at the 2nd position in the element Question: Will it also select the child nodes within ?

This selects the second (in document order) student child of the top element (whose name must be people). The child nodes of the selected element are not selected themselves. The number of selected nodes can be obtained using the count() function:

count(/people/student[2])

and it is 1 -- this means that only the element (but not its children or descendants) is selected.

Expression 3: /people/student/@scholarship Explanation: Will select the attribute scholarship in the element student. If there were multiple then it would select multiple attributes

This selects the scholarship attribute of any student element that is a child of the top element (whose name must be people). This means that if there are N student elements that are children of the people top element, and if each of these has a scholarship attribute, then N scholarship attributes will be selected.

Expression 4: //name[ancestor::student] Explanation: Will select all the .. elements // means 'all-the-descendants'. In my context it means 'I don't care who the descendants are as long as my immediate ancestor is student'

This selects all name elements that have a student ancestor (and this ancestor may not only be the immediate parent, but also an ancestor of the immediate parent).

Here one can write an equivalent XPath expression that doesn't contain any reverse axes:

//student//name

In case you wanted to select all name elements whose parent is a student element, one way to express this is:

//student/name

Finally, I would recommend using a tool like the XPath Visualizer (which I created 12 years ago) that has helped many thousands of people learn XPath by playing and having fun.

Getting started with XPath

XML

Answers (2)

Related Questions