XPath to get data before 2 <br> tags

Question

I need to extract the text that comes before the 2 consecutive <br> tags, that is, text3. The code is similar to the following:

    <div>
        text1
        <br>
        text2
        <br>
        text3
        <br>
        <br>
        text4
    </div>

I tried //div/text()[preceding-sibling::br], but it returns every text node that follows a <br>, not just text3.

har07 · Accepted Answer

Finding 2 consecutive <br>s in this scenario turns out to be trickier than I expected, because empty text nodes (the ones that consist only of whitespace) need to be ignored here. This is one way:

/br[
    following-sibling::node()[self::* | self::text()[normalize-space()]][1][self::br]
]

The first predicate selects the following-sibling nodes that are either element nodes (self::*) or non-empty text nodes (self::text()[normalize-space()]). Then [1] keeps only the first such node, and finally [self::br] checks that this node is a <br>.

The complete XPath expression would be as follows:

//div
 /br[
    following-sibling::node()[self::* | self::text()[normalize-space()]][1][self::br]
 ]
 /preceding-sibling::text()[1]
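
To try the expression out, here is a minimal sketch in Python. It assumes the third-party lxml library is installed, and the sample markup is only a guess at the structure described in the question. The names SAMPLE, XPATH and text_before_double_br are made up for this illustration.

```python
from lxml import html  # third-party; assumed installed (pip install lxml)

# Hypothetical markup modelled on the question: text3 sits right
# before the two consecutive <br> tags.
SAMPLE = """
<div>
    text1
    <br>
    text2
    <br>
    text3
    <br>
    <br>
    text4
</div>
"""

# The XPath from the answer, written on one logical line.
XPATH = (
    "//div/br["
    "following-sibling::node()[self::* | self::text()[normalize-space()]][1][self::br]"
    "]/preceding-sibling::text()[1]"
)


def text_before_double_br(markup):
    """Return the stripped text nodes that directly precede a <br><br> pair."""
    tree = html.fromstring(markup)
    # xpath() returns the matching text nodes as strings; strip() removes
    # the surrounding newlines and indentation.
    return [node.strip() for node in tree.xpath(XPATH)]


print(text_before_double_br(SAMPLE))
```

Only the first <br> of the pair passes the predicate, so the trailing text()[1] step picks up the text node just before it. If an element other than <br> can appear between the text and the pair, the last step would need adjusting.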
