xpath expression to get only text content of node for dynamic content

Question

I'm trying to catch content of

without child element from the below HTML code:


   14,490
   13,490

So I'm expecting 13,490 as result.

I use the following xpath expression for that:

//div[@class="num"]/node()[not(self::del)]

which works fine for this example, but the problem is that the content of the div may vary and sometimes it doesn't contain the child element.

I'm using Python and can handle this on coding level but I'm curious whether this can be done by xpath only.

alecxe · Accepted Answer

You can just get the text():

//div[@class="num"]/text()

Demo (using xmllint):

$ xmllint index.html --xpath '//div[@class="num"]/node()[not(self::del)]'
13,490
$ xmllint index.html --xpath '//div[@class="num"]/text()'
13,490

This would also work if there is no del tag:


   13,490

Answers (1)