Extract data from table cells and ignore specific child tags with Xpath?

Question

Given this HTML table:

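    <table class="info">
      <tr><td>Year</td><td>2011</td></tr>
      <tr><td>Area</td><td>45 m<sup>2</sup></td></tr>
      <tr><td>Condition</td><td>Renovated</td></tr>
    </table>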

I am trying to extract the data from the second cell in each row (that is: 2011, 45 m, Renovated).

I use this XPath expression:

//table[@class="info"]//td[2]//text()

Received output (wrong):

2011
45 m
2
Renovated

Desired output:

2011
45 m
Renovated

As you can see, from the second row I also received the value that is enclosed in the <sup> tags. I want to exclude this value. I know that instead of my current XPath expression I can use this one (with one slash removed before text()):

//table[@class="info"]//td[2]/text()

This solves the problem for this table, but I need to exclude only this specific <sup> tag inside the <td>, because sometimes the <td> contains other tags whose text I do want to keep.

So, I want to get the data from the second cell in each row and exclude only the value in the <sup> tags.

alecxe · Accepted Answer

For every tr, get the second td and use /text() (single slash) so that only the cell's own text nodes are returned, not the text of its child elements. This worked for me:

//table[@class="info"]//tr/td[2]/text()

Prints:

2011
45 m
Renovated

Or, if you want to exclude only the sup element:

//table[@class="info"]//tr/td[2]//text()[not(parent::sup)]
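To compare the three expressions side by side, here is a minimal sketch in Python using lxml (my choice for illustration; the question does not say which tool is used). The markup is an assumption based on the question, and the fourth row with a <b> tag is hypothetical, added only to show a child tag whose text should be kept. The helper cell_texts is also just an illustrative name.

```python
from lxml import html

# Assumed markup, modelled on the table in the question.
# The last row is hypothetical: it has a child tag (<b>) that we want to keep.
PAGE = """
<html><body>
<table class="info">
  <tr><td>Year</td><td>2011</td></tr>
  <tr><td>Area</td><td>45 m<sup>2</sup></td></tr>
  <tr><td>Condition</td><td>Renovated</td></tr>
  <tr><td>Status</td><td><b>Sold</b></td></tr>
</table>
</body></html>
"""

tree = html.fromstring(PAGE)


def cell_texts(expr):
    """Run an XPath expression and return the non-empty, stripped text nodes."""
    return [t.strip() for t in tree.xpath(expr) if t.strip()]


# //text(): every descendant text node, including the one inside <sup>
all_text = cell_texts('//table[@class="info"]//tr/td[2]//text()')

# /text(): only text nodes that are direct children of the td
direct_text = cell_texts('//table[@class="info"]//tr/td[2]/text()')

# //text()[not(parent::sup)]: every descendant text node except those in <sup>
no_sup = cell_texts('//table[@class="info"]//tr/td[2]//text()[not(parent::sup)]')

print(all_text)     # ['2011', '45 m', '2', 'Renovated', 'Sold']
print(direct_text)  # ['2011', '45 m', 'Renovated']
print(no_sup)       # ['2011', '45 m', 'Renovated', 'Sold']
```

Note that /text() drops "Sold" because that text belongs to the <b> child, while the not(parent::sup) version keeps it and only skips the superscript. One caveat: parent::sup only checks the immediate parent, so text nested deeper inside a sup (for example inside a <b> within the <sup>) would need not(ancestor::sup) instead.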
