Parse selective table rows in Python with lxml and XPath

Question

Below is the structure of the HTML file that I wish to parse:


    'some text'

    'some text'

    'some text'

    'some text'

I am interested in parsing only the text under <tr data-mod-primary="true"> and ignoring the other <tr>'s.

I get all the text through .xpath('//tr/td/text()'), but this is not what I want. After researching a solution for some time, I tried the code below:

.xpath('//tr[contains(@data-mod-primary="true",None)]/td/text()')

but this too gets me the text under all <tr>'s, basically the same result as .xpath('//tr/td/text()').

Any help is appreciated. Thank you.

akuiper · Accepted Answer

You can use an [@attr='value'] predicate to select specific tr tags:

//tr[@data-mod-primary='true']/td/text()
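For illustration, here is a minimal sketch of that expression run through lxml. The markup in SAMPLE is hypothetical, since the original HTML did not survive in the question; it only assumes that the rows of interest carry data-mod-primary="true".

```python
from lxml import html

# Hypothetical markup standing in for the asker's file: some rows carry
# data-mod-primary="true", the others either lack the attribute or differ.
SAMPLE = """
<table>
  <tr data-mod-primary="true"><td>primary one</td></tr>
  <tr><td>other one</td></tr>
  <tr data-mod-primary="true"><td>primary two</td></tr>
  <tr data-mod-primary="false"><td>other two</td></tr>
</table>
"""

tree = html.fromstring(SAMPLE)

# Every cell, regardless of the row's attributes (the unwanted result).
all_cells = tree.xpath("//tr/td/text()")

# Only cells whose parent row has data-mod-primary exactly equal to "true".
primary_cells = tree.xpath("//tr[@data-mod-primary='true']/td/text()")

print(all_cells)
print(primary_cells)
```

The predicate in square brackets filters the tr step before the path descends into td, so rows without the attribute never contribute any text.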

Or if you use contains, it would be something like:

//tr[contains(@data-mod-primary, 'true')]/td/text()
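The sketch below compares the two forms and also shows why the attempt in the question matched every row. The markup and the "true-legacy" value are made up for the example. In XPath 1.0 the bare word None is read as a child element name, which selects nothing and converts to an empty string, and contains(x, '') is true for any x.

```python
from lxml import html

# Hypothetical rows: an exact value, a value that merely contains "true",
# and a row with no data-mod-primary attribute at all.
SAMPLE = """
<table>
  <tr data-mod-primary="true"><td>exact</td></tr>
  <tr data-mod-primary="true-legacy"><td>substring</td></tr>
  <tr><td>no attribute</td></tr>
</table>
"""

tree = html.fromstring(SAMPLE)

# Exact comparison: only the first row qualifies.
exact = tree.xpath("//tr[@data-mod-primary='true']/td/text()")

# Substring test: any attribute value containing "true" qualifies.
loose = tree.xpath("//tr[contains(@data-mod-primary, 'true')]/td/text()")

# The expression from the question: the second argument is an empty
# node-set (no child element named None), i.e. the empty string,
# so the predicate holds for every row.
original = tree.xpath('//tr[contains(@data-mod-primary="true",None)]/td/text()')

print(exact)
print(loose)
print(original)
```

If the attribute can only ever be "true" or absent, both forms select the same rows; the equality test is the stricter choice when other values containing "true" might appear.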
