Can't parse a certain information from some html elements using xpath

Question

I've created an xpath expression to target an element so that I can extract a certain information out of some html elements using xpath within scrapy. I can't reach it anyway.

Html elements:

I wish to extract R 3500 out of it.

I've tried with:

from scrapy import Selector

html = """

                
                  Rates :
                
                  R 3500
                  

              
"""
sel = Selector(text=html)
rate = sel.xpath("//*[@class='rates']/label/following::*").get()
print(rate)

Upon running my above script this is what I'm getting whereas I wish to get R 3500.

I could have used .tail if opted for lxml. However, when I go for scrapy I don't find anything similar.

How can I extract that rate out of the html elements using xpath?

RomanPerekhrest · Accepted Answer

To get a text node as a following-sibling after the label node:

...
sel = Selector(text=html)
rate = sel.xpath("//*[@class='rates']/label/following-sibling::text()").get().strip()
print(rate)

The output:

R 3500

Addition: "//*[@class='rates']/label/following::text()" should also work.

https://www.w3.org/TR/1999/REC-xpath-19991116#axes

Can't parse a certain information from some html elements using xpath

Answers (2)

Related Questions

Can&#39;t parse a certain information from some html elements using xpath

Answers (2)

Related Questions

Can't parse a certain information from some html elements using xpath