Retrieving the name of a class attribute with lxml

Question

I am working on a Python project that uses lxml to scrape a page, and I am having trouble retrieving the value of a span's class attribute. The HTML snippet is below:


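<tr>
  <td>12th January 2016</td>
  <td>11:22pm</td>
  <td>Clothing</td>
  <td class="product">
    <span class="Brand">carlos santos</span>
  </td>
  <td>10</td>
  <td>polo</td>
</tr>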

....

How do I retrieve the value of the span's class attribute below:

<span class="Brand">carlos santos</span>

har07 · Accepted Answer

You can use the following XPath to get the class attribute of a span element that is a direct child of a td with class "product":

//td[@class="product"]/span/@class

Working demo:

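from lxml import html

# Sample markup: a table row whose "product" cell holds the span of interest
raw = '''
<table>
<tr>
<td>12th January 2016</td>
<td>11:22pm</td>
<td>Clothing</td>
<td class="product">
<span class="Brand">carlos santos</span>
</td>
<td>10</td>
<td>polo</td>
</tr>
</table>
'''

root = html.fromstring(raw)
# xpath() returns a list of matches; take the first class attribute value
span = root.xpath('//td[@class="product"]/span/@class')[0]
print(span)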

Output:

Brand
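
Indexing the XPath result with [0] raises an IndexError when nothing matches. A slightly more defensive variant, offered only as a sketch, selects the span elements themselves and reads the attribute with .get(). The helper name span_classes and the trimmed sample markup are hypothetical; only the "product" cell and the "Brand" span are taken from the question.

```python
from lxml import html

# Hypothetical, trimmed markup: only the "product" cell and the "Brand" span
# come from the question; the other cells are placeholders.
raw = '''
<table>
  <tr>
    <td>12th January 2016</td>
    <td class="product"><span class="Brand">carlos santos</span></td>
    <td>polo</td>
  </tr>
</table>
'''


def span_classes(markup):
    """Return the class attribute of each span directly under td.product."""
    root = html.fromstring(markup)
    # Select the span elements rather than the attribute nodes.
    spans = root.xpath('//td[@class="product"]/span')
    # .get() returns None when the attribute is absent instead of raising.
    return [span.get('class') for span in spans]


classes = span_classes(raw)
# Guard against an empty result before indexing.
print(classes[0] if classes else 'no matching span')
```

Returning a list leaves it to the caller to decide what an empty result means, which tends to be convenient when scraping pages whose rows are not all shaped the same way.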
