python xpath selector from div

Question

I started playing with python and come across something that should be very simple but I cannot make it work... I had below HTML

Available Products

What I need is to use parser.xpath to get value of data-category element.

Im trying for example:

cgy = xpath('//div["data-category"]')

What Im doing wrong ?

antfuentes87 · Accepted Answer

Personally I use lxml html to do my parsing because it is fast and easy to work with in my opinion. I could of shorten up how the category is actually being extracted but I wanted to show you as much detail as possible so you can understand what is going on.

from lxml import html

def extract_data_category(tree):
    elements = [
        e
        for e in tree.cssselect('div#productlistcontainer')
        if e.get('data-category') is not None
    ]
    element = elements[0]
    content = element.get('data-category')
    return content

response = """
Available Products


"""

tree = html.fromstring(response)
data_category = extract_data_category(tree)
print (data_category)

python xpath selector from div

Answers (2)

Related Questions