No result from Xpath

Question

I am trying to get a list of teams and scores from this page http://stats.rleague.com/rl/seas/2014.html just as an exercise to learn.

I am not getting the expected results. First, my imports and the page:

In [1]: from lxml import html

In [2]: import requests

In [3]: page = requests.get('http://stats.rleague.com/rl/seas/2014.html')

In [4]: tree = html.fromstring(page.text)

This is the HTML for the title:

Rugby League Tables / Season 2014

And for the teams:

Souths4t 6g  28Date:Thu 06-Mar-2014 Venue:Stadium Australia Crowd:27,282
Sydney Roosters1t 2g  8Souths won by  20 pts

However, I get empty lists. What am I doing wrong?

In [6]: print(tree)


In [7]: titles = tree.xpath('//html[@title]/text()')

In [8]: print(titles)
[]

In [11]: teams = tree.xpath('//tr/td[@href]/text()')

In [12]: print(teams)

[]

falsetru · Accepted Answer

Changing the XPath expressions will give you the results you want:

# `title` is not an attribute, but a tag.
titles = tree.xpath('.//title/text()')
print(titles)

# `td` has no `href` attribute; the `a` tag inside it does.
teams = tree.xpath('//tr/td/a[@href]/text()')
print(teams)
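
To see why the original expressions matched nothing, here is a minimal offline sketch. It assumes a simplified, hypothetical version of the page's markup (the `href` values and table layout are made up for illustration), and it uses the standard library's `xml.etree.ElementTree` instead of lxml so it runs without a network request. ElementTree supports only a subset of XPath (no `text()` step), but the `[@attr]` predicate means the same thing in both: it selects elements that carry that attribute.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified markup; the real page's structure may differ.
SAMPLE = """<html>
<head><title>Rugby League Tables / Season 2014</title></head>
<body><table>
<tr><td><a href="teams/souths.html">Souths</a></td><td>28</td></tr>
<tr><td><a href="teams/roosters.html">Sydney Roosters</a></td><td>8</td></tr>
</table></body>
</html>"""

root = ET.fromstring(SAMPLE)

# No <td> carries an href attribute, so this predicate matches nothing.
td_with_href = root.findall(".//tr/td[@href]")

# The href attribute lives on the <a> inside each <td>.
links = root.findall(".//tr/td/a[@href]")
teams = [a.text for a in links]

# <title> is an element, not an attribute of <html>.
title_attr = root.get("title")
title_text = root.findtext(".//title")

print(td_with_href)
print(teams)
print(title_attr, title_text)
```

The first query comes back empty for the same reason `//tr/td[@href]/text()` did in the question: the predicate filters on an attribute that the `td` elements do not have.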
