Xpath parsing the whole page when i specify not to

Question

I'm parsing websites using python and XPath.

What I'm trying to do is to extract the href from the

So here's how is the XML (page):


  
    
      
        
          
          
            
        

          
        

          
  
    
      
        
          
          
            
        

          
        

          


And here's the code I did:

posts = page.xpath("//div[@id='posts']/div[@align='center']")
for post in posts :
  print post.xpath("//table/tr[1]/td[2]/a/@href")


But the problem is that I end up with every href of posts and not the single one from post

What am I doing wrong ?

Keith Hall · Accepted Answer

An XPath starting with a / character means that it will be begin at the document root node. To create a relative XPath from the context node, you need to put a . before the /.

So your code should be:

posts = page.xpath("//div[@id='posts']/div[@align='center']")
for post in posts:
  print post.xpath(".//table/tr[1]/td[2]/a/@href")

Xpath parsing the whole page when i specify not to

Answers (1)

Related Questions