Trouble selecting nodes with Html Agility Pack

Question

I have the current HTML layout

 //table[1]

 //table[2]

   
      
         
             
         
      
   
   
      
          //table[1]//table[1]
            
               
                  
                     
                        INFO 1
                     
                  
                  
                     
                        INFO 2
                     
                  
                  
                     
                        INFO 3
                     
                  
                  
                     
                        INFO 4
                     
                  
               
            
         
      
   
   
      
          //table[1]//table[2]
            
               
                  
                     Name
                  
                  
                     Quantity
                  
               
               
                  
                     Apples 
                  
                  10
               
            
         
      
   
   
      
           //table[1]//table[3]

I am trying to get the data within //table[1]//table[2], yet I keep getting a null HtmlNode (System.NullReferenceException) for the following:

doesn't' work: doc.DocumentNode.SelectSingleNode("//table[2]//tbody//tr//td//table[2]//tbody//tr");,

I am not sure why this occurs as when I try to get data for //table[1]//table[1] it works just fine with this syntax

works: doc.DocumentNode.SelectSingleNode("//table[2]//tbody//tr//td//table[1]//tbody//tr");

Am I misunderstanding how the indexing works with Html Agility Pack?

har07 · Accepted Answer

//table[2] return 2nd

element within the same parent because in XPath :

The ([]) has a higher precedence (priority) than (// and /). [For Reference]

In your case, there is only one

in each

, therefore the Xpath expression returned nothing. One possible solution is to put brackets to alter the precedence :

(//table[2]//tbody//tr//td//table)[2]//tbody//tr

Above Xpath get 2nd

element from all

s returned by the inner XPath //table[2]//tbody//tr//td//table. Then from that

, continue to return descendants //tbody//tr elements.

Trouble selecting nodes with Html Agility Pack

Answers (2)

Related Questions