Xpath Get text after first html tag

Question

There are next block


  head1
    Text1 

 text12  

 text 13
  head11
    Text11
  head3
    Text2

How to get text after first H1 with ignore as

Text1 
text12
text 13

I use Grab Python page = g.doc.select('//div[@class="text"]/h3[1]/following-sibling::text()]') Result is

Text1
text12
text 13
Text11
Text2

Daniel Haley · Accepted Answer

You could try selecting the text() that only has one preceding h1 sibling...

//div[@class='text']/text()[count(preceding-sibling::h1)=1]

Another alternative is to try using the Kayessian method...

//div[@class='text']/h1[1]/following-sibling::text()[count(.|//div[@class='text']/h1[1+1]/preceding-sibling::text()) = count(//div[@class='text']/h1[1+1]/preceding-sibling::text())]

Here's a better example and explanation of the Kayessian method.

Xpath Get text after first html tag

Answers (1)

Related Questions