Find nodes until condition using XPath

Question

What expression should I use to find all td nodes after the one, which contains text Foo or Bar and stop before the next with unknown text. Thanks.

Foo || Bar
TEXT1
TEXT2
TEXT3
...
VARIABLE
...

UPDATE:

use strict; 
use warnings;
use autodie;
use utf8;
use WWW::Mechanize;
use HTML::TreeBuilder::XPath;

my $url = 'www.perl.org';

my $mech = WWW::Mechanize->new;
$mech->agent_alias( 'Windows Mozilla' );
$mech->get( $url );

my $tree= HTML::TreeBuilder::XPath->new;

$tree->parse($mech->content);

for my $nodes ($tree->findnodes('//td[
                            preceding-sibling::td
                            [contains(., "Foo") or contains(., "Bar")] 
                            and following-sibling::td[@colspan="4"]
                            ]')) {

    print $nodes->as_text;

}

Kirill Polishchuk · Accepted Answer

You can use this XPath:

//td[
      preceding-sibling::td
            [contains(., 'Foo') or contains(., 'Bar')] 
      and following-sibling::td[@colspan = 4]
]

It will return:

TEXT1
TEXT2
TEXT3

Find nodes until condition using XPath

Answers (2)

Related Questions