Perl: How to do an optional regex match

Question

Suppose I have the following string:

$doc=<<'TEXT_END';
11：20
           
13：55   訂票  

TEXT_END

How can I capture both 11：20 and 13：55 with one regular expression?

I don't know how to make part of a match optional (so that the two tags following the time can be ignored).

訂票 means "book a ticket". (The website adds this link when a showing is available for booking.)

Sorry for my bad English.

Below is my code; it doesn't work correctly.

#!/usr/bin/env perl
#use utf8;
use LWP::Simple;

binmode(STDIN, ':encoding(utf8)');
binmode(STDOUT, ':encoding(utf8)');
binmode(STDERR, ':encoding(utf8)'); 

my $doc = get 'http://www.atmovies.com.tw/showtime/theater_t06609_a06.html';

my @movies = ($doc =~ /([^><]+).+?(.+?)/gs);

for($i=1; $i<=$#movies; $i+=3){
    print "$movies[$i]\n";
    print $movies[$i+1]."\n\n";

    # this works just fine!
    my @times = ($movies[$i+1] =~ /([^<>]+)\n\s+/g);
    for($j=0; $j<=$#times; $j++){
        print "$times[$j]\n";
    }

    # this regex doesn't work correctly; it captures nothing
    @times_available=($movies[$i+1] =~ /([^><\s]+)   ☆訂票  /g);
    for($j=0; $j<=$#times_available; $j++){
        print "$times_available[$j]\n";
    }

}

Lee Duhem · Accepted Answer

You could try this:

@times = $doc =~ m/>\s*([\d：]+)/g;

Here is the full test program:

#!/usr/bin/perl

use warnings;
use strict;

use utf8;

use Data::Dumper;

my $doc=<<'TEXT_END';
11：20
           
       13：55   訂票  

TEXT_END

my @times = $doc =~ m/>\s*([\d：]+)/g;

print Dumper(\@times);

And the result:

$ perl t020.pl 
$VAR1 = [
          '11：20',
          '13：55'
        ];
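
If you also want to know which showings can be booked, one possible approach is to wrap the booking marker in an optional group, `(?: ... )?`, so the time matches whether or not the marker follows it. The sketch below is only an illustration: the helper name `extract_showtimes` is hypothetical, and it assumes the only things between a time and its 訂票 marker are whitespace and HTML tags.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use utf8;

binmode(STDOUT, ':encoding(UTF-8)');

# Hypothetical helper: returns a list of [time, bookable] pairs.
sub extract_showtimes {
    my ($text) = @_;
    my @showtimes;
    # (\d{1,2}：\d{2})      the time, written with a full-width colon
    # (?:\s|<[^>]*>)*      skip whitespace and tags (assumed separators)
    # (?: ... (訂票) )?     optional group: the marker may be absent
    while ($text =~ /(\d{1,2}：\d{2})(?:(?:\s|<[^>]*>)*(訂票))?/g) {
        # $2 is undef when the optional group did not take part in the match
        push @showtimes, [ $1, defined $2 ? 1 : 0 ];
    }
    return @showtimes;
}

my $sample = "11：20\n           \n13：55   訂票  \n";
for my $show (extract_showtimes($sample)) {
    my ($time, $bookable) = @$show;
    print $time, ($bookable ? " (bookable)" : ""), "\n";
}
```

Because only whitespace and tags are allowed between the time and the marker, the optional group cannot skip past the next time to reach a later 訂票, so 11：20 is reported without the marker and 13：55 with it.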
