perl regular expression to match between a fixed keyword and another two variable keywords

Question

I need to write a regex in perl to do the following.

The starting line is keyword1 (like "this is keyword1"), and the ending line is either keyword2 (like "end1 here") or keyword3 (like "end2 here"). For example, the text file may look like:

*********** this is keyword1***********
*****
..
*******apple***********
******
..
*********** this is keyword1***********
*****
..
*******orange***********
******
..
*********** this is keyword1***********
*****
..
*******orange***********
******
..

My task is to match those blocks

*********** this is keyword1***********
*****
..(comment: no "this is keyword1" here)
*******apple***********

or

*********** this is keyword1***********
*****
.. (comment: no "this is keyword1" here)
*******orange***********

Appreciate your help!

Sinan &#220;n&#252;r · Accepted Answer

My previous answer missed your revised requirements. Here is the updated code:

#!/usr/bin/env perl

use 5.012;
use strict;
use warnings;

my $text = do { local $/;  };
my $pat = qr{
    (
        [^
]*?
        keyword1
        .*?
        (?:apple|orange)
        [^
]*?
        

    )
}sx;

my $result;

while ($text =~ /$pat/g) {
    $result .= "[[[
$1]]]
";
}

say $result;


__DATA__
*********** this is keyword1***********
*****
..(comment: no "this is keyword1" here)
*******apple***********
*****
..
*********** this is keyword1***********
*****
..
*******apple***********
******
..
*********** this is keyword1***********
*****
.. (comment: no "this is keyword1" here)
*******orange***********
*****
..
*********** this is keyword1***********
*****
..
*******orange***********
******
..
*********** this is keyword1***********
*****
..
*******orange***********
******
..

Output:

[[[
*********** this is keyword1***********
*****
..(comment: no "this is keyword1" here)
*******apple***********
]]]
[[[
*********** this is keyword1***********
*****
..
*******apple***********
]]]
[[[
*********** this is keyword1***********
*****
.. (comment: no "this is keyword1" here)
*******orange***********
]]]
[[[
*********** this is keyword1***********
*****
..
*******orange***********
]]]
[[[
*********** this is keyword1***********
*****
..
*******orange***********
]]]

The brackets are there to visually verify that correct blocks were matched.

perl regular expression to match between a fixed keyword and another two variable keywords

Answers (2)

Original Suggested Solution

Revised requirements — Revised program

Re-revised Requirements — Re-revised Solution

Related Questions