abhiram potluri
abhiram potluri

Reputation: 45

Extracting specific lines with Perl

I am writing a perl program to extract lines that are in between the two patterns i am matching. for example the below text file has 6 lines. I am matching load balancer and end. I want to get the 4 lines that are in between.

**load balancer** 
new 
old
good
bad
**end**

My question is how do you extract lines in between load balancer and end into an array. Any help is greatly appreciated.

Upvotes: 3

Views: 2716

Answers (4)

Schwern
Schwern

Reputation: 165546

You can use the flip-flop operator.

Additionally, you can also use the return value of the flipflop to filter out the boundary lines. The return value is a sequence number (starting with 1) and the last number has the string E0 appended to it.

# Define the marker regexes separately, cuz they're ugly and it's easier
# to read them outside the logic of the loop.
my $start_marker = qr{^ \s* \*\*load \s balancer\*\* \s* $}x;
my $end_marker   = qr{^ \s* \*\*end\*\* \s* $}x;

while( <DATA> ) {
    # False until the first regex is true.
    # Then it's true until the second regex is true.
    next unless my $range = /$start_marker/ .. /$end_marker/;

    # Flip-flop likes to work with $_, but it's bad form to
    # continue to use $_
    my $line = $_;

    print $line if $range !~ /^1$|E/;
}

__END__
foo
bar
**load balancer** 
new 
old
good
bad
**end**
baz
biff

Outputs:

new 
old
good
bad

Upvotes: 2

TLP
TLP

Reputation: 67920

You can use the flip-flop operator to tell you when you are between the markers. It will also include the actual markers, so you'll need to except them from the data collection.

Note that this will mash together all the records if you have several, so if you do you need to store and reset @array somehow.

use strict;
use warnings;

my @array;
while (<DATA>) {
    if (/^load balancer$/ .. /^end$/) {
        push @array, $_ unless /^(load balancer|end)$/;
    }
}

print @array;

__DATA__
load balancer
new 
old
good
bad
end

Upvotes: 7

Axeman
Axeman

Reputation: 29854

For files like this, I often use a change in the Record Separator ( $/ or $RS from English )

use English qw<$RS>;
local $RS = "\nend\n";

my $record = <$open_handle>;

When you chomp it, you get rid of that line.

chomp( $record );

Upvotes: 0

JRFerguson
JRFerguson

Reputation: 7526

If you prefer a command line variation:

perl -ne 'print if m{\*load balancer\*}..m{\*end\*} and !m{\*load|\*end}' file

Upvotes: 0

Related Questions