Why are blank lines being matched in this regexp?

Question

G'day,

I am using the following Perl fragment to extract output from a Solaris cluster command.

open(CL,"$clrg status |");
my @clrg= grep /^[[:lower:][:space:]]+/, <CL>;
close(CL);

I get the following when I print the elements of the array @clrg (the "=>" and "<=" line delimiters are inserted by my print statement):

=><=
=>nas-rg             mcs0.cwwtf.bbc.co.uk   No          Online<=
=>                   mcs1.cwwtf.bbc.co.uk   No          Offline<=
=><=
=>apache-rg          mcs0.cwwtf.bbc.co.uk   No          Online<=
=>                   mcs1.cwwtf.bbc.co.uk   No          Offline<=
=><=

When I replace it with the following Perl fragment the blank lines are not matched.

open(CL,"$clrg status |");
my @clrg= grep /^[[:lower:][:space:]]{3,}/, <CL>;
close(CL);

And I get the following:

=>nas-rg             mcs0.cwwtf.bbc.co.uk   No          Online<=
=>                   mcs1.cwwtf.bbc.co.uk   No          Offline<=
=>apache-rg          mcs0.cwwtf.bbc.co.uk   No          Online<=
=>                   mcs1.cwwtf.bbc.co.uk   No          Offline<=

Simple question is why?

BTW Using {1,} in the second Perl fragment also matches blank lines!

Any suggestions gratefully received!

cheers,

Andomar · Accepted Answer

That'll be because [:space:] matches newlines and carriage returns as well.

So [[:space:]]+ would match \n, \r, or \r\n.

But [[:space:]]{3,} would require three characters, and an empty line is just a \n.

{1,} and + mean the same thing: match the preceding group one or more times.
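
As a quick check, here is a minimal sketch that tests a blank line against the three quantifiers. The string "\n" is an assumption standing in for an empty line read from the command output.

```perl
use strict;
use warnings;

# A blank line as read from a filehandle: only the newline character.
my $blank = "\n";

# + and {1,} both need at least one character from the class; "\n" qualifies.
my $plus     = $blank =~ /^[[:lower:][:space:]]+/    ? 1 : 0;
my $one_plus = $blank =~ /^[[:lower:][:space:]]{1,}/ ? 1 : 0;

# {3,} needs at least three such characters; "\n" supplies only one.
my $three_plus = $blank =~ /^[[:lower:][:space:]]{3,}/ ? 1 : 0;

print "+: $plus, {1,}: $one_plus, {3,}: $three_plus\n";
```

With a plain "\n" input, the first two flags come out as 1 and the last as 0, which mirrors the behaviour described in the question.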

P.S. A typical newline is \n on Unix and \r\n on Windows.
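
If the goal is to drop the blank lines without relying on {3,}, one possible approach is to also require a non-whitespace character somewhere on the line. This is only a sketch: the @lines array is hypothetical sample data modelled on the listing in the question, and it assumes every wanted line contains at least one non-whitespace character.

```perl
use strict;
use warnings;

# Sample lines standing in for the command output (hypothetical data
# modelled on the listing in the question).
my @lines = (
    "\n",
    "nas-rg             mcs0.cwwtf.bbc.co.uk   No          Online\n",
    "                   mcs1.cwwtf.bbc.co.uk   No          Offline\n",
    "\r\n",
);

# Keep lines that start with lowercase/space characters AND contain
# at least one non-whitespace character, so "\n" and "\r\n" are skipped.
my @clrg = grep { /^[[:lower:][:space:]]+/ && /\S/ } @lines;

# Strip the trailing newline before printing with the delimiters.
chomp(@clrg);
print "=>$_<=\n" for @clrg;
```

In the original fragment the same grep block would read from <CL> instead of @lines.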
