Perl - Split string on comma. Ignore whitespace

Question

I have this string:

$str="     a, b,    c>d:e,  f,    g ";

In this string there might be spaces and/or tabs

I split the string in perl:

my (@COLUMNS) = split(/[\s	,]+/, $str));

But this creates a leading space in position [0].

@COLUMNS=[

    a
    b
    c>d:e
    f
    g
]

I want this:

@COLUMNS=[
    a
    b
    c>d:e
    f
    g
]

Borodin · Accepted Answer

I suggest that you use a global regex match to find all subsequences of characters that are neither commas nor whitespace

It will produce the same output as your split(/[\s ,]+/. (Note that the is superfluous because \s also matches tabs.) But will create a list without any empty elements

use strict;
use warnings 'all';

my $str = "     a, b,    c>d:e,  f,    g ";

my @columns = $str =~ /[^\s,]+/g;

use Data::Dump;
dd \@columns;

output

["a", "b", "c>d:e", "f", "g"]

Note that, just like your split, this method will ignore any empty fields: something like a,,,b will return [ 'a', 'b' ] instead of [ 'a', '', '', 'b' ]. Also, columns that contain whitespace will be split, so a,two words,b will produce [ 'a', 'two', 'words', 'b' ] instead of [ 'a', 'two words', 'b' ]. Only you can tell whether these situations are likely to arise

If there is any chance that this method will produce the wrong results, then it is better to simply split on commas and write a subroutine to trim the resulting fields

use strict; 
use warnings 'all';

sub trim(;$);

my $str="     a  ,, ,two words ,,, b";
my @columns = map trim, split /,/, $str;

use Data::Dump;
dd \@columns;


sub trim(;$) {
    (my $trimmed = $_[0] // $_) =~ s/\A\s+|\s+\z//g;
    $trimmed;
}

output

["a", "", "", "two words", "", "", "b"]

Perl - Split string on comma. Ignore whitespace

Answers (2)

output

output

Related Questions