Reading and processing files word by word efficiently in perl

Question

I'm very new to perl and wanted to know how I could make this bit here faster. Here is is my current code. Any help is very much appreciated.

#!/usr/bin/perl

use strict;
use warnings;

open( FILE_IN, "extracted.txt" ) or die "$!";

print "Extracting inputs
";

while () {
    if ( $_ =~ m/^second_word/ ) {
        my @filepath2 = split (/\s+/, $_);
        print FILE_OUT $filepath2[1]."
";
    }
    if ($_ =~ m/^first_word/ ) {
        my @filepath1 = split (/\s+/, $_);
        print FILE_OUT $filepath1[1]."
";
    }
}

exit;

My input file, practicecase.txt, is simply:

first_word some/filepath
second_word another/filepath

My output file, extracted.txt, looks like:

some/filepath
another/filepath

Thank you so much!

Borodin · Accepted Answer

This is about as fast as your algorithm is going to go. The optimisations I have made are to use a single regex pattern to find either first_word or second_word at the beginning of the line, and to use the same pattern to capture the second field in the line

#!/usr/bin/perl

use strict;
use warnings;
use 5.010;
use autodie;

open my $in_fh,  '<', 'practicecase.txt';

open my $out_fh, '>', 'extracted.txt';
select $out_fh;

print "Extracting inputs
";

while ( <$in_fh> ) {
    print "$1
" if / ^ (?:first|second)_word \s+ (\S+) /x;
}

Reading and processing files word by word efficiently in perl

Answers (2)

Related Questions