How to extract certain part of the text into another file?

Question

Here is my code. I wish to extract part of the text and write into another file. The loop of the code do not stop at my selected range of text. It read until the final match line of word. Please advise me. Thanks. For example, I need to extract the $ NAME: sandy until $$.TO and then join with the contents inside $NAME: patrick which is start from G1 until $$SRU.

TEXT:

$ NAME : corry  
$$.Inc s d
$$.Oc s
$$.TO

G1 ty n1 EE EE M T1 T2 $$SRU
G2 n1 y OO OO M T3 T4 $$SRU    
$$.EON

$ NAME : patrick    
$$.Inc c d
$$.Oc c
$$.TO

G1 td n3 EE EE M T5 T6 $$SRU      
G2 n3 y OO OO M T7 T8 $$SRU    
$$.EON
$ NAME : sandy    
$$.Inc k l
$$.Oc l
$$.TO

G1 td n3 FF FF M R5 R6 $$SRU      
G2 n3 y OO OO N R7 R8 $$SRU    
$$.EON

OUTPUT:eg.

$ NAME : sandy    #from sandy section
$$.Inc k l      #sandy
$$.Oc l         #sandy
$$.TO           #sandy
G1.G1o.n ty n1 EE EE M T1 T2 $$SRU #from Patrick section
G2.G2o.n n1 y OO OO M T3 T4 $$SRU   #Patrick 
Fe.id.n ty n1 EE EE N T1 T2 $$SRU #corry
Fr.in.p n1 y OO OO N T3 T4 $$SRU   #corry 
$$.EON     #Patrick

CODE:

use strict;
use warnings;

open my $F1, '<', 'testing.txt' or die "failed $!";
open my $F2, '>', 'out.txt' or die "failed $!";

while (<$F1>) {
 if (/^\$ NAME : sandy/../\$.TO/) {
 print $F2 $_;
 }
 if (/^\$ NAME : patrick/../\$.EON/) {
 if(/^G1/../\$SRU/){
 s/G1/G1.G1o.n/g;
 print $F2 $_;}
}

 }
close $F1;
close $F2;

Suic · Accepted Answer

You can parse all the file to one big hash, and do everything you want with its elements: combine, change etc

use strict;
use warnings;
use Data::Dumper;

open my $F1, '<', 'in' or die "failed $!";
open my $F2, '>', 'out.txt' or die "failed $!";


my %elements;
my $current_element;
while (<$F1>) {
    if ( /^\$ NAME : (\w+)/ .. /\$\$[.]EON/ ) {
        if ( /^\$ NAME : (\w+)/ ) {
            $current_element = $1;
        }
        if ( /^G1/ ) {
            $elements{$current_element}->{g1} .= $_;
        }
        elsif ( /^G2/ ) {
            $elements{$current_element}->{g2} .= $_;
        }
        elsif ( ! /\$\$[.]EON/ ) {
            $elements{$current_element}->{text} .= $_;
        }

    }
}
close $F1;
$elements{patrick}->{g1} =~ s/G1/G1.G1o.n/;
$elements{patrick}->{g2} =~ s/G1/G2.G2o.n/;
$elements{corry}->{g1} =~ s/G1/Fe.id.n/;
$elements{corry}->{g2} =~ s/G2/Fr.in.p/;
print $F2 "$elements{sandy}->{text}$elements{patrick}->{g1}$elements{patrick}->{g2}$elements{corry}->{g1}$elements{corry}->{g2}
\$\$.EON";
close $F2;

this will parse all the file to hash that looks like:

$elements{'name (for example patric'}->{text} = 'everithing in patric section except G1 and G2 section'
$elements{'name (for example patric'}->{g1} = 'G1 section'
$elements{'name (for example patric'}->{g2} = 'G2 section'

so if you want to combine text from sandy and G1 from patric you can do

my $sandy_patric = $elements{sandy}->{text}.$elements{patrick}->{g1};

How to extract certain part of the text into another file?

Answers (2)

Related Questions