Perl Use of uninitialized value in addition when all values are initialized

Question

I am attempting to write a script that calculates the order parameter for several carbon to hydrogen bonds and outputs those values. The math is trivial but I am getting a "Use of uninitialized value in addition" error when I attempt to average the values at the end. I am well aware of how common and easily fixed this error is but I have checked all values given, there is a value for all 9212 values ( I checked by printing each, putting this into an excel document and searching for empty cells). I am at a loss and I am not sure how to further debug.

My script takes an input file, goes line by line, takes the x,y,z coords if certain strings are present, does math on these coords (finds the angle between two vectors and the z-axis), should be averaging each $integer section together (so average of all 2's etc). It does this for 3 segments (2-8, 9-10, and 11-18), saves these to two arrays (@theta_values and @theta2_values) and finally it should average each "integer" together to find the average angle between the vector and the z-axis. In total there should be 34 values output, which does happen but each value has a "Use of uninitialized value in addition (+) at angle_checker_v3.pl line 334, line 34303." error, and all averages other than the first are too small.

For reference, line 334 is where I average and line 34303 is the last line of the file.

Some sample data would be:

ATOM   2199  C22 POPC    1      -9.427  11.863  11.706  1.00  0.00      MEMB
ATOM   2200  H2R POPC    1     -10.347  11.662  12.293  1.00  0.00      MEMB
ATOM   2201  H2S POPC    1      -8.968  10.895  11.443  1.00  0.00      MEMB
ATOM   2211  C23 POPC    1      -9.801  12.641  10.423  1.00  0.00      MEMB
ATOM   2212  H3R POPC    1     -10.136  13.667  10.696  1.00  0.00      MEMB
ATOM   2213  H3S POPC    1     -10.658  12.124   9.934  1.00  0.00      MEMB
ATOM   2214  C24 POPC    1      -8.663  12.751   9.396  1.00  0.00      MEMB
ATOM   2215  H4R POPC    1      -7.763  13.166   9.894  1.00  0.00      MEMB
ATOM   2216  H4S POPC    1      -8.961  13.479   8.607  1.00  0.00      MEMB

*I intentionally skipped 10 atoms that did not matter above

In order the columns denote: substance (not relevant), atom number, type of atom, residue number/molecule type, residue number, x-coord, y-coord, z-coord, alpha number (not relevant), beta column (not relevant), and overall molecule type.

TLDR; My averaging script:

#Averaging theta values
for (my $t=2; ($t <= 18); $t++) {
    for (my $j=1; ($j <= $lipid_num); $j++) {
            $sum[$t]= $theta_values[$t][$j] + $sum[$t];
    }
    $average[$t]= $sum[$t] / $lipid_num;
    print "Average theta for carbon $t is $average[$t]
";
}

#Averaging Theta2 values
for (my $q=2; ($q <= 18); $q++) {
        for (my $b=1; ($b <= $lipid_num); $b++) {
                $sum2[$q]= $theta2_values[$q][$b] + $sum2[$q];
        }
        $average2[$q]= $sum2[$q] / $lipid_num;
        print "Average theta2 for carbon $q is $average2[$q]
";
}

Does not find values at all positions even though I have verified that there are values at all positions.

This is the full script, I realize how large it is.

        #Usage:                                                                      #
# perl angle_checker.pl [granuphilin_prot-memb_system].pdb 
#!/usr/bin/perl 

use strict;
use warnings;
use Math::Trig;

my $inputfile = $ARGV[0];

open (INPUTFILE, "<", $inputfile) or die $!;

my @data = ;

#Quick Change Variables

my $lipid_num = 256;

#Library
my @sum;
my @average;
my @sum2;
my @average2;
my @x1;
my @y1;
my @z1;
my $R = 'R';
my $S = 'S';
my $one = '1';
my @theta_values;
my @theta2_values;
my @vectorCtoHR;
my @vectorCtoHS;
my @normal;

#Start for lipid count
for (my $lipid=1; ($lipid <= $lipid_num); $lipid++) {
    # First Carbon/Integer counter
    for (my $integer= 2; ($integer <= 8); $integer++) {
            #Split line 1
            for (my $line = 0; $line <= $#data; ++$line) {
                    #Search 1.1
                    if(($data[$line] =~ m/\s+C2$integer\s+/)&&($data[$line] =~ m/\s+$lipid\s+/)&&($data[$line] =~ m/\s+POPC\s+/)) {
                            chomp $data[$line];
                            my @splitline = (split /\s+/, $data[$line]);
                            foreach (@splitline) {
                                    $x1[0]= $splitline[5];
                                    $y1[0]= $splitline[6];
                                    $z1[0]= $splitline[7];
                            }
                    }
                    #Search 1.2
                        if(($data[$line] =~ m/\s+H$integer$R\s+/)&&($data[$line] =~ m/\s+$lipid\s+/)&&($data[$line] =~ m/\s+POPC\s+/)) {
                                my @splitline = (split /\s+/, $data[$line]);
                            foreach (@splitline) {
                                    $x1[1]= $splitline[5];
                                        $y1[1]= $splitline[6];
                                        $z1[1]= $splitline[7];
                            }
                    }
                    #Search 1.3
                        if(($data[$line] =~ m/\s+H$integer$S\s+/)&&($data[$line] =~ m/\s+$lipid\s+/)&&($data[$line] =~ m/\s+POPC\s+/)) {
                                my @splitline = (split /\s+/, $data[$line]);
                            foreach (@splitline) {
                                    $x1[2]= $splitline[5];
                                        $y1[2]= $splitline[6];
                                        $z1[2]= $splitline[7];
                            }
                    }
            }



    #Z-axis
    $normal[0]= 0;
    $normal[1]= 0;
    $normal[2]= 100;

    #Vector 1
    $vectorCtoHR[0]=($x1[0] - ($x1[1]));
    $vectorCtoHR[1]=($y1[0] - ($y1[1]));
    $vectorCtoHR[2]=($z1[0] - ($z1[1]));

    #Vector 2
    $vectorCtoHS[0]=($x1[0] - ($x1[2]));
        $vectorCtoHS[1]=($y1[0] - ($y1[2]));
        $vectorCtoHS[2]=($z1[0] - ($z1[2]));

    #First Angle

    my $x1mag = sqrt(($vectorCtoHS[0]**2)+($vectorCtoHS[1]**2)+($vectorCtoHS[2]**2));
    my $x2mag = sqrt(($normal[0]**2)+($normal[1]**2)+($normal[2]**2));

    #Dot product 
    my $dotproduct = (($vectorCtoHS[0]*$normal[0])+($vectorCtoHS[1]*$normal[1])+($vectorCtoHS[2]*$normal[2]));

    my $theta = acos($dotproduct/($x1mag*$x2mag));
    $theta_values[$integer][$lipid]= $theta;

    # Second Angle  
        my $x3mag = sqrt(($vectorCtoHR[0]**2)+($vectorCtoHR[1]**2)+($vectorCtoHR[2]**2));

        my $dotproduct2 = (($vectorCtoHR[0]*$normal[0])+($vectorCtoHR[1]*$normal[1])+($vectorCtoHR[2]*$normal[2]));

        my $theta2 = acos($dotproduct2/($x3mag*$x2mag));
        $theta2_values[$integer][$lipid]= $theta2;
    }
    #Section 2 Search These only have one hydrogen to search for, hence 1 less search
    for (my $integer = 9; ($integer <= 10); $integer++) {
                for (my $line = 0; $line <= $#data; ++$line) {
                        if(($data[$line] =~ m/\s+C2$integer\s+/)&&($data[$line] =~ m/\s+$lipid\s+/)&&($data[$line] =~ m/\s+POPC\s+/)) {
                                chomp $data[$line];
                                my @splitline = (split /\s+/, $data[$line]);
                                foreach (@splitline) {
                                        $x1[0]= $splitline[5];
                                        $y1[0]= $splitline[6];
                                        $z1[0]= $splitline[7];
                                }
                        }
                    if(($data[$line] =~ m/\s+H$integer$one\s+/)&&($data[$line] =~ m/\s+$lipid\s+/)&&($data[$line] =~ m/\s+POPC\s+/)) {
                                my @splitline = (split /\s+/, $data[$line]);
                                foreach (@splitline) {
                                        $x1[1]= $splitline[5];
                                        $y1[1]= $splitline[6];
                                        $z1[1]= $splitline[7];
                                }
                        }
                }
    $normal[0]= 0;
    $normal[1]= 0;
    $normal[2]= 100;
    $vectorCtoHR[0]=($x1[0] - ($x1[1]));
    $vectorCtoHR[1]=($y1[0] - ($y1[1]));
    $vectorCtoHR[2]=($z1[0] - ($z1[1]));

    my $x1mag = sqrt(($vectorCtoHR[0]**2)+($vectorCtoHR[1]**2)+($vectorCtoHR[2]**2));
    my $x2mag = sqrt(($normal[0]**2)+($normal[1]**2)+($normal[2]**2));

    #Dot product 
    my $dotproduct = (($vectorCtoHR[0]*$normal[0])+($vectorCtoHR[1]*$normal[1])+($vectorCtoHR[2]*$normal[2]));

    my $theta = acos($dotproduct/($x1mag*$x2mag));
    $theta_values[$integer][$lipid]= $theta;
    $theta2_values[$integer][$lipid]= $theta;
    }

    #Effectively the same as section 1
    for (my $integer= 11; ($integer <= 18); $integer++) {
            for (my $line = 0; $line <= $#data; ++$line) {
                    if(($data[$line] =~ m/\s+C2$integer\s+/)&&($data[$line] =~ m/\s+$lipid\s+/)&&($data[$line] =~ m/\s+POPC\s+/)) {
                            chomp $data[$line];
                            my @splitline = (split /\s+/, $data[$line]);
                            foreach (@splitline) {
                                    $x1[0]= $splitline[5];
                                    $y1[0]= $splitline[6];
                                    $z1[0]= $splitline[7];
                            }
                    }
                        if(($data[$line] =~ m/\s+H$integer$R\s+/)&&($data[$line] =~ m/\s+$lipid\s+/)&&($data[$line] =~ m/\s+POPC\s+/)) {
                                my @splitline = (split /\s+/, $data[$line]);
                            foreach (@splitline) {
                                    $x1[1]= $splitline[5];
                                        $y1[1]= $splitline[6];
                                        $z1[1]= $splitline[7];
                            }
                    }
                        if(($data[$line] =~ m/\s+H$integer$S\s+/)&&($data[$line] =~ m/\s+$lipid\s+/)&&($data[$line] =~ m/\s+POPC\s+/)) {
                                my @splitline = (split /\s+/, $data[$line]);
                            foreach (@splitline) {
                                    $x1[2]= $splitline[5];
                                        $y1[2]= $splitline[6];
                                        $z1[2]= $splitline[7];
                            }
                    }
            }
    $normal[0]= 0;
    $normal[1]= 0;
    $normal[2]= 100;

    $vectorCtoHR[0]=($x1[0] - ($x1[1]));
    $vectorCtoHR[1]=($y1[0] - ($y1[1]));
    $vectorCtoHR[2]=($z1[0] - ($z1[1]));

    $vectorCtoHS[0]=($x1[0] - ($x1[2]));
        $vectorCtoHS[1]=($y1[0] - ($y1[2]));
        $vectorCtoHS[2]=($z1[0] - ($z1[2]));

    #First Angle

    my $x1mag = sqrt(($vectorCtoHS[0]**2)+($vectorCtoHS[1]**2)+($vectorCtoHS[2]**2));
    my $x2mag = sqrt(($normal[0]**2)+($normal[1]**2)+($normal[2]**2));

    #Dot product 
    my $dotproduct = (($vectorCtoHS[0]*$normal[0])+($vectorCtoHS[1]*$normal[1])+($vectorCtoHS[2]*$normal[2]));

    my $theta = acos($dotproduct/($x1mag*$x2mag));
    $theta_values[$integer][$lipid]= $theta;
    }
print "done with $lipid
";
#End of lipid search
}
#Averaging starts now

#Averaging theta values
for (my $t=2; ($t <= 18); $t++) {
    for (my $j=1; ($j <= $lipid_num); $j++) {
            $sum[$t]= $theta_values[$t][$j] + $sum[$t];
    }
    $average[$t]= $sum[$t] / $lipid_num;
    print "Average theta for carbon $t is $average[$t]
";
}

#Averaging Theta2 values
for (my $q=2; ($q <= 18); $q++) {
        for (my $b=1; ($b <= $lipid_num); $b++) {
                $sum2[$q]= $theta2_values[$q][$b] + $sum2[$q];
        }
        $average2[$q]= $sum2[$q] / $lipid_num;
        print "Average theta2 for carbon $q is $average2[$q]
";
}

knytshade · Accepted Answer

Turns out I was adding my $sum[$t], which did not have a value, to something that did and this was giving the error. To fix this I changed from:

$sum[$t]= $theta_values[$t][$j] + $sum[$t];

To:

$sum[$t]+= $theta_values[$t][$j];

Thank you all for the help.

Perl Use of uninitialized value in addition when all values are initialized

Answers (2)

Related Questions