awk counting number of digits within a given range

Question

How can I count the number of times a digit within a given range of numbers in a field occurs?

For example, the raw text foo.txt is shown below:

2,3,4,2,4
2,3,4,32,4
2,3,4,12,4
2,3,4,4,4
2,3,4,,4
2,3,4,15,4
2,3,4,15,4

I want to count the number of times a digit in field #4 falls between the following ranges: [0,10) and [10,20), where the lower bound is inclusive and the upper bound is not.

The result should be:

range 0-10: 2 range 10-20: 3

Here is my awk code below, but I am getting 8600001 for both ranges, awk -f prog.awk foo.txt:

#!/usr/range/awk
# prog.awk

BEGIN {
    FS=",";
    $range1=0;
    $range2=0;
}
$4 ~ /[0-9]/ && $4 >= 0 && $4 < 10 { $range1 += 1 };
$4 ~ /[0-9]/ && $4 >= 10 && $4 < 20 { $range2 += 1 };
END {
    print $range1, "	", $range2;
}

karakfa · Accepted Answer

another awk

$ awk -F, '$4>=0{a[int($4/10)]++} 
             END{print "range 0-10:" a[0],"range 10-20:" a[1]}' file

range 0-10:2 range 10-20:3

can be easily expanded to cover the full range

$ awk -F, '$4>=0{a[int($4/10)]++} 
             END{for(k in a) print "range ["k*10"-"(k+1)*10"):", a[k]}' file

range [0-10): 2
range [10-20): 3
range [30-40): 1

awk counting number of digits within a given range

Answers (2)

How it works

Multiline version

Modified version of original code

Related Questions