Sum and replace in awk based on duplicate column

Question

I have a file that contains the following:

z,cat,7,9,bar
x,dog,9,9,bar
y,dog,3,4,foo
s,cat,3,4,bar
t,boat,21,1,foo
u,boat,19,3,bar

and i need to reach this result:

x,cat,10,13,x
x,dog,12,13,x
x,boat,40,4,x

i was trying something similar to

awk '{a[$NF]+=$1}END{for(x in a) printf "%s  %s
",x,a[x]}'

but what happens with this approach is that when you put more columns, it breaks the hole thing, because rows 1,2 and 5 can contain alpha numeric characters

Jotne · Accepted Answer

This should do;

awk -F, '{arr1[$2]+=$3;arr2[$2]+=$4} END {for (i in arr1) print "x",i,arr1[i],arr2[i],"x"}' OFS=, file
x,cat,10,13,x
x,boat,40,4,x
x,dog,12,13,x

Sum and replace in awk based on duplicate column

Answers (2)

Related Questions