How to divide my script output by the output of another command?

Question

I have a folder, my_folder, which contains over 800 files named myfile_*.dat, where * is the unique ID for each file. Each file contains a variety of repeated fields, but I am only interested in the rating field. Each line of this field holds a value n, where n is the rating score. I have a script which sums up all of the ratings per file, but now I must divide each sum by the number of rating lines in that file to obtain an average rating per file. Here is my script:

dir=$1
cd "$dir" || exit 1
grep -P -o '(?<=).*' * |awk -F: '{A[$1]+=$2;next}END{for(i in A){print i,A[i]}}'|sort -nr -k2

I figure I could use grep -c myfile_*.dat to count the matching lines in each file and then divide each sum by that count, but I do not know where to put this in my script. Any suggestions are appreciated.

My script takes the folder name as an argument in the command line.

INPUT FILE


$155


Jeter5
I hope we're not disappointed! We enjoyed New Orleans...
Dec 19, 2008
-1
-1
4
-1
3
5
3
5
5
5

...
repeat fields again...

miken32 · Accepted Answer

Just set up another array L to track the count of items:

grep -P -o '(?<=).*' * |
awk -F: '{A[$1]+=$2;L[$1]++;next}END{for(i in A){print i,A[i],A[i]/L[i]}}' |
sort -nr -k2
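
To see the two-array idea in isolation, here is a minimal sketch that can be run without the .dat files. The function name average_by_key and the sample "file:rating" lines are made up for illustration; they stand in for what grep -o prints when given several files. Unlike the pipeline above, this sketch sorts on the third column (the average) rather than the second (the sum), which is an assumption about what ordering is wanted.

```shell
#!/bin/sh
# Hypothetical helper: reads "key:value" lines on stdin and prints
# "key sum average" per key, highest average first.
average_by_key() {
    LC_ALL=C awk -F: '
        { sum[$1] += $2; count[$1]++ }   # running total and line count per key
        END {
            for (k in sum)
                printf "%s %d %.2f\n", k, sum[k], sum[k] / count[k]
        }
    ' | LC_ALL=C sort -k3,3nr            # numeric, descending, on the average column
}

# Made-up sample input standing in for the grep output
printf '%s\n' \
    'myfile_1.dat:4' 'myfile_1.dat:5' \
    'myfile_2.dat:3' 'myfile_2.dat:2' 'myfile_2.dat:4' |
    average_by_key
```

Because count[k] is incremented on the same lines that feed sum[k], the divisor always matches the number of values summed, so no separate grep -c pass is needed.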
