How to sum up numbers in my file?

Question

I have a folder, my_folder, which contains over 800 files, myfile_* where * is the unique ID for each file. In my file I basically have a variety of repeated fields but the one I am interested in is the field. Lines of this field look like the following: n where n is the rating score. These lines occur every 14th line, starting at line 10 (10 + 14i) and ending when the file ends. It is my job to write a script, myscript.sh, to sum up all values of n per file in my folder and then sort from highest to smallest. The output would look as follows

myfile_1234 5112
myfile_5214 2134
myfile_6124 1233
...

where the number suffixes are the sum of n per file. My files vary in length dramatically from as little as 20 fields to as many as 2500. How would I go about doing this? I figure that I will use some form of grep command to find occurences of and then sum up the numbers following the occurences, or maybe could use the fact that the lines occur every 10 + 14i lines, starting at 10. Thanks for your time any suggestions are much appreciated.

Input File:

2.5
$155


Jeter5
I hope we're not disappointed! We enjoyed New Orleans...
Dec 19, 2008
-1
-1
4
-1
3
5
3
5
5
5

...
repeat fields again...

The script must take the folder name as an argument in the command line, such as ./myscript.sh my_folder

Chris Lear · Accepted Answer

Here's my solution:

#/bin/bash
dir=$1

grep -P -o '(?<=).*' $dir/* |awk -F: '{A[$1]+=$2;next}END{for(i in A){print i,A[i]}}'|sort -n -k2

Looks like the sort at the end wasn't needed, so you could remove that.

How to sum up numbers in my file?

Answers (2)

Related Questions