AWK: How do I compute the total hours from a CSV file

Question

I have this file which reads

001,Time-in,2017-06-25 08:04:42,08:00:00,
001,Time-out,2017-06-25 17:04:42,17:00:00,
001,Time-in,2017-06-25 18:04:42,18:00:00,
001,Time-out,2017-06-25 22:04:42,22:00:00,
...

where field 1 is the ID number; 2 is the action performed; 3 is the exact timestamp; and 4 is the rounded off time.

I would like to calculate the total hours per ID based on field 4. I know I can use the formula

((Out2+Out1)-(In2+In1))
or
((Out1-In1)+(Out2-In2))

to get the total hours, but I'm quite stuck as to how I should begin.

I would like to get this output:

001,13
002,12
..,..
..,..

Where field 1 is the ID and 2 will be the total hours computed. Also, please note that the real file would be jumbled and not sorted like the example above. If any of the required entries are missing, i.e one time-out missing etc., it should just print that it skipped that particular ID.

Any thoughts regarding this would be extremely helpful. Thanks.

Ed Morton · Accepted Answer

$ cat tst.awk
BEGIN { FS="[-, :]" }
$3=="in"  { tin[$1]  += $10 }
$3=="out" { tout[$1] += $10 }
END {
    for (key in tin) {
        print key, tout[key] - tin[key]
    }
}

$ awk -f tst.awk file
001 13

AWK: How do I compute the total hours from a CSV file

Answers (2)

Related Questions