How to subtract numbers from lines to get time difference

Question

So for a data like :

01:58:30| USER INPUT : "Hello " 
01:58:30| SYSTEM RESPONSE: "Hello. How are you" 
01:58:56| USER INPUT : "Good thank you. How about you?" 
01:58:57| SYSTEM RESPONSE: "I am doing great!" 
01:59:13| USER INPUT : "Thats it" 
01:59:15| SYSTEM RESPONSE: "Deal"
13:29:28| USER INPUT : "Deal"

I want to subtract the time it took for response for each line For example:

01:58:30| USER INPUT : "Hello " 
<0 seconds>
01:58:30| SYSTEM RESPONSE: "Hello. How are you" 
<26 seconds>
01:58:56| USER INPUT : "Good thank you. How about you?" 
<1 seconds>
01:58:57| SYSTEM RESPONSE: "I am doing great!" 
<16 seconds>
01:59:13| USER INPUT : "Thats it" 
<2 seconds>
01:59:15| SYSTEM RESPONSE: "Deal"

so far, I know how to calculate the time difference:

from datetime import datetime
s1 = '01:59:13'
s2 = 01:59:15' # for example
format = '%H:%M:%S'
time = datetime.strptime(s2, format) - datetime.strptime(s1, format)
print time

I could use any suggestions to get just the a way to read lines. Please feel free to ask me more clarification any time!

Andrej Kesely · Accepted Answer

You can use re module for extracting the time data. I wrote simple generator that takes string input and outputs all lines along with the time interval between them:

string_input = """
01:58:30| USER INPUT : "Hello "
01:58:30| SYSTEM RESPONSE: "Hello. How are you"
01:58:56| USER INPUT : "Good thank you. How about you?"
01:58:57| SYSTEM RESPONSE: "I am doing great!"
01:59:13| USER INPUT : "Thats it"
01:59:15| SYSTEM RESPONSE: "Deal"
13:29:28| USER INPUT : "Deal"
"""

import re
from datetime import datetime

def get_time(data):
    groups = re.findall(r'(([\d:]+)\|.*)', string_input)
    time_format = '%H:%M:%S'

    t1, t2 = None, None
    for (line1, time1), (line2, time2) in zip(groups, groups[1::1]):
        time1 = datetime.strptime(time1, time_format)
        time2 = datetime.strptime(time2, time_format)
        total_time = int((time2 - time1).total_seconds())
        singular_or_plural = 'second' if total_time == 1 else 'seconds'
        yield f'{line1}
<{total_time} {singular_or_plural}>'
    yield f'{line2}'

for line in get_time(string_input):
    print(line)

Output is:

01:58:30| USER INPUT : "Hello "
<0 seconds>
01:58:30| SYSTEM RESPONSE: "Hello. How are you"
<26 seconds>
01:58:56| USER INPUT : "Good thank you. How about you?"
<1 second>
01:58:57| SYSTEM RESPONSE: "I am doing great!"
<16 seconds>
01:59:13| USER INPUT : "Thats it"
<2 seconds>
01:59:15| SYSTEM RESPONSE: "Deal"
<41413 seconds>
13:29:28| USER INPUT : "Deal"

How to subtract numbers from lines to get time difference

Answers (2)

Related Questions