Get elements form list based on content of each element of list

Question

I'm just starting to learn and faced one problem in Python.

I have a srt doc (subtitles). Name - sub. It looks like:

8
00:01:03,090 --> 00:01:05,260
MATER: Yes, sir, you did.
(MCQUEEN GASPS)

9
00:01:05,290 --> 00:01:07,230
You used to say
that all the time.

In Python it looks like:

'3', '00:00:46,570 --> 00:00:48,670', 'MCQUEEN: Okay, here we go.', '', '4', '00:00:48,710 --> 00:00:52,280', 'Focus. Speed. I am speed.', '', '5', '00:00:52,310 --> 00:00:54,250', '(ENGINES ROARING)', '',

Also, I had a list of words (name - noun). It looks like:

['man', 'poster', 'motivation', 'son' ... 'boy']

Let's look at this example:

...'4', '00:00:48,710 --> 00:00:52,280', 'Focus. Speed. I am speed.', '', '5',....

What I need to do is to find word from the list in the subtitles (first apperrence, as an illustrtion, "Speed") and get into list the time of the word appearence (00:00:48,710 --> 00:00:52,280) and sequence number (4), which is located before the time in the document. I was trying to get this information with indx but unfortunately I did not succeed.

Can you help me how to do this?)

Oleh · Accepted Answer

Continuing with Anton vBR's suggestion:

words=['ingonyama','king']
results=[]
for w in words:
    for row in df.itertuples():
        if row[2] is not None:
            if w in row[2].lower():
                results.append((w, row[0], row[1]))
        if row[3] is not None:
            if w in row[3].lower():
                results.append((w, row[0], row[1]))
print(results)

You'll get a list of tuples, each of which contains a word you're searching for, a sequence number where it appears, and a time-frame where it appears. Then you can write these tuples to a csv file or whatever. Hope this helps.

Get elements form list based on content of each element of list

Answers (2)

Related Questions