Split list into sub-lists based on integer in string

Question

I have a list of strings as such:

['text_1.jpg', 'othertext_1.jpg', 'text_2.jpg', 'othertext_2.jpg', ...]

In reality, there are more entries than 2 per number but this is the general format. I would like to split this list into list of lists as such:

[['text_1.jpg', 'othertext_1.jpg'], ['text_2.jpg', 'othertext_2.jpg'], ...]

These sub-lists being based on the integer after the underscore. My current method to do so is to first sort the list based on the numbers as shown in the first list sample above and then iterate through each index and copy the values into new lists if it matches the value of the previous integer.

I am wondering if there is a simpler more pythonic way of performing this task.

Andrej Kesely · Accepted Answer

Try:

import re

lst = ["text_1.jpg", "othertext_1.jpg", "text_2.jpg", "othertext_2.jpg"]

r = re.compile(r"_(\d+)\.jpg")
out = {}
for val in lst:
    num = r.search(val).group(1)
    out.setdefault(num, []).append(val)

print(list(out.values()))

Prints:

[['text_1.jpg', 'othertext_1.jpg'], ['text_2.jpg', 'othertext_2.jpg']]

Split list into sub-lists based on integer in string

Answers (2)

Related Questions