Regex only finds results once

Question

I'm trying to find any text between a '>' character and a new line, so I came up with this regex:

result = re.search(">(.*)
", text).group(1)

It works perfectly with only one result, such as:

>test1
(something else here)

Where the result, as intended, is

test1

But whenever there's more than one result, it only shows the first one, like in:

>test1
(something else here)
>test2
(something else here)

Which should give something like

test1
test2

But instead just shows

test1

What am I missing? Thank you very much in advance.

Sweeper · Accepted Answer

re.search only returns the first match, as documented:

Scan through string looking for the first location where the regular expression pattern produces a match, and return a corresponding MatchObject instance.

To find all the matches, use findall.

Return all non-overlapping matches of pattern in string, as a list of strings. The string is scanned left-to-right, and matches are returned in the order found.

Here's an example from the shell:

>>> import re
>>> re.findall(">(.*)
", ">test1
xxx>test2
xxx")
['test1', 'test2']

Edit: I just read your question again and realised that you want "test1 test2" as output. Well, just join the list with :

>>>  "
".join(re.findall(">(.*)
", ">test1
xxx>test2
xxx"))
'test1
test2'

Regex only finds results once

Answers (2)

Explanation -

Related Questions