How does Python Re Module work in this examle?

Question

What is the process of matching this regular expression? I don't get why the explicit group is 'c'. This is piece of code is taken from Python Re Module Doc.

>>> m = re.match("([abc])+", "abc")
>>> m.group()
'abc'
>>> m.groups()
('c',)

Also, what about:

>>> m = re.match("([abc]+)", "abc")
>>> m.group()
'abc'
>>> m.groups()
('abc',)

And:

>>> m = re.match("([abc])", "abc")
>>> m.group()
'a'
>>> m.groups()
('a',)

Thanks.

Jon Clements · Accepted Answer

re.match("([abc])+", "abc")

Matches a group consisting of a, b or c. The group at the end of that is the last character found in the character class as matching is greedy so, ends up with the last matching character which is c.

m = re.match("([abc]+)", "abc")

Matches a group that contains one or more consecutive occurences of a, b or c. The matching group at the end is the largest contingious group of a, b or c.

re.match("([abc])", "abc")

Matches either a, b or c. The match group will always be the first matching character at the start of the string.

How does Python Re Module work in this examle?

Answers (2)

Related Questions