re.sub (python) substitute part of the matched string

Question

I have a series of strings which are identifiable by finding a substring "p" tag followed by at least two CAPITAL letters.

Input:

JIM SALLY ROBERT

Eric

I want to change the "p" tag to an "i" tag if it's followed by those two capital letters (so not the last one, 'Eric').

Desired output:

JIM SALLY ROBERT Eric

I've tried this using regular expressions in Python:

import re Mytext = "JIM SALLY ROBERT Eric" changeTags = re.sub(' [A-Z]{2}', '' + re.search('
[A-Z]{2}', Mytext).group()[-2:], Mytext) print changeTags

But the output uses "i" tag + JI in every instance, rather than interating through to use SA and then RO in entries 2 and 3.

JIM JILLY JIBERT Eric

I believe the problem is that I don't understand the .group() method properly. Can anyone advise what I've done wrong?

Thank you.

Juan Diego Godoy Robles · Accepted Answer

Another way using look-ahead assertion:

re.sub(r'(?=[A-Z]{2,})','',MyText)

re.sub (python) substitute part of the matched string

Answers (2)

Related Questions