How to replace/modify a pattern by regular expression in python?

Question

Assume that I want to modify all patterns in a script, take one line as an example:

line = "assert Solution().oddEvenList(genNode([2,1,3,5,6,4,7])) == genNode([2,3,6,7,1,5,4]), 'Example 2'"

Notice that function genNode is taking List[int] as the parameter. What I want is to remove the List, and keep the all the integers in the list, so that the function is actually taking *nums as the parameters.

Expecting:

line = "assert Solution().oddEvenList(genNode(2,1,3,5,6,4,7)) == genNode(2,3,6,7,1,5,4), 'Example 2'"

I've come up with a re pattern

r"([g][e][n][N][o][d][e][(])([[][0-9\,\s]*[]])([)])"

but I am not sure how I could use this... I can't get re.sub to work as it requires me to replace with a fixed string.

How can I achieve my desired result?

heemayl · Accepted Answer

You can do:

re.sub(r'(genNode\()$$([^]]+)$$', r'\1\2', line)

(genNode\() matches genNode( and put it in captured group 1
$$ matches literal [
([^]]+) matches upto next ], and put it in captured group 2
$$ matches literal ]

In the replacement, we've used the captured groups only i.e. dropped [ and ].

You can get rid of the first captured group by using a zero-width positive lookbehind to match the portion before [:

re.sub(r'(?<=genNode\()$$([^]]+)$$', r'\1', line)

Example:

In [444]: line = "assert Solution().oddEvenList(genNode([2,1,3,5,6,4,7])) == genNode([2,3,6,7,1,5,4]), 'Example 2'"                                                                                         

In [445]: re.sub(r'(genNode\()$$([^]]+)$$', r'\1\2', line)                                                                                                                                                  
Out[445]: "assert Solution().oddEvenList(genNode(2,1,3,5,6,4,7)) == genNode(2,3,6,7,1,5,4), 'Example 2'"

In [446]: re.sub(r'(?<=genNode\()$$([^]]+)$$', r'\1', line)                                                                                                                                                 
Out[446]: "assert Solution().oddEvenList(genNode(2,1,3,5,6,4,7)) == genNode(2,3,6,7,1,5,4), 'Example 2'"

FWIW, using typical non-greedy pattern .*? instead of [^]]+ would work as well:

re.sub(r'(?<=genNode\()$$(.*?)$$', r'\1', line)

How to replace/modify a pattern by regular expression in python?

Answers (2)

Related Questions