Matching pattern using reg ex and re.sub

Question

I am trying to remove the following pattern from some data and am getting mixed results.

--endof["somerandomtext"]

Basically the text always starts with --endof[" and ends with "] and the words in between change.
The line of code I am using that is not working currently working all the time.

d = re.sub('--+([a-zA-Z0-9_"-$$]*)+$$', " ", d)

I am new to trying to parse data using re.sub or any method. I have been just guessing at how to try and make this line work, and I probably have something wrong that is causing me problems.

Any help appreciated.

DYZ · Accepted Answer

A variation of @Hexagon's answer:

s = re.sub('--endof\[[^]]+]', '', s)

This removes a string that starts with --endof[, followed by any number of non-]s ([^]]+), followed by a ]. Works for any text that does not contain closing brackets.

Matching pattern using reg ex and re.sub

Answers (2)

Related Questions