Regex. Split string depends on delimiter and include them

Question

Here is my str example, I need to save delimiters near last word like dot, dash and space.

str example:

   a = 'Beautiful. is. better5-than ugly'

what I tried

re.split('\W+', a)
['Beautiful', 'is', 'better5', 'than', 'ugly']

expected output:

 ['Beautiful.', ' ', 'is.', ' ', 'better5-', 'than', ' ', 'ugly']

Is it possible?

Czaporka · Accepted Answer

>>> import re
>>> a = 'Beautiful. is. better5-than ugly'
>>> re.findall("\w+[.-]?|\s+", a)
['Beautiful.', ' ', 'is.', ' ', 'better5-', 'than', ' ', 'ugly']

\w+[.-]? matches words with an optional dot or hyphen at the end.
\s+ matches whitespace.
| makes sure we capture either of the above.

Regex. Split string depends on delimiter and include them

Answers (2)

Related Questions