Matching everything but words, numbers and spaces

Question

This code removes everything except letters, but how do I get it to also leave numbers and spaces untouched? For example: "I didn't see him until 1." -> "I didnt see him until 1"

import regex
text = regex.sub(r"\P{alpha}+", "", text)

tchrist · Accepted Answer

Don’t use Python’s re library on Unicode. It works very poorly. Use Matthew Barnett’s regex library instead. It works much, much better.

It also runs on both Python 2 and 3 and on both narrow and wide builds, but for reasons largely unrelated to that particular library I strongly recommend that you run only a wide build of Python 3 and eschew all other combinations.
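As a rough sketch of how that advice applies to the question, the negated property can be turned into a negated character class that lists everything to keep: letters, numbers and whitespace. The function name strip_symbols is made up for illustration. The fallback to the standard re module is an assumption added only so the snippet still runs where regex is not installed; it approximates the same class and is not what this answer recommends.

```python
try:
    import regex as re_mod  # Matthew Barnett's third-party regex library
    # Remove anything that is not alphabetic, not a number, and not whitespace.
    PATTERN = re_mod.compile(r"[^\p{alpha}\p{N}\s]+")
except ImportError:
    import re as re_mod  # fallback for illustration only
    # \w covers Unicode letters and digits plus "_", so "_" is removed separately.
    PATTERN = re_mod.compile(r"[^\w\s]+|_+")


def strip_symbols(text):
    """Return text with everything except letters, numbers and spaces removed."""
    return PATTERN.sub("", text)


print(strip_symbols("I didn't see him until 1."))  # I didnt see him until 1
```

Listing the allowed classes inside a single negated set keeps the pattern easy to extend: adding another property to keep is a matter of appending it inside the brackets.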
