Regex Get All Alphabetic characters

Question

I want something like [A-z] that covers all alphabetic characters, including letters like ö, ä, ü, etc.

If I use [A-ü], I probably get all the special characters used by Latin-script languages, but it also allows other characters such as ¿¿]|{}[¢§øæ¬°µ©¥

Edit: I need this in Python 2.

Wiktor Stribiżew · Accepted Answer

[A-z] spans the whole ASCII range from "A" to "z", so besides the letters it also matches the six non-letter characters that sit between "Z" and "a": [ \ ] ^ _ `.
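To see this for yourself, here is a minimal sketch that runs [A-z] over every ASCII character in that range and keeps the matches that are not letters. The variable names (span, matched, non_letters) are just illustrative, and the snippet is written so that it should run on both Python 2 and Python 3.

```python
import re

# Every ASCII character from 'A' (65) through 'z' (122), the span [A-z] covers.
span = u''.join(chr(code) for code in range(ord('A'), ord('z') + 1))

# All characters in that span that the class accepts.
matched = re.findall(u'[A-z]', span)

# Keep only the matches that are not letters.
non_letters = [ch for ch in matched if not ch.isalpha()]

print(u''.join(non_letters))  # prints [\]^_`
```

The class accepts all 58 characters in the span: the 52 ASCII letters plus those six.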

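In Python, you can use [^\W\d_] with the re.U flag to match Unicode letters (see this post). \w matches letters, digits and the underscore, so the negated class reads "a word character that is neither a digit nor an underscore", which leaves the letters.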

Here is a sample based on your input string.

Python example:

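# -*- coding: utf-8 -*-
import re

# The named group "unicode_word" captures a run of Unicode letters.
r = re.search(
    r'(?P<unicode_word>[^\W\d_]*)',
    u'TestöäüéàèÉÀÈéàè',
    re.U  # make \W and \d Unicode-aware
)

print r.group('unicode_word')  # Python 2 print statement
# Output: TestöäüéàèÉÀÈéàè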
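For comparison with the [A-ü] range from the question, here is a rough sketch that runs both classes over the same text. The names LETTER_RUN, WIDE_RANGE and sample are made up for illustration, and the sample string is my own, not taken from the question. The pattern uses doubled backslashes rather than a raw string so that the snippet should also run on Python 3, where the ur'' prefix does not exist.

```python
# -*- coding: utf-8 -*-
import re

# Runs of Unicode letters: word characters minus digits and the underscore.
LETTER_RUN = re.compile(u'[^\\W\\d_]+', re.U)
# The code point range from the question, for comparison.
WIDE_RANGE = re.compile(u'[A-ü]+', re.U)

# Illustrative input mixing letters, digits, punctuation and symbols.
sample = u'Testöäü_42 [éàè] ¿¥©'

letter_runs = LETTER_RUN.findall(sample)  # two runs: Testöäü and éàè
wide_runs = WIDE_RANGE.findall(sample)    # three runs: Testöäü_ , [éàè] , ¿¥©

print(letter_runs)
print(wide_runs)
```

Note that some of the characters listed in the question, namely ø, æ and µ, are classified as letters in Unicode, so [^\W\d_] will still match them. If those need to be excluded, they would have to be filtered out separately.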
