antlr4 and international characters

Question

I have been using antlr4 to parse a German document and so far I have done the following to parse the text that includes German characters:

LETTERS:
[a-zA-Z_\u00DC\u00FC\u00D6\u00F6\u00C4\u00E4\u00DF]; // hex unicodes for ÜüÖöÄäß

what is the best way to describe lingual characters of all languages in Unicode in a way that antlr understands, without specifying each language/character individually? say, the french, Arabic, or Chinese, Japanese characters?

Thank you

antlr4 and international characters

Answers (1)

Related Questions