Does Java regex unicode support include full case folding?

Question

Assuming these string definitions:

String lowerStream = "ﬂüßchen";
String upperStream = "FLÜSSCHEN";
String streamPattern = ".*(ss).*";

Using this pattern:

Pattern pattern = Pattern.compile(streamPattern, Pattern.CASE_INSENSITIVE | Pattern.UNICODE_CASE);

...this assertion passes:

assertThat( pattern.matcher(upperStream).find() ).isTrue()

...and this one fails:

assertThat( pattern.matcher(lowerStream).find() ).isTrue()

...whereas both lowerStream and upperStream pass on rubular.com with each of these regexes:

/.*(ss).*/i

/.*(SS).*/i

/.*(ß).*/i

It is also not possible to get a successful comparison using any of String.equalsIgnoreCase(), String.toLowerCase().equals(), or String.toUpperCase().equals().

Does java's unicode regex only support simple case folding? If so, why is this not explicitly documented?

Does Java regex unicode support include full case folding?

Answers (1)

Related Questions