Regex to split when uppercase after alphabetic lowercase char

Question

I'm trying to split a string using a regex and the split function in Java. The regex should split the string wherever a capital letter comes directly after a lowercase letter, like this:

hHere      // -> should split to ["h", "Here"]

I'm trying to split a string like this

String str = "1. Test split hHere and not .Here and /Here";
// Backslashes have to be doubled inside a Java string literal
String[] splitString = str.split("(?=\\w+)((?=[^\\s])(?=\\p{Upper}))");
/* print splitString */
// -> should split to ["1. Test split h", "Here and not .Here and /Here"]
for (String s : splitString) {
    System.out.println(s);
}

output I get

1. 
Test split h
Here and not .
Here and /
Here

output I want

1. Test split h
Here and not .Here and /Here

Just can't figure out the regex to do this

azro · Accepted Answer

You may use a simpler pattern: (?<=\p{Ll})(?=\p{Lu}), written as "(?<=\\p{Ll})(?=\\p{Lu})" in a Java string literal.

(?<= ) is a lookbehind: it asserts that the given subpattern matches the text ending at the current position.
(?= ) is a lookahead: it asserts that the given subpattern matches the text starting at the current position.
Neither consumes any characters, which is very important here: the split happens between the two letters and both of them are kept.
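
To see why the "does not consume" part matters, here is a small sketch comparing the lookaround pattern with a pattern that matches the two letters themselves. The class and method names (LookaroundSplitDemo, splitZeroWidth, splitConsuming) are made up for this illustration and are not from the question.

```java
import java.util.Arrays;

// Hypothetical demo class, only for illustration.
class LookaroundSplitDemo {

    // Zero-width split: matches the position between a lowercase and an uppercase letter.
    static String[] splitZeroWidth(String input) {
        return input.split("(?<=\\p{Ll})(?=\\p{Lu})");
    }

    // Consuming split: matches the two letters themselves, so split() throws them away.
    static String[] splitConsuming(String input) {
        return input.split("\\p{Ll}\\p{Lu}");
    }

    public static void main(String[] args) {
        System.out.println(Arrays.toString(splitZeroWidth("hHere")));  // [h, Here]
        System.out.println(Arrays.toString(splitConsuming("hHere")));  // [, ere]
    }
}
```

With the consuming pattern the "hH" pair is treated as the delimiter and disappears from the result, which is why the lookbehind/lookahead pair is used instead.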

str.split("(?<=[a-z])(?=[A-Z])");   // old version, ASCII only: does not work for other alphabets
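
A rough sketch of the difference between the two versions, assuming an input with accented letters. The class and method names (UnicodeCaseSplitDemo, splitAscii, splitUnicode) and the sample word are invented for the example.

```java
import java.util.Arrays;

// Hypothetical demo class, only for illustration.
class UnicodeCaseSplitDemo {

    // ASCII-only version: [a-z] and [A-Z] know nothing about accented or non-Latin letters.
    static String[] splitAscii(String input) {
        return input.split("(?<=[a-z])(?=[A-Z])");
    }

    // Unicode version: \p{Ll} is any lowercase letter, \p{Lu} is any uppercase letter.
    static String[] splitUnicode(String input) {
        return input.split("(?<=\\p{Ll})(?=\\p{Lu})");
    }

    public static void main(String[] args) {
        // "caf\u00e9\u00c9cole" is "café" directly followed by "École"
        String input = "caf\u00e9\u00c9cole";
        // ASCII version finds no split point: the array has a single element
        System.out.println(Arrays.toString(splitAscii(input)));
        // Unicode version splits between the lowercase e-acute and the uppercase E-acute
        System.out.println(Arrays.toString(splitUnicode(input)));
    }
}
```

For plain ASCII input such as the string in the question, both versions should give the same result, so the \p{Ll} / \p{Lu} form is the safer default.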
