C# Regex Split To Java Pattern split

Question

I have to port some C# code to Java and I am having some trouble converting a string splitting command.

While the actual regex is still correct, when splitting in C# the regex tokens are part of the resulting string[], but in Java the regex tokens are removed.

What is the easiest way to keep the split-on tokens?

Here is an example of C# code that works the way I want it:

using System;

using System.Text.RegularExpressions;

class Program
{
    static void Main()
    {
        String[] values = Regex.Split("5+10", @"([\+\-\*\^\/])");

        foreach (String value in values)
            Console.WriteLine(value);
    }
}

Produces:
5
+
10

Pesto · Accepted Answer

I don't know how C# does it, but to accomplish it in Java, you'll have to approximate it. Look at how this code does it:

public String[] split(String text) {
    if (text == null) {
        text = "";
    }

    int last_match = 0;
    LinkedList splitted = new LinkedList();

    Matcher m = this.pattern.matcher(text);

    // Iterate trough each match
    while (m.find()) {
        // Text since last match
        splitted.add(text.substring(last_match,m.start()));

        // The delimiter itself
        if (this.keep_delimiters) {
            splitted.add(m.group());
        }

        last_match = m.end();
    }
    // Trailing text
    splitted.add(text.substring(last_match));

    return splitted.toArray(new String[splitted.size()]);
}

C# Regex Split To Java Pattern split

Answers (2)

Related Questions