My java class implementation of XOR encryption has gone wrong

Question

I am new to java but I am very fluent in C++ and C# especially C#. I know how to do xor encryption in both C# and C++. The problem is the algorithm I wrote in Java to implement xor encryption seems to be producing wrong results. The results are usually a bunch of spaces and I am sure that is wrong. Here is the class below:

public final class Encrypter {

    public static String EncryptString(String input, String key)
    {


        int length;
        int index = 0, index2 = 0;
        byte[] ibytes = input.getBytes();
        byte[] kbytes = key.getBytes();
        length = kbytes.length;
        char[] output = new char[ibytes.length]; 
        for(byte b : ibytes)
        {
            if (index == length)
            {
                index = 0;

            }
            int val = (b ^ kbytes[index]);
            output[index2] = (char)val;
            index++;
            index2++;
        }


        return new String(output);
    }
    public static String DecryptString(String input, String key)
    {
        int length;
        int index = 0, index2 = 0;
        byte[] ibytes = input.getBytes();
        byte[] kbytes = key.getBytes();
        length = kbytes.length;
        char[] output = new char[ibytes.length]; 
        for(byte b : ibytes)
        {
            if (index == length)
            {
                index = 0;

            }
            int val = (b ^ kbytes[index]);
            output[index2] = (char)val;
            index++;
            index2++;
        }


        return new String(output);
    }
}

dty · Accepted Answer

Strings in Java are Unicode - and Unicode strings are not general holders for bytes like ASCII strings can be.

You're taking a string and converting it to bytes without specifying what character encoding you want, so you're getting the platform default encoding - probably US-ASCII, UTF-8 or one of the Windows code pages.

Then you're preforming arithmetic/logic operations on these bytes. (I haven't looked at what you're doing here - you say you know the algorithm.)

Finally, you're taking these transformed bytes and trying to turn them back into a string - that is, back into characters. Again, you haven't specified the character encoding (but you'll get the same as you got converting characters to bytes, so that's OK), but, most importantly...

Unless your platform default encoding uses a single byte per character (e.g. US-ASCII), then not all of the byte sequences you will generate represent valid characters.

So, two pieces of advice come from this:

Don't use strings as general holders for bytes
Always specify a character encoding when converting between bytes and characters.

In this case, you might have more success if you specifically give US-ASCII as the encoding. EDIT: This last sentence is not true (see comments below). Refer back to point 1 above! Use bytes, not characters, when you want bytes.

My java class implementation of XOR encryption has gone wrong

Answers (2)

Related Questions