How to convert Shift-JIS encoded string to UTF-8?

Question

I am downloading HTML source from Aozora Bunko. The HTML file is Shift-JIS encoded. I am trying to extract the book title and author, and then I want to store them in an SQLite (UTF-8) database.

    String[] splittedResult = result.split("\"title\">");
    splittedResult = splittedResult[1].split("");
    String title = splittedResult[0];
    byte[] b = null;
    try {
        b = title.getBytes("Shift_JIS");
    } catch (UnsupportedEncodingException e1) {
        // TODO Auto-generated catch block
        e1.printStackTrace();
    }
    String value = null;
    try {
        value = new String(b, "UTF-8");
    } catch (UnsupportedEncodingException e1) {
        // TODO Auto-generated catch block
        e1.printStackTrace();
    }

    ...
    myDatabase.addBookInformation(value, author);

The result: Latin letters are displayed correctly, but Japanese characters appear as blocks with a question mark inside (please ignore the null values).


How to solve this problem?


Answers (1)
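
A likely cause, offered as a sketch rather than a confirmed fix: a Java `String` has no Shift-JIS or UTF-8 "state" of its own. Calling `title.getBytes("Shift_JIS")` and then `new String(b, "UTF-8")` re-reads Shift-JIS bytes as if they were UTF-8, which corrupts every non-ASCII character. The usual approach is to decode the raw downloaded bytes as Shift_JIS once, at the point where they become a `String`, and then pass that `String` on unchanged. The example below assumes the page is available as a `byte[]` and that the JVM ships the `Shift_JIS` charset (it is not one of the guaranteed standard charsets). The class and method names are hypothetical.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

class ShiftJisExample {
    // Shift_JIS is not in StandardCharsets, so it is looked up by name.
    // Assumption: the runtime provides this charset; forName throws otherwise.
    private static final Charset SHIFT_JIS = Charset.forName("Shift_JIS");

    /** Decodes raw Shift_JIS bytes (e.g. a downloaded page body) into a String. */
    public static String decodeShiftJis(byte[] rawBytes) {
        return new String(rawBytes, SHIFT_JIS);
    }

    /** Encodes a String as UTF-8 bytes; only needed when writing bytes by hand. */
    public static byte[] toUtf8(String text) {
        return text.getBytes(StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // Shift_JIS bytes for the three characters U+65E5 U+672C U+8A9E,
        // standing in for the bytes read from the server.
        byte[] raw = {(byte) 0x93, (byte) 0xFA, (byte) 0x96, (byte) 0x7B,
                      (byte) 0x8C, (byte) 0xEA};

        String title = decodeShiftJis(raw);

        // Compare against the expected text written with Unicode escapes,
        // so the check does not depend on the source file's encoding.
        System.out.println(title.equals("\u65E5\u672C\u8A9E"));

        // Each of these characters takes three bytes in UTF-8.
        System.out.println(toUtf8(title).length);
    }
}
```

If `result` in the question was already built from the response with the wrong charset, the damage happens before the `split` calls, so the decoding step would need to move to wherever the response bytes are first turned into text. Once the `String` is correct, it can normally be handed to the database API as-is; explicit conversion with something like `toUtf8` is only needed when writing bytes manually.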
