Reputation: 7126
Here's a tough one:
I'm working on a website where most of the content is in Japanese.
But consider this url:
http://www.stackoverflow.com/質問/日本語のURLはどうする
Most URL parsers, including stackoverflow's, don't know where to delimit URLs that contain Japanese. Not good.
I haven't checked with other browsers yet, but Google Chrome will display URL-decoded Japanese urls in Japanese -- and when you copy the url, it URL-encodes it:
That's the right thing to do, but it's not exactly the most friendly. So, I thought, I'd have the content providers just type in short English slugs, like this:
http://www.stackoverflow.com/questions/what-to-do-with-japanese-urls
Awesome. But there's two problems with this:
Any thoughts on the best course of action? :D
Upvotes: 7
Views: 2393
Reputation: 1554
The big Japanese websites seem to be somewhat divided on this question. Japanese wikipedia and amazon.co.jp use encoded Japanese, for instance. But the English slugs seem to be the most common. Take a look at http://fujifilm.jp/ for an example.
I think your concern about search engines is pretty valid though, and in your position that might be enough to tip the balance in favor of using encoded Japanese for me. It would be a bit of a pain though- lots of opportunity for error there.
Also, as Mike points out, what stackoverflow is doing is pretty much the right thing, as the url needs to be encoded.
Upvotes: 6
Reputation: 9971
Using romaji transliteration?
http://www.stackoverflow.com/questionu/nihonnoURLwadousuru
:D
Upvotes: 0
Reputation: 3460
Unfortunately HTTP doesn't have native unicode support. All unicode URL path and query params need to be encoded. You can use unicode encoding instead of % encoding if you want, but that will still not make the URL any more human readable.
Upvotes: 0