hebrew characters don't show in "UTF-8 without BOM" only "UTF-8"

Question

My html document starts as follows:


  



אבגד

If I encode my document as UTF-8, it appears correctly in the browser. If I encode as UTF-8 without BOM (which I understand is more standard) I get unusual characters.

What am I doing wrong?

Josh Lee · Accepted Answer

Your web server is declaring that the encoding is ISO-8859-1, and the browser is respecting that. Ironically enough, using a byte order mark sends a stronger signal to the browser that the encoding must actually be UTF-8. (The exact reason for this is complicated and boring.)

Fixing your web server depends on what the server is. If this is a static resource on disk served by Apache httpd, then something like AddCharset UTF-8 .html will add the header.

If this resource is served dynamically, then you should make sure you add the proper HTTP headers when producing the response, something like self.send_header('Content-Type', 'text/html; charset=utf-8') for Python's basic http server.

hebrew characters don't show in "UTF-8 without BOM" only "UTF-8"

Answers (1)

Related Questions

hebrew characters don&#39;t show in &quot;UTF-8 without BOM&quot; only &quot;UTF-8&quot;

Answers (1)

Related Questions

hebrew characters don't show in "UTF-8 without BOM" only "UTF-8"