How to disable auto-entitying by DOMDocument

Question

I'm parsing a HTML string with DOMDocument. I'm loading it this way:

$dom = new DOMDocument();
$dom->loadHTML(''.$text.'', LIBXML_NOENT);

Then, I'm running some XPath queries and node replacing on it. (Even if I comment out these actions, the characters are still getting replaced.) Finally, I'm saving it this way:

$parsed = $dom->saveHTML();

But DOMDocument seems to be replacing non-ASCII characters with its entity representation. For example, string in Czech language:

ěščřžýáíé

Returns:

ěščřžýáíé

I can't use html_entity_decode(), because it breaks already highlighted and properly escaped source code.

What should I do to disable automatic replacing non-ASCII characters with entities by DOMDocument (so, the above example will return also ěščřžýáíé)?

Jakub · Accepted Answer

Finally, I have a solution. It is so simple, just instead of use .

How to disable auto-entitying by DOMDocument

Answers (2)

Related Questions