Reputation: 63
What is best practice to repair malformed XML files with PHP? For example CDATA part contains illegal chars. With regular expressions? Or execute some Linux command line tools?
Upvotes: 6
Views: 5419
Reputation: 66781
Tidy is a binding for the Tidy HTML clean and repair utility which allows you to not only clean and otherwise manipulate HTML documents, but also traverse the document tree.
// Specify configuration
$config = array(
'indent' => true,
'input-xml' => true,
'output-xml' => true,
'wrap' => false);
// Tidy
$tidy = new tidy;
$tidy->parseFile('sample.xml', $config);
$tidy->cleanRepair();
// Output
echo $tidy;
Upvotes: 8