How to delete HTML text between html tags in PHP using SimpleHtmlDom

Question

Using http://simplehtmldom.sourceforge.net/ I know this could extract the html text:

plaintext; 

?>

But how to delete all the text?

For example, if I have this input HTML:


    
        Example
    
    
        Lore Ipsum
        
            Lorem ipsum dolor sit amet, consectetuer adipiscing elit.

            Aenean commodo ligula eget dolor. Aenean massa.

I would like to get this output with SimpleHtmlDom:

In other words, I want to keep the structure of the document only.

Please help.

Gordon · Accepted Answer

I don't know for sure how to do that with SimpleHtmlDom. From it's manual, I'd assume something like

$html = file_get_html('http://www.google.com/');
foreach( $html->find('text') as $text) {
    $text->plaintext = '';
}

However, you can also use PHP's native DOM parser. It can do XPath queries and should in general be a good deal faster:

libxml_use_internal_errors(TRUE);
$dom = new DOMDocument;
$dom->loadHTMLFile('http://www.google.com');
$xp = new DOMXPath($dom);
foreach ($xp->query('//text()') as $textNode) {
    $textNode->parentNode->removeChild($textNode);
}
$dom->formatOutput = TRUE;
echo $dom->saveXML($dom->documentElement);

How to delete HTML text between html tags in PHP using SimpleHtmlDom

Answers (2)

Set `innertext` Property of HTML Element to the Empty String

Related Questions

How to delete HTML text between html tags in PHP using SimpleHtmlDom

Answers (2)

Set innertext Property of HTML Element to the Empty String

Related Questions

Set `innertext` Property of HTML Element to the Empty String