preg_replace multiline match but preserve new lines

Question

I need a one-liner that strips PHP from an HTML file. The trick is that it must also preserve the newlines previously taken up by the PHP lines.

php -r "echo preg_replace('/<\\?.*(\\?>|\$)/Us','', file_get_contents(\$argv[1]));" -- "./index.php"

This "works" but does not preserve the new lines, for example:

Resolves to:

But I need it to resolve to:

Maybe I am using a hammer to drive a screw, but what I am trying to do is remove the PHP code, run the result through htmlhint, and have the reported line numbers actually match the lines in the file.

If there is a better solution, I would love to hear it. The end goal is to lint files that contain a mix of PHP, JavaScript and HTML with their respective linters.

Casimir et Hippolyte · Accepted Answer

OK, a one-liner using the tokenizer (with an ugly trick inside):

php -r 'echo array_reduce(token_get_all(file_get_contents($argv[1])),function($c,$i){return $i[0]==321?$c.$i[1]:$c.str_repeat("\n",@count_chars($i.$i[1])[10]);});'


Advantage of the tokenizer: even a string like "abc '; ?> def" is correctly parsed.
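To illustrate that advantage, here is a minimal sketch comparing a naive lazy regex with the tokenizer on a string of that kind. The sample input and the variable names are made up for the illustration; the regex is a simplified pattern, not the one from the question.

```php
<?php
// Hypothetical sample: the PHP string literal contains a closing-tag
// sequence before the real closing tag.
$src = "<p>a</p>\n<?php \$s = \"abc '; ?> def\";\necho \$s; ?>\n<p>b</p>\n";

// Naive regex: the lazy match stops at the closing-tag sequence inside the string.
$viaRegex = preg_replace('/<\?.*?\?>/s', '', $src);

// Tokenizer: keep only the text that lies outside PHP tags.
$viaTokens = '';
foreach (token_get_all($src) as $token) {
    if (is_array($token) && $token[0] === T_INLINE_HTML) {
        $viaTokens .= $token[1];
    }
}

// Did any PHP source leak into the result?
var_dump(strpos($viaRegex, 'def') !== false);
var_dump(strpos($viaTokens, 'def') !== false);
```

This sketch only shows the parsing difference; it does not yet put the newlines back.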

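321 is the value of the constant T_INLINE_HTML (everything that isn't between PHP tags) in the PHP version used here. Token values can change between PHP versions, so writing T_INLINE_HTML in place of 321 is more portable.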
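If you want to see how the tokenizer splits a file, a small sketch like the following lists the token names. The sample string is hypothetical, and token_name() is used so nothing depends on the numeric token values.

```php
<?php
// Hypothetical two-line sample: one HTML line, one PHP line.
$sample = "<b>hi</b>\n<?php echo 1; ?>\n";

$names = [];
foreach (token_get_all($sample) as $token) {
    // Array tokens carry [id, text, line]; single-character tokens are plain strings.
    $names[] = is_array($token) ? token_name($token[0]) : $token;
}

echo implode(' ', $names), "\n";
```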

10 is the ASCII code of the newline character (LF). By default, count_chars returns an array with the byte values (ASCII codes) as keys and the number of occurrences as values.
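A quick sketch of that count_chars behaviour, with a made-up input string; substr_count is shown next to it only as a more direct way to get the same number.

```php
<?php
// Hypothetical input containing two line feeds.
$text = "a\nb\nc";

// Default mode: one entry per byte value 0..255, each holding its frequency.
$counts = count_chars($text);

$viaCountChars  = $counts[10];                 // 10 is the byte value of LF
$viaSubstrCount = substr_count($text, "\n");   // same count, without the full table

var_dump($viaCountChars, $viaSubstrCount);
```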

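The ugly part is $i.$i[1]. token_get_all returns each token either as an array (token ID, text, line number) or, for single-character tokens, as a plain string. When $i is an array, the expression concatenates "Array" with the token text; when $i is a string, it concatenates the string with its second character, which does not exist. The @ suppresses the resulting warnings and notices. Either way, the trick avoids an is_array test, and the number of newline characters is preserved. (Look at what token_get_all returns to understand the problem.)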
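For anyone who prefers to read it without the trick, here is a sketch of the same idea spelled out, with an explicit is_array test and substr_count instead of count_chars. The function name strip_php_keep_lines and the sample input are hypothetical, and the type declarations assume PHP 7 or later.

```php
<?php
// Hypothetical helper: drop PHP code but keep one "\n" for every newline it contained.
function strip_php_keep_lines(string $source): string
{
    $out = '';
    foreach (token_get_all($source) as $token) {
        // Tokens are either [id, text, line] arrays or single-character strings.
        $text = is_array($token) ? $token[1] : $token;

        if (is_array($token) && $token[0] === T_INLINE_HTML) {
            // Text outside PHP tags is copied unchanged.
            $out .= $text;
        } else {
            // PHP code is replaced by the newlines it spanned.
            $out .= str_repeat("\n", substr_count($text, "\n"));
        }
    }
    return $out;
}

// Hypothetical sample: the closing list tag is on line 5 before and after stripping.
$sample = "<ul>\n<?php foreach (\$items as \$item) {\n    echo \$item;\n} ?>\n</ul>\n";
$stripped = strip_php_keep_lines($sample);
echo $stripped;
```

Saved to a file, it can be adapted to read file_get_contents($argv[1]) like the one-liner, so the output can be piped to htmlhint with matching line numbers.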

Or with DOMDocument:

php -r '$d=new DOMDocument;$d->loadHTMLFile($argv[1],8196);foreach((new DOMXPath($d))->query("//processing-instruction()")as$p)$p->parentNode->replaceChild($d->createTextNode(preg_replace("~\S+~","",$p->nodeValue)),$p);echo$d->saveHTML();'

