Unable to use regex to search in PHP?

Question

I'm trying to extract the content of specific tags from an HTML document.

My method works for some tags but not all, and it does not work for the tag whose content I want to get.

Here is my code:




(.*)<\/div>/";
     preg_match_all($pattern, $data, $adsLinks, PREG_SET_ORDER);
     var_dump($adsLinks);
     foreach ($adsLinks as $i) {
         echo "".$i[0]."";
     } 

?>

The above code doesn't work, but it works when I change the $pattern into:

$pattern = "/(.*)<\/div>/";

or

$pattern = "/(.*)<\/div>/";

I can't see any difference between these patterns. Please help me find the error. Thanks.

Paul Dixon · Accepted Answer

The reason your regex fails is that you are expecting . to match newlines, and it won't unless you use the s modifier, so try

$pattern = "/(.*)<\/div>/s";

When you do this, you might find the pattern a little too greedy, as it will try to capture everything up to the last closing div element. To make it non-greedy, so that it matches only up to the very next closing div, add a ? after the *:

$pattern = "/(.*?)<\/div>/s";
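To see the three behaviours side by side, here is a minimal sketch. The sample markup and the class name "ad" are made up for illustration, since the original opening tag is not visible in the question; substitute your own.

```php
<?php
// Hypothetical input: two divs whose content spans several lines.
$data = "<div class=\"ad\">\nfirst\n</div>\n<div class=\"ad\">\nsecond\n</div>";

// No modifier: . does not match a newline, so the pattern never reaches </div>.
$noModifier = preg_match_all('/<div class="ad">(.*)<\/div>/', $data, $m1);

// s modifier, greedy: . matches newlines, and .* runs to the LAST </div>,
// so both divs are swallowed into a single match.
$greedy = preg_match_all('/<div class="ad">(.*)<\/div>/s', $data, $m2);

// s modifier, non-greedy: .*? stops at the NEXT </div>, giving one match per div.
$lazy = preg_match_all('/<div class="ad">(.*?)<\/div>/s', $data, $m3);

var_dump($noModifier, $greedy, $lazy);

// $m3[1] holds the captured group of each match (default PREG_PATTERN_ORDER).
foreach ($m3[1] as $content) {
    echo trim($content), "\n";
}
```

Note that the non-greedy version still breaks as soon as one of the target divs contains another div, because the match ends at the first closing tag it meets.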

This just serves to illustrate that for all but the simplest cases, parsing HTML with regexes is the road to madness. So try using DOM functions for parsing HTML.
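As a rough sketch of the DOM approach, the following uses PHP's built-in DOMDocument and DOMXPath classes. The sample markup and the class name "ad" are assumptions for illustration, not taken from the question.

```php
<?php
// Hypothetical input; the class name "ad" is an assumption.
$data = '<html><body>'
      . '<div class="ad"><p>first</p></div>'
      . '<div class="other">skip</div>'
      . '<div class="ad">second</div>'
      . '</body></html>';

// Collect parser warnings internally instead of printing them,
// since real-world HTML is rarely perfectly valid.
libxml_use_internal_errors(true);

$doc = new DOMDocument();
$doc->loadHTML($data);
libxml_clear_errors();

// Select every div whose class attribute is exactly "ad".
$xpath = new DOMXPath($doc);
$contents = [];
foreach ($xpath->query('//div[@class="ad"]') as $div) {
    // Rebuild the inner HTML by serialising each child node.
    $inner = '';
    foreach ($div->childNodes as $child) {
        $inner .= $doc->saveHTML($child);
    }
    $contents[] = $inner;
}

print_r($contents);
```

Because the parser tracks which closing tag belongs to which opening tag, this keeps working when the target div contains nested divs or line breaks, which is where the regex versions fall apart.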
