preg_match_all not matching href section correctly

Question

I'm having a problem matching the href section of a link using preg_match_all. Currently it captures 3 sections (full link, URL only, link text only), which is perfect, but the URL-only part also captures any other attributes located after the href attribute.

Also, how do I make the "href" text case-insensitive?

Code:

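$content = '<a href="http://www.google.com" target="_blank">Google</a> is a search engine. <a href="http://www.yahoo.com" title="yahoo" target="_blank">Yahoo</a> is a search engine.';

preg_match_all('/<a href="([^>]*)">([^<]*)<\/a>/', $content, $matches);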

print_r($matches);

Result:

Array
(
    [0] => Array
        (
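            [0] => <a href="http://www.google.com" target="_blank">Google</a>
            [1] => <a href="http://www.yahoo.com" title="yahoo" target="_blank">Yahoo</a>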
        )

    [1] => Array
        (
            [0] => http://www.google.com" target="_blank
            [1] => http://www.yahoo.com" title="yahoo" target="_blank
        )

    [2] => Array
        (
            [0] => Google
            [1] => Yahoo
        )

)

bizzehdee · Accepted Answer

You're starting out looking for the > and not taking into account any other attributes. Try:

/<a href="([^"]*)"[^>]+>([^<]*)<\/a>/

This will now pull out the href, skip over the rest of the attributes, and then capture the link text right up to the next tag.
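
For the case-insensitivity part of the question, one option is the PCRE `i` modifier. The following is a minimal sketch, not a definitive implementation: the sample `$content` is an assumption (the second link is upper-cased on purpose to exercise the modifier), and the pattern is a slight variation on the one above that uses `[^>]*` so links with no extra attributes also match.

```php
<?php
// Sample input (assumed for illustration); the second link uses
// upper-case tag and attribute names to exercise the "i" modifier.
$content = '<a href="http://www.google.com" target="_blank">Google</a> is a search engine. '
         . '<A HREF="http://www.yahoo.com" title="yahoo" target="_blank">Yahoo</A> is a search engine.';

// ([^"]*)  stops at the closing quote, so the URL group excludes other attributes.
// [^>]*    skips any remaining attributes (zero or more characters).
// ([^<]*)  captures the link text up to the next tag.
// i        makes the whole pattern case-insensitive, including "href".
$pattern = '/<a\s+href="([^"]*)"[^>]*>([^<]*)<\/a>/i';

preg_match_all($pattern, $content, $matches);

print_r($matches[1]); // URLs only
print_r($matches[2]); // link text only
```

This sketch still assumes that href is the first attribute, that it is double-quoted, and that the link text contains no nested tags. For arbitrary HTML, a parser such as PHP's DOMDocument is likely to be more robust than a regular expression.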
