Preg_Replace Change URL

Question

I am trying to grab content from another one of my sites, which is working fine, except that all the links are incorrect.

    include_once('../simple_html_dom.php');

    $page = file_get_html('http://www.website.com');

    $ret = $page->find('div[id=header]');

    echo $ret[0];

Is there any way, using preg_replace, to make all the links show the full URL instead?

    $ret[0] = preg_replace('@(http://([\w-.]+)+(:\d+)?(/([\w/_.]*(\?\S+)?)?)?)@',
        'http://fullwebsitellink.com$1', $ret[0]);

I guess it would be something like the above, but I don't understand it.

Thanks

IMSoP · Accepted Answer

Your question doesn't really explain what is "incorrect" about the links, but I'm guessing you have something like this:

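    <a href="/">Home</a> | <a href="/sitemap">Sitemap</a>

and you want to embed it in another site, where those links need to be fully-qualified with a domain name, like this:

    <a href="http://example.com/">Home</a> | <a href="http://example.com/sitemap">Sitemap</a>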

Assuming this is the case, the replacement you want is so simple you don't even need a regex: find all href attributes beginning "/", and add the domain part (I'll use "http://example.com") to their beginning to make them absolute:

    $scraped_html = str_replace('href="/', 'href="http://example.com/', $scraped_html);
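If the scraped markup mixes single and double quotes, or contains protocol-relative links such as `//cdn.example.com/...`, the plain string replacement would miss the former and mangle the latter. Since the question asked about a regex, here is a minimal sketch of the same idea using `preg_replace_callback`. The function name `absolutize_links` and the sample HTML are made up for illustration, and the code assumes the links are root-relative (they start with a single "/"):

```php
<?php
// Hypothetical helper (not part of simple_html_dom): prefix every
// root-relative href with a base URL such as "http://example.com".
function absolutize_links(string $html, string $base): string
{
    $base = rtrim($base, '/'); // avoid a double slash after the domain

    // href= followed by either quote style and a single leading "/".
    // The (?!/) lookahead leaves protocol-relative URLs ("//host/...") alone.
    $result = preg_replace_callback(
        '@\bhref=(["\'])/(?!/)@i',
        function (array $m) use ($base): string {
            return 'href=' . $m[1] . $base . '/';
        },
        $html
    );

    // preg_replace_callback returns null on a regex error; fall back to the input.
    return $result ?? $html;
}

// Assumed sample input, standing in for the scraped header HTML.
$scraped_html = '<a href="/">Home</a> | <a href=\'/sitemap\'>Sitemap</a>';
echo absolutize_links($scraped_html, 'http://example.com');
```

In the question's code this would be called as `echo absolutize_links((string) $ret[0], 'http://example.com');`. Links that are relative to the current page (for example `href="page.html"`) are not touched by either approach and would need separate handling.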
