Extracting specific URLs out of the document

Question

I think this should be elementary, but I still can't get my head around it. Let's say there's fair amount of HTML documents and I need to catch every image URLs out of them.

The rest of the content changes, but the base of the url is always the same for example: http://images.examplesite.com/images/,

So I want to extract every string that contains that part. the problem is that they're always mixed with or tags, so how could I drop them out? preg_match probably?

Narcis Radu · Accepted Answer

Try something like: preg_match_all('/http:\/\/images\.examplesite\.com\/images\/(.*?)"/i', $html_data, $results, PREG_SET_ORDER)

Extracting specific <a href> URLs out of the document

Answers (2)

Related Questions

Extracting specific &lt;a href&gt; URLs out of the document

Answers (2)

Related Questions

Extracting specific <a href> URLs out of the document