preg_match_all How to get all links?

Question

I'm trying to get all images links with preg_match_all those that begin with http://i.ebayimg.com/ and ends with .jpg , from page that I'm scraping.. I Can not do it correctly... :( I tried this but this is not what i need...:

preg_match_all('/(http|https|ftp|ftps)\:\/\/[a-zA-Z0-9\-\.]+\.[a-zA-Z]{2,3}(\/\S*)?/', $contentas, $img_link);

Same problem is with normal links... I don't know how to write preg_match_all to this:

Thank you very much!!!

UPDATE I'm trying from here: http://suchen.mobile.de/fahrzeuge/search.html?isSearchRequest=true&scopeId=C&makeModelVariant1.makeId=1900&makeModelVariant1.modelId=10&makeModelVariant1.modelDescription=&makeModelVariantExclusions%5B0%5D.makeId=&categories=Limousine&minSeats=&maxSeats=&doorCount=&minFirstRegistrationDate=2006-01-01&maxFirstRegistrationDate=&minMileage=&maxMileage=&minPrice=&maxPrice=11000&minPowerAsArray=&maxPowerAsArray=&maxPowerAsArray=PS&minPowerAsArray=PS&fuels=DIESEL&minCubicCapacity=&maxCubicCapacity=&ambitCountry=DE&zipcode=&q=&climatisation=&airbag=&daysAfterCreation=7&withImage=true&adLimitation=&export=&vatable=&maxConsumptionCombined=&emissionClass=&emissionsSticker=&damageUnrepaired=NO_DAMAGE_UNREPAIRED&numberOfPreviousOwners=&minHu=&usedCarSeals= get cars links and image links and all information, with information is everything fine, my script works good, but i have problem with scraping images and links.. here is my script :

', '');
     //filtravimas naikinami mokami top skelbimai
    $contentas = preg_replace('/(.*?)<\/div>/', '' ,$turinys);
    //filtravimas baigtas

      preg_match_all('/(.*?)<\/span>/',$contentas,$pavadinimas); 

      preg_match_all('/(.*?)<\/span>/',$contentas,$data); 

      preg_match_all('/(.*?)<\/span>/',$contentas,$miestas);

      preg_match_all('/(.*?)<\/span>/', $contentas, $kaina);

      preg_match_all('/

Louis Barranqueiro · Accepted Answer

1. To capture src attribute starting by http://i.ebayimg.com/ of all img tags :

regex : /src="((?:http|https):\/\/i.ebayimg.com\/.+?.jpg)"/i

Here is an example :

$re = "/src="((?:http|https):\/\/i.ebayimg.com\/.+?.jpg)"/i"; 
$str = "codeOfHTMLPage"; 
preg_match_all($re, $str, $matches);

Check it in live : here

If you want to be sure that you capture this url on an img tag then use this regex (keep in mind that performance will decrease if page is very long) :

$re = "/



2. To capture href attribute starting by http://i.ebayimg.com/ of all a tags :

regex : /href="((?:http|https):\/\/suchen.mobile.de\/fahrzeuge\/.+?.jpg)"/i

Here is an example :

$re = "/href="((?:http|https):\/\/suchen.mobile.de\/fahrzeuge\/.+?.jpg)"/i; 
$str = "codeOfHTMLPage"; 
preg_match_all($re, $str, $matches);


Check it in live : here

If you want to be sure that you capture this url on an a tag then use this regex (keep in mind that performance will decrease if page is very long) :

$re = "/

preg_match_all How to get all links?

Answers (2)

Related Questions