php crawler(to crawl single website)

Question

I'm working on crawler project and I need some help from you, this is my first project. The task is to fetch the data from 'http://justdial.com'. for example, I want to fetch the city name(bangalore), categoury(hotels), hotel name, address and phone number.

I have written a code to fetch the tag content from its 'id', like I have fetched the address from this:

");

$newlines="'(.*?)'si";
$newlines=preg_replace('#]*)>.#u','',$newlines);

preg_match_all("$newlines", $stripped_file, $matches);


//DEBUGGING

  //$matches[0] now contains the complete A tags; ex: text
  //$matches[1] now contains only the HREFs in the A tags; ex: link

  header("Content-type: text/plain"); //Set the content type to plain text so the print below is easy to read!
 $path= ($matches);

 print_r($path); //View the array to see if it worked
?>

Now the problem is, I want to seperate the tags from the contents and store it in a database. And from database to the excel sheet. Please help me.

php crawler(to crawl single website)

Answers (1)

Related Questions