Get rigth Xpath for HTML elements

Question

I need to scrape this HTML page ...

http://www1.usl3.toscana.it/default.asp?page=ps&ospedale=3

.... using PHP and XPath to get the values like 0 under the string "CODICE BIANCO"

(NOTE: you could see different values in that page if you try to browse it ... it doesn't matter ..,, they changing dinamically .... )

I'm using this PHP code sample to print the value ...

loadHTML($data);

    $xpath = new DOMXPath($dom);
    $colorWaitingNumber = $xpath->query($xpath_for_parsing);
    $theValue =  'N.D.';
    foreach( $colorWaitingNumber as $node )
    {
      $theValue = $node->nodeValue;
    }

    print $theValue;

?>

I've extracted the xpath using both the Chrome and Firefox web consoles ...

Suggestions / examples?

Matey · Accepted Answer

Both Chrome and Firefox most probably improve the original HTML by adding elements inside

because the original HTML does not contain them. CURL does not do this and that's why your XPATH fails. Try this one instead:

$xpath_for_parsing = '//*[@id="contentint"]/table[2]/tr[1]/td/table/tr[3]/td[1]/table/tr[11]/td[3]/b';

Get rigth Xpath for HTML elements

Answers (2)

Related Questions