domDocument encoding not match with database

Question

I'm using domDocument to find html tags.

 $mensaje = "Te informamos que la parada Plaza de la Estación está
   próxima a vaciarse, el día 2013-04-22 a las 17:34:50.";

$dom = new domDocument('1.0', 'utf8_general_ci');
          // load the html into the object ***/
          $dom->loadHTML($mensaje);

          //discard white space
          $dom->preserveWhiteSpace = false;
          $nodeList= $dom->getElementsByTagName('b'); // here u use your desired tag

          $items = array();
          for($i=0; $i < $nodeList->length; $i++) {
                    $node = $nodeList->item($i);
                    $items[] = trim($node->nodeValue);
          }
          var_dump($items);

$mensaje is extracted from my database, this field is utf8_general_ci, but it fails:

array(3) { 
 [0]=> string(21) "Plaza de la EstaciÃ³n" 
  [1]=> string(10) "2013-04-22" 
  [2]=> string(8) "17:34:50" }

The first element has bad encoding.

How can I solve this?

domDocument encoding not match with database

Answers (1)

Related Questions