How to use PHP preg_split with an HTML string

Question

I am trying to parse a badly formed html table:

A couple of lines of this are:

  Food: Yes

  Pool: Beach

  Centre: Yes

After spending a lot of time on this with XPath, I think it is probably better to split the above text into lines using preg_split and parse from there.

The pattern I think would work uses:

<\b><\br>*: <\b>

My code is as follows:

$pattern='
*:';           
$pattern=preg_quote($pattern,'#');
$chars = preg_split($pattern, $output);
print_r($chars);

I am getting the following error:

Delimiter must not be alphanumeric or backslash

What am I doing wrong?

Cal · Accepted Answer

Try this:

$pattern='
*:';           
$pattern=preg_quote($pattern,'#');
$chars = preg_split('#'.$pattern.'#', $output);
print_r($chars);

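The preg_quote function only escapes the regex special characters in the string; it doesn't add the delimiters for you. The second argument just tells it which delimiter character to escape as well. Without delimiters, preg_split treats the first character of the pattern as the delimiter, and after quoting that is most likely a backslash, which is why you get that error.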
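To illustrate the point about delimiters, here is a minimal sketch. The `<br>` separator and the sample string are assumptions made for the example, since the real markup is not shown in the question.

```php
<?php
// Hypothetical input: rows separated by <br> tags (assumed, not from the question).
$input = 'Food: Yes<br>Pool: Beach<br>Centre: Yes';

// preg_quote() escapes regex metacharacters (and the '#' delimiter if present),
// but it returns the bare pattern text with no delimiters around it.
$quoted = preg_quote('<br>', '#');

// The delimiters have to be added by hand before calling preg_split().
$lines = preg_split('#' . $quoted . '#', $input);

print_r($lines);
```

Passing `$quoted` on its own to preg_split should reproduce the delimiter error from the question, because the quoted string begins with a backslash.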

As other people will surely point out, using regular expressions is not a good way to parse HTML :)

Your regular expression is also not going to match what you hope. Here's a version that will probably work for your input:

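// Sample row; any markup in the real input is removed by strip_tags().
$in = " Pool: Beach\n";
// Drop the HTML tags, then split "key: value" on the colon.
$out = explode(':', strip_tags($in));
$key = trim($out[0]);
$value = trim($out[1]);
echo "$key = $value\n";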

This removes all the HTML, then splits on the colon, and then removes any surrounding whitespace.
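The same idea can be extended to several rows at once. The following is only a sketch: the `$html` sample, its `<b>` and `<br>` tags, and the assumption that each row sits on its own line are guesses at the input, not taken from the question.

```php
<?php
// Hypothetical input; the real table markup is not shown in the question.
$html = "<b>Food</b>: Yes<br>\n<b>Pool</b>: Beach<br>\n<b>Centre</b>: Yes<br>";

$result = [];
// strip_tags() removes the markup; the remaining newlines separate the rows.
foreach (explode("\n", strip_tags($html)) as $line) {
    // A limit of 2 keeps any further colons inside the value.
    $parts = explode(':', $line, 2);
    if (count($parts) !== 2) {
        continue; // skip lines that are not in "key: value" form
    }
    $result[trim($parts[0])] = trim($parts[1]);
}

print_r($result);
```

Collecting the pairs into an associative array makes later lookups such as `$result['Pool']` straightforward. If the rows are not separated by newlines in the real input, the outer `explode` would need a different separator.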
