Regular expression for matching between text

Question

I have a file, which contains automatically generated statistical data from apache http logs.

I'm really struggling on how to match lines between 2 sections of text. This is a portion of the stat file I have:

jpg 6476 224523785 0 0
Unknown 31200 248731421 0 0
gif 197 408771 0 0
END_FILETYPES

# OS ID - Hits
BEGIN_OS 12
linuxandroid 1034
winlong 752
winxp 1320
win2008 204250
END_OS

# Browser ID - Hits
BEGIN_BROWSER 79
mnuxandroid 1034
winlong 752
winxp 1320

What I'm trying to do, is write a regex which will only search between the tags BEGIN_OS 12 and END_OS.

I want to create a PHP array that contains the OS and the hits, for example (I know the actual array won't actually be exactly like this, but as long as I have this data in it):

array(
   [0] => array(
      [0] => linuxandroid
      [1] => winlong
      [2] => winxp
      [3] => win2008
   )
   [1] => array(
      [0] => 1034
      [1] => 752
      [2] => 1320
      [3] => 204250
   )
)

I've been trying for a good couple of hours now with gskinner regex tester to test regular expressions, but regex is far from my strong point.

I would post what I've got so far, but I've tried loads, and the closest one I've got is:

^[BEGIN_OS\s12]+([a-zA-Z0-9]+)\s([0-9]+)

which is pathetically awful!

Any help would be appreciated, even if its a 'It cant be done'.

Amal · Accepted Answer

A regular expression may not be the best tool for this job. You can use a regex to get the required substring and then do the further processing with PHP's string manipulation functions.

$string = preg_replace('/^.*BEGIN_OS \d+\s*(.*?)\s*END_OS.*/s', '$1', $text);

foreach (explode(PHP_EOL, $string) as $line) {
    list($key, $value) = explode(' ', $line);
    $result[$key] = $value;
}

print_r($result);

Should give you the following output:

Array
(
    [linuxandroid] => 1034
    [winlong] => 752
    [winxp] => 1320
    [win2008] => 204250
)

Regular expression for matching between text

Answers (2)

Related Questions