Regex non capturing group

Question

Regex Experts. I need some help to capture the IP address and its status from the below HTML string.

$html = "
         Active Zone : BW Zone 1[1],   VIP = 192.168.254.10


             https://192.168.254.10/checkGlobalReplicationTier
  [ACTIVE]
             https://192.168.254.10/checkReplication

             https://192.168.254.11/checkGlobalReplicationTier
  [STANDBY]
             https://192.168.254.11/checkReplication

         Local Zones:
             LC Zone 3[3],   VIP = 192.168.254.13
                 https://192.168.254.13/checkReplication
  [ACTIVE]"

[regex]::matches($html, '((\d{1,3}\.){3}\d{1,3})((?s).*?)((?<=$$)[A-z]*(?=$$))').value

The above regex is able to get the IP and Status.. but i want to omit everything in-between the IP and Status. How do i do this with non capturing regex.

192.168.254.10  Active
192.168.254.11  Standby
192.168.254.13  Active

mklement0 · Accepted Answer

Generally, consider iRon's helpful answer for robust HTML parsing with a dedicated parser.

How do i do this with non capturing regex.

You can't, because in order to exclude parts of the matching span of text you'd need look-around assertions (such as the negative look-behind assertions in your attempt, e.g. (?<=$$)), but these in turn prevent you from consuming the unwanted parts of the span.

Instead, use two capture groups and access them as follows:

[regex]::Matches(
  $html, 
  '(?s)((?:\d{1,3}\.){3}\d{1,3}).+?\[([A-Z]+)$$'
 ) | ForEach-Object { 
   [pscustomobject] @{ 
     Ip = $_.Groups[1].Value
     Status = $_.Groups[2].Value
   } 
 }

This results in the following display output:

Ip             Status
--             ------
192.168.254.10 ACTIVE
192.168.254.11 STANDBY
192.168.254.13 ACTIVE

Regex non capturing group

Answers (2)

Related Questions