How to write a Multi-line RegEx Expression

Question

I have a vb.net class that cleans some html before emailing the results.

Here is a sample of some html I need to remove:

    
      Blah blah blah
 
      Blah blah blah
 
      Blah blah blah

I am already using RegEx to do most of my work now. What would the RegEx expression look like to replace the block above with nothing?

I tried the following, but something is wrong:

'html has all of my text
html = Regex.Replace(html, ".*?", "", RegexOptions.IgnoreCase)

Thanks.

Heinzi · Accepted Answer

Add the Singleline option:

html = Regex.Replace(html, ".*?", "", RegexOptions.IgnoreCase Or RegexOptions.Singleline)

From MSDN:

Singleline: Specifies single-line mode. Changes the meaning of the dot (.) so it matches every character (instead of every character except ).

PS: Parsing HTML with regular expressions is discouraged. Your code will fail on something like this:


    bla
    bla

Answers (2)