Regex Pattern for Whitespace

Question

I am creating a regex library to work with HTML (I'll post it on MSDN Code when it's done). One of the methods removes any whitespace before a closing tag.

See the dog run

It would eliminate the space before the closing paragraph. I am using this:

    public static string RemoveWhiteSpaceBeforeClosingTag(string text)
    {
        string pattern = @"(\s+)(?:



As you can see I am replacing the spaces with

cletus · Accepted Answer

\s+(?=



is the expression you're after. It means one or more white-space characters followed by


(?=...) is a positive lookahead. It checks what follows but is not included in the match, so a replacement leaves that text alone;
(?:...) is a non-capturing group. It is included in the match, so a replacement overwrites that text as well.
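
To make the difference concrete, here is a minimal sketch. The pattern in the post is cut off, so the `</` used as the lookahead target, the `<p>` sample markup, and the class and method names are all assumptions for illustration, not the asker's actual code.

```csharp
using System;
using System.Text.RegularExpressions;

// Hypothetical helper class; the names are illustrative only.
public static class ClosingTagWhitespaceDemo
{
    // Assumption: "</" stands in for the start of a closing tag.
    // \s+ is matched and replaced; (?=</) only checks what follows.
    public static string WithLookahead(string text)
    {
        return Regex.Replace(text, @"\s+(?=</)", "");
    }

    // (?:</) is part of the match, so "</" has to be written back.
    public static string WithNonCapturingGroup(string text)
    {
        return Regex.Replace(text, @"\s+(?:</)", "</");
    }

    // Same non-capturing pattern, but replacing with an empty string
    // also deletes the "</" that the group matched.
    public static string NonCapturingDroppingMatch(string text)
    {
        return Regex.Replace(text, @"\s+(?:</)", "");
    }
}
```

With the input `<p>See the dog run </p>`, the first two methods both return `<p>See the dog run</p>`, while the third returns `<p>See the dog runp>` because the matched `</` is removed along with the space. The lookahead version avoids having to repeat the tag text in the replacement.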


That all being said, regular expressions are a flaky and error-prone way of processing HTML, so they should be used with caution, if at all.
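
As one illustration of that caveat, the sketch below applies the same assumed pattern (whitespace before `</`) to markup where `</` appears inside a script string. The class name, method name, and sample input are hypothetical.

```csharp
using System;
using System.Text.RegularExpressions;

// Hypothetical demo class; not part of the asker's library.
public static class RegexHtmlCaveatDemo
{
    // Assumed pattern: strip whitespace that is followed by "</".
    public static string StripBeforeClosingTag(string text)
    {
        return Regex.Replace(text, @"\s+(?=</)", "");
    }

    // Sample input where "</" occurs inside a JavaScript string literal.
    public static string ScriptSample()
    {
        return "<script>var s = \"a </b>\";</script>";
    }
}
```

`RegexHtmlCaveatDemo.StripBeforeClosingTag(RegexHtmlCaveatDemo.ScriptSample())` returns `<script>var s = "a</b>";</script>`: the regex has no notion of context, so it silently changes the value of the script string. An HTML parser would be able to tell element content apart from script text.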
