mrblah
mrblah

Reputation: 103517

regex to trap img tag, both versions

I need to remove image tags from text, so both versions of the tag:

<img src="" ... ></img>


<img src="" ... />

Upvotes: 4

Views: 7840

Answers (2)

Roman
Roman

Reputation: 4611

In C# you can use following code:

            string html = @"
<h1>
<img src="" ... >
</img>
<img></img>-bad
<img/>-bad
<img src="" ... />
</h1>";
            string result = Regex.Replace(html, @"<img\s[^>]*>(?:\s*?</img>)?", "", RegexOptions.IgnoreCase);

But be aware that modifying HTML with Regex isn't good approach, its better to use Html Agility Pack or other HTML parsers.

Upvotes: 8

soulmerge
soulmerge

Reputation: 75714

This is not something you do with regular expressions. Use the xpath-query //img instead.

Upvotes: 4

Related Questions