Reputation: 1367
I have some code to remove all html tag but I want to remove all html but except </td>
and </tr>
tags.
How can this be done?
public string HtmlStrip( string input)
{
input = Regex.Replace(input, "<input>(.|\n)*?</input>", "*");
input = Regex.Replace(input, @"<xml>(.|\n)*?</xml>", "*"); // remove all <xml></xml> tags and anything inbetween.
return Regex.Replace(input, @"<(.|\n)*?>", "*"); // remove any tags but not there content "<p>bob<span> johnson</span></p>" becomes "bob johnson"
}
Upvotes: 4
Views: 1672
Reputation: 1
remove all tag html but except tag td and tr
input = Regex.Replace(input, @"<(?!td|/td|tr|/tr).*?>", "");
Upvotes: 0
Reputation: 589
Regex is not great for parsing XML or HTML. Take a look at the HTML Agility Pack
Upvotes: 6