Reputation: 145
In the below code is there any way to keep the embedded image src like data:image/gif;base64, data:image/png;base64 anddata:image/jpeg;base64. I don't want to keep data:text/html;base64 in src
static void Main(string[] args)
{
var htmlContent =
@"
<div class='Editor-editor' contenteditable='true' style='overflow: auto;'>xss.
<div>
<script>$(document ).ready(function() {alert('xss...')});</script>
<object data='data:text/html;base64,PHNjcmlwdD5hbGVydCgiSGV sbG8iKTs8L3NjcmlwdD4='></object>
<img onmouseover='alert(99)' src='data:text/html;base64,PHNjcmlwdD5hbGVydCgiSGV sbG8iKTs8L3NjcmlwdD4='>
<img src=''>
</div>
</div>
";
ValidateHtmlContent(htmlContent);
Console.ReadLine();
}
public static void ValidateHtmlContent(string htmlContent)
{
var sanitizer = new HtmlSanitizer();
sanitizer.AllowedSchemes.Add("data"); // showing all data src, i want to show only data:image/gif in image src
var sanitized = sanitizer.Sanitize(htmlContent);
Console.WriteLine(sanitized);
}
Upvotes: 3
Views: 2091
Reputation: 81
Many thanks to Jitinder, I adapted his code (link) but found a problem with assigning to e.Reason (which is read only - perhaps the library has been updated since his answer. I appreciate that this post would be better as a comment to his answer but I don't have the reputation necessary, but thought this would be useful to future readers.
I came up with this code, slightly edited and simplified but based on Jitinder's (link), and seems to work for me:
var sanitizer = new HtmlSanitizer();
sanitizer.AllowedAttributes.Remove("src");
var _dataImage = new List<string> { "data:image/gif", "data:image/jpeg", "data:image/png", "data:image/jpg", "https://" };
sanitizer.RemovingAttribute += (s, e) =>
{
if (e.Tag.TagName == "IMG")
{
if (_dataImage.Any(x => e.Attribute.Value.StartsWith(x)))
{
e.Cancel = true;
}
}
};
var output = sanitizer.Sanitize(viewmodel.Notes);
Upvotes: 4
Reputation: 145
After many tries, I have found the solution. And it is working perfectly for me. maybe this solution will help others
public static void SanitizeHtmlContent(string htmlContent)
{
var sanitizer = new HtmlSanitizer();
sanitizer.AllowedAttributes.Remove("src");
sanitizer.RemovingAttribute += (s, e) =>
{
var _dataImage = new List<string> { "data:image/gif", "data:image/jpeg", "data:image/png", "data:image/jpg", "http://", "https://" };
switch (e.Tag.TagName)
{
case "IMG":
{
if (_dataImage.Any(x => e.Attribute.Value.StartsWith(x)))
{
e.Reason = RemoveReason.NotAllowedAttribute;
e.Cancel = true;
}
break;
}
}
};
var sanitized = sanitizer.Sanitize(htmlContent);
Console.WriteLine(sanitized);
}
Upvotes: 5