Jitender Bisht
Jitender Bisht

Reputation: 145

c# how to allow embedded image HtmlSanitizer

In the below code is there any way to keep the embedded image src like data:image/gif;base64, data:image/png;base64 anddata:image/jpeg;base64. I don't want to keep data:text/html;base64 in src

static void Main(string[] args)
        {
            var htmlContent =
                @"
                    <div class='Editor-editor' contenteditable='true' style='overflow: auto;'>xss.&nbsp; &nbsp;
                        <div>
                            <script>$(document ).ready(function() {alert('xss...')});</script>
                            <object data='data:text/html;base64,PHNjcmlwdD5hbGVydCgiSGV sbG8iKTs8L3NjcmlwdD4='></object>
                            <img onmouseover='alert(99)' src='data:text/html;base64,PHNjcmlwdD5hbGVydCgiSGV sbG8iKTs8L3NjcmlwdD4='>
                            <img src=''>
                        </div>
                    </div>
                ";


            ValidateHtmlContent(htmlContent);
            Console.ReadLine();
        }

        public static void ValidateHtmlContent(string htmlContent)
        {
            var sanitizer = new HtmlSanitizer();
            sanitizer.AllowedSchemes.Add("data"); // showing all data src, i want to show only data:image/gif in image src
            var sanitized = sanitizer.Sanitize(htmlContent);

            Console.WriteLine(sanitized);
        }

Upvotes: 3

Views: 2091

Answers (2)

Shewan
Shewan

Reputation: 81

Many thanks to Jitinder, I adapted his code (link) but found a problem with assigning to e.Reason (which is read only - perhaps the library has been updated since his answer. I appreciate that this post would be better as a comment to his answer but I don't have the reputation necessary, but thought this would be useful to future readers.

I came up with this code, slightly edited and simplified but based on Jitinder's (link), and seems to work for me:

var sanitizer = new HtmlSanitizer();
sanitizer.AllowedAttributes.Remove("src");
var _dataImage = new List<string> { "data:image/gif", "data:image/jpeg", "data:image/png", "data:image/jpg", "https://" };
sanitizer.RemovingAttribute += (s, e) =>
{
  if (e.Tag.TagName == "IMG")
  {
    if (_dataImage.Any(x => e.Attribute.Value.StartsWith(x)))
    {
      e.Cancel = true;
    }
  }
};
var output = sanitizer.Sanitize(viewmodel.Notes);

Upvotes: 4

Jitender Bisht
Jitender Bisht

Reputation: 145

After many tries, I have found the solution. And it is working perfectly for me. maybe this solution will help others

public static void SanitizeHtmlContent(string htmlContent)
        {
            var sanitizer = new HtmlSanitizer();
            sanitizer.AllowedAttributes.Remove("src");
            sanitizer.RemovingAttribute += (s, e) =>
            {
                var _dataImage = new List<string> { "data:image/gif", "data:image/jpeg", "data:image/png", "data:image/jpg", "http://", "https://" };

                switch (e.Tag.TagName)
                {
                    case "IMG":
                        {
                            if (_dataImage.Any(x => e.Attribute.Value.StartsWith(x)))
                            {
                                e.Reason = RemoveReason.NotAllowedAttribute;
                                e.Cancel = true;
                            }

                            break;
                        }
                }
            };
            var sanitized = sanitizer.Sanitize(htmlContent);
            Console.WriteLine(sanitized);
        }

Upvotes: 5

Related Questions