Regex replace domain substring with html tag in C#

Question

I'm trying to replace plain domain like substrings of a input string with 'a' tags, using regex like this:

var pattern = @"[A-Za-z0-9-]+(\.[A-Za-z0-9-]+)*(\.[A-Za-z]{2,})";

var input = "text1 www.example.com text2 www.example.com text3";

var result = Regex.Replace(input, pattern, string.Format("$0"));

This will create following output:

text1 www.example.com text2 www.example.com text3

Which is wrong as second domain is already tag and it is now tag within tag.

Is there a way to modify regex pattern to ignore matching of second domain substring?

Perhaps by ignoring the '>' char at domain substring start? (or '<' char at the end)

Effectively generating this result:

text1 www.example.com text2 www.example.com text3

Srb1313711 · Accepted Answer

Try this:

 (?i)(?)((w{3}\.)[^.]+\.[a-z]+(\.?[a-z])*)

This is assuming each domain begins with www. You can use your replace with this at will work unless the domain is preceded with a >. This may not be exactly what you are looking for but its somewhere to start, research negative look behinds as i believe this will help you.

Regex replace domain substring with html tag in C#

Answers (2)

Related Questions