How do I restrict regular expression from taking certain words

Question

I have created a regular expression (Regex) for a string that starts with " and ends with ", e.g. "mynameis":

"(?:[^"\\]|\\.)*"

Now I want this expression to exclude the words {we, us, they, and}. How do I do that? For instance, if I input "mynameisalexand", the compiler must ignore {and} and take this string as "mynameisalex".

Wiktor Stribiżew · Accepted Answer

A single regex match cannot return non-contiguous text, so you can still use your regex or an unrolled one:

"[^"\\]*(?:\\.[^"\\]*)*"

See the regex demo

and then remove the substrings you defined from each match with a plain String.Replace (or with a regex like we|and|...).

See the C# demo:

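using System;
using System.Linq;
using System.Text.RegularExpressions;

var input = "\"mynamesarealexandandrew\" \"mynameisalexand\"";
// Match each double-quoted string, allowing backslash-escaped characters inside it
var regex = new Regex(@"""[^""\\]*(?:\\.[^""\\]*)*""", RegexOptions.IgnorePatternWhitespace);
// Take the text of every match and strip the unwanted words from it
var results = regex.Matches(input).Cast<Match>()
                   .Select(p => p.Value.Replace("we", "")
                                       .Replace("us", "")
                                       .Replace("they", "")
                                       .Replace("and", ""))
                   .ToList();
foreach (var s in results)    // DEMO
{
    Console.WriteLine(s);
}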
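
The regex-based alternative mentioned above could look roughly like the sketch below. It assumes the word list {we, us, they, and} from the question; the class name `QuotedWordStripper`, the method `Strip`, and the field names are made up for illustration. Unlike the chained String.Replace calls, a single alternation removes all listed words in one left-to-right pass over each match, so text that only becomes a listed word after an earlier removal is left alone.

```csharp
using System;
using System.Linq;
using System.Text.RegularExpressions;

public static class QuotedWordStripper
{
    // Unrolled pattern from the answer: a double-quoted string that may
    // contain backslash-escaped characters.
    private static readonly Regex Quoted =
        new Regex(@"""[^""\\]*(?:\\.[^""\\]*)*""");

    // Assumed word list, taken from the question. Alternatives are tried
    // left to right at each position.
    private static readonly Regex Unwanted = new Regex("they|and|we|us");

    // Returns every quoted string found in the input, with the unwanted
    // words removed from each one.
    public static string[] Strip(string input) =>
        Quoted.Matches(input)
              .Cast<Match>()
              .Select(m => Unwanted.Replace(m.Value, ""))
              .ToArray();

    public static void Main()
    {
        var input = "\"mynamesarealexandandrew\" \"mynameisalexand\"";
        foreach (var s in Strip(input))
        {
            Console.WriteLine(s);
        }
    }
}
```

For the sample input this yields the same two strings as the String.Replace version. Note that both approaches remove the substrings wherever they occur, including inside longer words.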
