Regular expression ignore string if starts with specific substring

Question

I need to find with the regular expression domain names that don't start with the string "http". For example:

https://domain1.com -> Don't match
http://domain2.com -> Don't match
domain3.com -> Match
domain4.co.uk -> Match

I found a regex that almost got this:

(?:[a-zA-Z0-9](?:[a-zA-Z0-9\-]{,61}[a-zA-Z0-9])?\.)+[a-zA-Z]{2,6}

But it also detects "https://domain1.com"

Example given:

https://regex101.com/r/DjDBrx/1/

In this example I want to avoid "https://domain1.com"

Any help would be gratefully appreciated.

Wiktor Stribiżew · Accepted Answer

You can use a word boundary coupled with two negative lookbehinds:

\b(?


The (? are two negative lookbehinds that will get triggered at the same location inside the string (since lookarounds are non-consuming patterns) and - after making sure the location is at the word boundary due to \b - they will fail the match if there is http:// or https:// immediately to the left of the current location.

Regular expression ignore string if starts with specific substring

Answers (2)

Related Questions