How to use regex for extract domain parts from string

Question

I have a URL string like "https://example.com". I want to show the parts of this URL like protocol, domain, and extension. How can I do this using regular expression?

fool-dev · Accepted Answer

In the Ruby I have used something like this

user:~/workspace $ irb
2.3.4 :018 > url = "https://example.com"
 => "https://.example.com" 
2.3.4 :019 > u = url.match(/(?[\w]+):\/\/(?[\w-]+)\.(?\w+)/)
 => # 
2.3.4 :020 > u[:protocol]
 => "https" 
2.3.4 :021 > u[:domain]
 => "example" 
2.3.4 :022 > u[:extension]
 => "com"

If you have also subdomain then use like below regular expression

2.3.4 :034 > url = "https://sub.example.com"    
2.3.4 :035 > u = url.match(/(?[\w]+):\/\/(?[[a-zA-Z0-9]\.-]+)\.(?\w+)/)
 => # 
2.3.4 :036 > u[:protocol]
 => "https" 
2.3.4 :037 > u[:domain]
 => "sub.example" 
2.3.4 :038 > u[:extension]
 => "com"

In the http://rubular.com/ I have created a snippet for testing regular expression which not failing with subdomain see this Rubular

How to use regex for extract domain parts from string

Answers (2)

Related Questions