regex for extracting all urls from string excluding period for terminating strings

Question

I'm trying to extract URL from a piece of string I have different posts that contains URL in their message. I've prepared a pattern to match but it's not working properly. I have asked the same question here but forgot to add this case in that so I'm asking a new question for it.

Tried Pattern

\b(\.?)(?:https?://)?(?:(?i:[a-z]+\.)+)[^\s,]+\b

CODE

for ( $i = 0; $i < $resultcount; $i ++ ) {
    $pattern = '%\b(\.?)(?:https?://)?(?:(?i:[a-z]+\.)+)[^\s,]+\b%';
    $message = (string)$result[$i]['message'];
    preg_match_all($pattern,$message,$match);
    print_r($match);
    }

A Example of my post like this

"This is just a post to test regex for extracting URL. http://google.com, https://www.youtube.com/watch?v=dlw32af https://instagram.com/oscar/ en.wikipedia.org"

Post may have comma or may not have comma for multiple URLs and also it is possible that a string and url doesn't have any space in between like

sometext.http://google.com

regexDemo

Thank you people :)

revo · Accepted Answer

This will match strings which are precisely encoded and have formats like an HTTP URL except those fall into IDN categorization:

(?i)(?:https?://[^"'\s<>(){}]++|[a-z0-9](?<=\b.)[a-z0-9-]*+(?:\.[a-z-]{2,}+)++(?=[/?"'()\s]|:\d++|\Z)[^"'\s<>(){}]*+)

So you will not expect

ftp://username:password@ftpserver/folder/

to be matched.

Live demo

regex for extracting all urls from string excluding period for terminating strings

Answers (2)

Related Questions