Burhan

Reputation: 37

Best regex for extracting Facebook links

I'm looking for the best regex for this, one that covers as many cases as possible.

I search on Google and want to extract the Facebook links from the results. Because Google has no search API that matches the actual Google results one-to-one, I can't use the API.

Instead, I send a normal request to Google, take the HTML, and want to find all Facebook links without the Google parameters.

You can find examples in the regex debugger.

I want to get only these links, if possible.

Here are example strings to search:

/url?q=https://www.facebook.com/pageid/about&sa=U&ved=0ahUKEwi27NeDvfTTAhWBfywKHbuDDS4QjBAIHDAB&usg=AFQjCNH7T2JEP5DzGpiiwT_pMt2oGJ10ow

/url?q=https://www.facebook.com/pageid/%3Fpnref%3Dlhc&sa=U&ved=0ahUKEwiWv8S6vfTTAhUEBiwKHW04AH8Q_BcIyQQoATBu&usg=AFQjCNEZIUb1yqqYtzjPfDEVi4GPHDY5FQ

/url?q=https://www.facebook.com/pageid%3Fpnref%3Dlhc&sa=U&ved=0ahUKEwiWv8S6vfTTAhUEBiwKHW04AH8Q_BcIyQQoATBu&usg=AFQjCNEZIUb1yqqYtzjPfDEVi4GPHDY5FQ

/url?q=https://www.facebook.com/name-name-585606818284844/%3Fpnref%3Dlhc&sa=U&ved=0ahUKEwiWv8S6vfTTAhUEBiwKHW04AH8Q_BcIyQQoATBu&usg=AFQjCNEZIUb1yqqYtzjPfDEVi4GPHDY5FQ

/url?q=https://www.facebook.com/name-name-585606818284844%3Fpnref%3Dlhc&sa=U&ved=0ahUKEwiWv8S6vfTTAhUEBiwKHW04AH8Q_BcIyQQoATBu&usg=AFQjCNEZIUb1yqqYtzjPfDEVi4GPHDY5FQ

This is my regex. It works, but not for all cases. Regex debugger:

https://regex101.com/r/LcYz8c/8

Upvotes: 1

Views: 363

Answers (2)

sniperd

Reputation: 5274

Something like:

"q=(https?://.*?facebook.com/)derName-/"

"q=(https?://.*?facebook.com/)derName(?:%[^%]*%..|[-/])?([^&]+)"

might be what you are looking for. From your example, it looks like you want everything from the http up to the first / after the domain, then to skip derName, and then to grab everything up to the next &. So this uses two capture groups. Hope that helps!
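As a rough illustration of the two capture groups, here is a minimal Python sketch that applies the second pattern unchanged. The helper name `split_link` and the sample string (which contains the literal page name `derName`) are made up for the example and do not come from the question.

```python
import re

# The second pattern from above, copied as-is. The first group captures the
# scheme and domain; the second captures what follows "derName" up to "&".
PATTERN = re.compile(
    r"q=(https?://.*?facebook.com/)derName(?:%[^%]*%..|[-/])?([^&]+)"
)

def split_link(text):
    """Return (base_url, remainder), or None if the pattern does not match."""
    match = PATTERN.search(text)
    return match.groups() if match else None

# Hypothetical input, modelled on the Google redirect links in the question.
sample = "/url?q=https://www.facebook.com/derName-585606818284844/%3Fpnref%3Dlhc&sa=U"
print(split_link(sample))
# ('https://www.facebook.com/', '585606818284844/%3Fpnref%3Dlhc')
```

Note that the pattern only matches links whose path starts with the literal text `derName`, so it would need to be adapted for other page names.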

Upvotes: 1

4EACH

Reputation: 2197

Try this:

q=(https:\/\/www.facebook.com.*?)&

https://regex101.com/r/LcYz8c/11
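For use outside the debugger, a minimal Python sketch of this approach could look like the following. It assumes the HTML is already available as a string; the function name `extract_facebook_links` and the shortened sample markup are made up for the example. The dots are escaped here, a small adjustment so that they match only a literal ".", and Python does not need the "/" escaped. Percent-decoding with `urllib.parse.unquote` is an extra step, not part of the pattern itself.

```python
import re
from urllib.parse import unquote

# The pattern from above, adapted for Python: capture everything from the
# Facebook URL up to (not including) the first "&" of the Google parameters.
FACEBOOK_LINK = re.compile(r"q=(https://www\.facebook\.com.*?)&")

def extract_facebook_links(html):
    """Return the percent-decoded Facebook URLs found in Google redirect links."""
    return [unquote(raw) for raw in FACEBOOK_LINK.findall(html)]

# Hypothetical markup with shortened versions of the question's example strings.
sample = (
    '<a href="/url?q=https://www.facebook.com/pageid/about&sa=U&ved=0ahUKEwi27">a</a>'
    '<a href="/url?q=https://www.facebook.com/pageid/%3Fpnref%3Dlhc&sa=U&ved=0ahUKEwiWv8">b</a>'
)
print(extract_facebook_links(sample))
# ['https://www.facebook.com/pageid/about', 'https://www.facebook.com/pageid/?pnref=lhc']
```

The decoded links still carry Facebook's own `pnref` query string; if that is unwanted, splitting each result on "?" and keeping the first part would remove it.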

Upvotes: 0
