Regex to capture an URL

Question

I've extracted an URL from a website in this string form:

@{href=http://download.company.net/file.exe}[0]

I can't figure out pattern how to get this part out of it: http://download.company.net/file.exe so I can use it as URL to download file.

From my point of view the logic would be, that I need to first match "http" as beggining of a string, wildcard inbetween and then match "}", but not include it in final output. So IDK ...[http]*\} (I know that this "syntax" of mine is totally wrong, but you get the idea)

Reason I dont want to include "exe" to pattern, is that file extension could be "msi" and I want it to be more universal. Also some good and comprehensive PS regex article would help me greatly (with inexperience in mind) - I really didnt find any "newbie friendly" or comprehensive enough to understand this topic.

Martin Brandl · Accepted Answer

You can either, use [regex]::match or -replace.

In the following example, I capture everything after href= that is not a starting curly bracket }:

'@{href=http://download.company.net/file.exe}[0]' -replace '@{href=([^}]+).*', '$1'

Output:

http://download.company.net/file.exe

Regex to capture an URL

Answers (2)

Related Questions