Parser a href tag in a website with bash shell

Question

i have a website with one url inside. it's a href tag

I need to parser a website to keep the "href" value.

In this website page, there is juste one "href" tag. This "href" hasn't class name.

i use a bash shell with curl

for now, i tried this :

curl http://MyWebsite | grep "href=" | cut -d '>' -f4 | cut -d '<' -f1

but no result. i'm novice with bash shell

Someone have an idea ? Thank's for your answers

ralz · Accepted Answer

If you want to keep the href= part

curl -s http://MyWebsite | grep -E -io 'href="[^\"]+"'

If you only want URL without the href=

curl -s http://MyWebsite | grep -E -io 'href="[^\"]+"' | awk -F\" '{print$2}'

Answers (2)