Perl regex that matches the first substring specified

Question

I need to extract data from an HTML document and compose an XML document with only interesting information. The way I'm doing this is by transforming the HTML doc into an XML doc, step by step. I have the 5 outermost XML tags in one line each, now I'm trying to structure what's inside of those.

I have a line that's structured this way :

    
      blablabla  title I want  some other stuff  text I don't want  blablabla

What I want is :

    
    link/I/want
     title I want

The regex I have is :

    /a href="(.*)"(.*)>(.*)<\/a>/

hoping to get #$1 = url , $2 = whatever , $3 = title.

This isn't working because it's taking this instead:

    
    link/I/want *some css* > title I want  some other stuff 
    text I don't want

How do I extract the content of the FIRST anchor tag of the line ?

Thanks !

Perl regex that matches the first substring specified

Answers (1)

Related Questions