How to match with a regex until the last occurrence?

Question

I am using Python and need a regex to get the Contacts link from a web page. I tried (.*?)Contacts(.*?), and the result is:

href="/ru/o-nas.html"  id="menu263" title="About">AboutPhotoContacts

but I need only the last link, like:



href="/ru/kontakt.html" class="last" id="menu583" title="">Contacts


What regex pattern should I use?

Python code:

match = re.findall('(.*?)Contacts(.*?)', body)
if match:
    for m in match:
        print(''.join(m))

AKS · Accepted Answer

Since you are parsing HTML, I would suggest using BeautifulSoup instead of a regex:

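# sample html reconstructed from the question; the Photo link's
# attributes were not shown there, so href="#" is a placeholder
html = '''<a href="/ru/o-nas.html" id="menu263" title="About">About</a>
<a href="#">Photo</a>
<a href="/ru/kontakt.html" class="last" id="menu583" title="">Contacts</a>'''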

from bs4 import BeautifulSoup
doc = BeautifulSoup(html, 'html.parser')  # name the parser explicitly
aTag = doc.find('a', id='menu583') # id for Contacts link
print(aTag['href'])
# '/ru/kontakt.html'
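
If a regex is still preferred, the reason the original pattern fails is that a lazy (.*?) only limits how far the match extends to the right; the match still starts at the earliest possible position, which is the first link. A minimal sketch of one workaround, assuming the links are ordinary <a> tags with double-quoted attributes (the sample markup below is reconstructed from the question, and the Photo link's href is a placeholder), uses negated character classes so the match cannot cross a tag boundary:

```python
import re

# Sample markup reconstructed from the question; the Photo link's
# attributes are not given there, so href="#" is a placeholder.
body = (
    '<a href="/ru/o-nas.html" id="menu263" title="About">About</a>'
    '<a href="#">Photo</a>'
    '<a href="/ru/kontakt.html" class="last" id="menu583" title="">Contacts</a>'
)

# [^>]* cannot cross a tag boundary, so the match has to start inside
# the <a ...> tag that directly precedes the text "Contacts".
pattern = re.compile(r'<a\s[^>]*href="([^"]*)"[^>]*>\s*Contacts')

match = pattern.search(body)
contacts_href = match.group(1) if match else None
print(contacts_href)
```

This is only a sketch: it breaks on single-quoted attributes or on markup nested inside the link text, which is why an HTML parser is the more robust choice.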
