Get href within a table

Question

Sorry, has most likely been asked before but I can't seem to find an answer on stack/from search engine.

I'm trying to scrape some data from a table, but there are href links which I need to get. Html as follows:


**1)**
 West Drayton



    
    **2)**


     
    





So far I have used the following: 

for table in soup.findAll('table', {'class': 'featprop results'}):
    for tr in table.findAll('tr'):
        for a in tr.findAll('a'):
            print(a)


Which returns both 1 and 2 in the above html, could anyone help me strip out just the href link?

宏杰李 · Accepted Answer

for table in soup.findAll('table', {'class': 'featprop results'}):
    for tr in table.findAll('tr'):
        for a in tr.findAll('a'):
            print(a['href'])

out:

/lettings-search-results?task=View&itemid=136
/lettings-search-results?task=View&itemid=136

Attributes

EDIT:

links = set() # set will remove the dupilcate
for a in tr.findAll('a', href=re.compile(r'^/lettings-search-results?')):
    links.add(a['href'])

regular expression

Get href within a table

Answers (2)

Related Questions