Couldn't extract out the href values form the tags using BS4

Question

I am using BS4 for webpage scraping, and have the below html :


                                Screenshot.docx

Now how to get the value of the href using BS4, couldn't get. Can you help?

Thanks,

Fredrick Brennan · Accepted Answer

for a in soup.find_all('a', {"style": "display:inline; position:relative;"}, href=True):
    href = a['href'].strip()
    href = "http://example.com" + href
print(href)

'http://example.com/aems/file/filegetrevision.do?fileEntityId=8120070&cs=LU31NT9us5P9Pvkb1BrtdwaCrEraskiCJcY6E2ucP5s.xyz'

The built in function strip() is very helpful here. :)

Couldn't extract out the href values form the <a> tags using BS4

Answers (2)

Related Questions

Couldn&#39;t extract out the href values form the &lt;a&gt; tags using BS4

Answers (2)

Related Questions

Couldn't extract out the href values form the <a> tags using BS4