Extract all Links in Table with Beautiful Soup

Question

Testing

I'm trying to use BeautifulSoup to get all the href of a tags which are a child of a td tag.

I can run

urls = [x for x in soup.findAll("td")]

to obtain all the td tags and then loop through them manually to see if they contain an a tag and if so extract the href, but is there a cleaner way of doing this in one line?

MendelG · Accepted Answer

Try using the :has() CSS Selector to select all td tags that have an tag.

from bs4 import BeautifulSoup

html = """Testing"""
soup = BeautifulSoup(html, "html.parser")
print([tag.find("a")["href"] for tag in soup.select("td:has(a)")])

Output:

['https://www.blabla.com']

Extract all Links in Table with Beautiful Soup

Answers (1)

Related Questions