Python: Extract all childs of a
tag using BeautifulSoup

Question

The tags are like this:


The Taste of Fear (A Suspense Action...

by Jeremy Bates
Free



Another Book

by Jeremy
Free

I am using BeautifulSoup to read the webpage and extract a few details:

Title, Author, Price and Link

The code that I have tried could extract only one of them, but I want all of it in a collection per title.

items = soup.find_all("div", {"class":"zg_itemWrapper"})

for item in items:
    titles = item.find_all("div", {"class":"zg_title"})
    for title in titles:
        print title.text

alecxe · Accepted Answer

You are on the right track.

Use find by class name for every "itemWrapper" found:

items = soup.find_all("div", {"class":"zg_itemWrapper"})

for item in items:
    title_elm = item.find("div", {"class":"zg_title"}).a
    title = title_elm.get_text()
    link = title_elm["href"]

    author = item.find("div", {"class": "zg_byline"}).get_text()
    price = item.find("div", {"class": "zg_price"}).get_text()

    print title, link, author, price

Python: Extract all childs of a <div> tag using BeautifulSoup

Answers (1)

Related Questions

Python: Extract all childs of a &lt;div&gt; tag using BeautifulSoup

Answers (1)

Related Questions

Python: Extract all childs of a <div> tag using BeautifulSoup