Soup does not find specific class from div

Question

I have tried everything. The response is perfect and I do get what I am supposed to be getting, I just don't understand why I receive an empty array when I'm searching for a div with a specific class (that definitely exists) on the web page. I have tried looking everywhere, but nothing seems to work.

Here's my code:

import requests
import lxml
from bs4 import BeautifulSoup

baseurl = 'https://www.atea.dk/eshop/products/?filters=S_Apple%20MacBook%20Pro%2016%20GB'

response = requests.get(baseurl)
soup = BeautifulSoup(response.content, 'lxml')

productlist = soup.find_all("div", class_="nsv-product ns_b_a")

print(productlist)

I am essentially trying to build a script that emails me when the items from an e-shop are marked as available (pa lager) instead of unavailable (ikke pa lager).

khgb · Accepted Answer

You might need to use Selenium on this one.

The div is, AFAIK, rendered by JS.

BeautifulSoup does not capture JS-rendered content.

from selenium import webdriver
from webdriver_manager.firefox import GeckoDriverManager
from selenium.webdriver.common.keys import Keys
options = webdriver.FirefoxOptions()
options.headless = True
driver = webdriver.Firefox(executable_path=GeckoDriverManager().install(),options=options)
driver.get('https://www.atea.dk/eshop/products/?filters=S_Apple%20MacBook%20Pro%2016%20GB')
k = driver.find_elements_by_xpath("//div[@class='nsv-product ns_b_a']")

Your code below that snippet should contains everything you need, e.g. processing, saving into your database, etc..

Note: That snippet is a bit flawed, e.g. you want to use Chrome, but it only provides an example, so tweak it to your own needs.

Soup does not find specific class from div

Answers (2)

Related Questions