Reputation: 3
I am trying to get the sizes from here.
The content I want:
However, I am receiving:
[<div class="options" id="productSizeStock">
<button class="btn options-loading" disabled="" type="button">
</button>
<button class="btn options-loading" disabled="" type="button">
</button>
<button class="btn options-loading" disabled="" type="button">
</button>
<button class="btn options-loading" disabled="" type="button">
</button>
<button class="btn options-loading" disabled="" type="button">
</button>
<button class="btn options-loading" disabled="" type="button">
</button>
<button class="btn options-loading" disabled="" type="button">
</button>
<button class="btn options-loading" disabled="" type="button">
</button>
<button class="btn options-loading" disabled="" type="button">
</button>
<button class="btn options-loading" disabled="" type="button">
</button>
<button class="btn options-loading" disabled="" type="button">
</button>
<button class="btn options-loading" disabled="" type="button">
</button>
I also tried using requests-html to see if it was a JavaScript rendering issue, but I was just receiving empty values.
Here is my code:
import requests
import time
import randomheaders
from bs4 import BeautifulSoup

proxy = {'''PROXY'''}

while True:
    try:
        source = requests.get("https://www.size.co.uk/product/grey-nike-air-max-98-se/132114/", proxies=proxy, headers=randomheaders.LoadHeader(), timeout=30).text
        soup = BeautifulSoup(source, features="lxml")
        print(soup.find_all("div", class_="options"))
    except Exception as e:
        print(e)
        time.sleep(5)
Upvotes: 0
Views: 6102
Reputation: 4783
From a technical point of view your code is correct. Because this website uses JavaScript to render itself, the sizes are stored at a different URL, which is the following:
https://www.size.co.uk/product/grey-nike-air-max-98-se/132114/stock
As you can see, you just have to add /stock to your initial URL.
That being said, try replacing this:
source = requests.get("https://www.size.co.uk/product/grey-nike-air-max-98-se/132114/", proxies= proxy, headers=randomheaders.LoadHeader(),timeout=30).text
soup = BeautifulSoup(source, features = "lxml")
print(soup.find_all("div", class_="options"))
with:
source = requests.get("https://www.size.co.uk/product/grey-nike-air-max-98-se/132114/stock", proxies= proxy, headers=randomheaders.LoadHeader(),timeout=30).text
soup = BeautifulSoup(source, features = "lxml")
sizes = [x["title"].replace("Select Your UK Size ","") for x in soup.find_all("button",{"data-e2e":"product-size"})]
print(sizes)
Here sizes is a list containing all of the sizes and has the following output:
['6', '7', '7.5', '8', '8.5', '9', '9.5', '10', '10.5', '11', '11.5', '12']
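For completeness, a self-contained sketch that folds this fix into the question's retry loop might look like the one below. The empty proxies dict stands in for the question's PROXY placeholder, and the break is an addition so the loop stops after a successful fetch:
import requests
import time
import randomheaders
from bs4 import BeautifulSoup

# Stand-in for the question's PROXY placeholder, e.g. {"https": "http://host:port"}
proxy = {}

while True:
    try:
        # The /stock endpoint returns the already-rendered size buttons
        source = requests.get("https://www.size.co.uk/product/grey-nike-air-max-98-se/132114/stock", proxies=proxy, headers=randomheaders.LoadHeader(), timeout=30).text
        soup = BeautifulSoup(source, features="lxml")
        sizes = [x["title"].replace("Select Your UK Size ", "") for x in soup.find_all("button", {"data-e2e": "product-size"})]
        print(sizes)
        break  # stop retrying once the sizes have been fetched
    except Exception as e:
        print(e)
        time.sleep(5)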
Hope this helps!
Upvotes: 1
Reputation: 2679
It is probably because the information you are searching for is added dynamically by a client-side script (JavaScript in this case). I don't see an easy way to get the information with requests alone; if that is the case, you should analyse the page's scripts more closely and, if really motivated, perform the proper AJAX requests yourself.
So, to recap, you are not getting the correct results because any JS-generated content has to be rendered into the document. When you fetch the HTML page, you fetch only the initial document.
A possible solution (this solution is for Python 3.6 only) consists of using requests-HTML instead of requests:
This library intends to make parsing HTML (e.g. scraping the web) as simple and intuitive as possible.
Install requests-html: pipenv install requests-html
Make a request to the page's url:
from requests_html import HTMLSession
session = HTMLSession()
r = session.get(a_page_url)
Render the response to get the Javascript generated bits:
r.html.render()
This module offers scraping with JavaScript support, which is exactly what you need.
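Applied to the page from the question, a minimal sketch could look like the following. The CSS selector for the size buttons is an assumption based on the markup used in the other answer, and the first call to render() downloads Chromium:
from requests_html import HTMLSession

session = HTMLSession()
r = session.get("https://www.size.co.uk/product/grey-nike-air-max-98-se/132114/")

# Execute the page's JavaScript so the size buttons get filled in
r.html.render()

# Assumed selector: the size buttons carry a data-e2e="product-size" attribute
buttons = r.html.find('button[data-e2e="product-size"]')
sizes = [b.attrs.get("title", "").replace("Select Your UK Size ", "") for b in buttons]
print(sizes)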
Upvotes: 3