Unable to get table <td> text values using Python BeautifulSoup

Question

I'm trying to get the table text values from a td tag, but I always get an empty list.

Here is the link from where I'm trying to extract table values.

Here is what I have tried.

import requests
from bs4 import BeautifulSoup

response = requests.get('https://www.international-pc.com/product/interfine-629')
soup = BeautifulSoup(response.text, 'html.parser')
tables = soup.find("table", {"id": "documentTable-1"}).find_all("tbody")
print(tables)

Output: []

The HTML table, as rendered:

PRODUCT DATASHEET   LANGUAGE                      DOWNLOAD
Interfine 629       English (United Kingdom)      PDF
Interfine 629       Korean (Korea, Republic of)   PDF
Interfine 629       Chinese (China)               PDF

I want to extract the text values of all three rows from the table.

Any suggestions?

bharatk · Accepted Answer

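The table on https://www.international-pc.com/product/interfine-629 is rendered dynamically: its data is loaded by a JavaScript/AJAX request after the initial page load, so it is not in the HTML that requests receives. Try the Selenium automation library instead. It drives a real browser, which lets you scrape pages whose content is rendered dynamically.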
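One way to check this diagnosis is to look at what a plain HTTP fetch would hand to BeautifulSoup. The sketch below uses a made-up HTML string (STATIC_HTML, an assumption standing in for the server's initial response, not the site's real markup) in which the table exists but has no tbody yet. That would explain why the original code printed [] rather than raising an error: find() locates the table, but find_all("tbody") has nothing to return. The helper name find_table_bodies is hypothetical.

```python
from bs4 import BeautifulSoup

# Hypothetical stand-in for the HTML the server sends before any JavaScript runs.
# The real page's markup may differ; only the table id comes from the question.
STATIC_HTML = """
<table id="documentTable-1">
  <thead>
    <tr><th>PRODUCT DATASHEET</th><th>LANGUAGE</th><th>DOWNLOAD</th></tr>
  </thead>
</table>
"""


def find_table_bodies(html, table_id="documentTable-1"):
    """Return the <tbody> elements of the table, or [] if the table is missing."""
    soup = BeautifulSoup(html, "html.parser")
    table = soup.find("table", {"id": table_id})
    if table is None:
        return []
    return table.find_all("tbody")


bodies = find_table_bodies(STATIC_HTML)
print(len(bodies))  # the static markup has no <tbody>, so nothing is found
```

If the same check against response.text from requests also finds the table but no rows, the rows are being added in the browser, and a tool that executes JavaScript is needed.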

Try this:

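from bs4 import BeautifulSoup
from selenium import webdriver
from selenium.webdriver.chrome.service import Service

# Selenium 4 takes the driver path via a Service object
# (Selenium 3 accepted it as the first positional argument).
driver = webdriver.Chrome(service=Service("/usr/bin/chromedriver"))
driver.get('https://www.international-pc.com/product/interfine-629')

# Parse the HTML after the browser has rendered it, then close the browser
soup = BeautifulSoup(driver.page_source, 'lxml')
driver.quit()

tables = soup.find("table", {"id": "documentTable-1"}).find("tbody")

# Print every cell's text, plus the link target when the cell contains one
for tr in tables.find_all("tr"):
    for td in tr.find_all("td"):
        print(td.text)
        link = td.find("a", href=True)

        if link is None:
            continue
        print(link['href'])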

Output:

Interfine 629
Chinese (China)
PDF
https://international.brand.akzonobel.com/m/6980eb615ebe99f0/original/Interfine_629_chi_s_A4_20150205.pdf
Interfine 629
Korean (Korea, Republic of)
PDF
https://international.brand.akzonobel.com/m/664b77540ff01960/original/Interfine_629_kor_A4_19000101.pdf
Interfine 629
English (United Kingdom)
PDF
https://international.brand.akzonobel.com/m/1ff7b0196600886b/original/Interfine_629_eng_A4_20151012.pdf

Here '/usr/bin/chromedriver' is the path to the Selenium web driver (ChromeDriver) executable.

Download the Selenium web driver for the Chrome browser:

http://chromedriver.chromium.org/downloads

Install the web driver for the Chrome browser:

https://christopher.su/2015/selenium-chromedriver-ubuntu/

Selenium tutorial:

https://selenium-python.readthedocs.io/
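
Because the rows are filled in by JavaScript, reading driver.page_source immediately after driver.get() can race with the page's own requests. Here is a minimal sketch of one way to guard against that, assuming Selenium 4 and assuming the rows end up under #documentTable-1 tbody tr as in the code above: wait explicitly for the first row, then hand the HTML to BeautifulSoup. The function names (fetch_rendered_html, extract_rows), the 15-second timeout, and the SAMPLE_HTML string with its example.com link are illustrative assumptions, not taken from the site.

```python
from bs4 import BeautifulSoup

TABLE_ID = "documentTable-1"  # table id taken from the question


def fetch_rendered_html(url, timeout=15):
    """Load the page in Chrome and return its HTML once a table row exists."""
    # Selenium is imported here so extract_rows() can be used without a browser.
    from selenium import webdriver
    from selenium.webdriver.common.by import By
    from selenium.webdriver.support import expected_conditions as EC
    from selenium.webdriver.support.ui import WebDriverWait

    driver = webdriver.Chrome()  # assumes Selenium can locate a ChromeDriver
    try:
        driver.get(url)
        # Block until at least one row is present, or raise TimeoutException.
        WebDriverWait(driver, timeout).until(
            EC.presence_of_element_located(
                (By.CSS_SELECTOR, f"#{TABLE_ID} tbody tr")
            )
        )
        return driver.page_source
    finally:
        driver.quit()  # always close the browser, even on errors


def extract_rows(html):
    """Return one dict per table row: cell texts plus the first link, if any."""
    soup = BeautifulSoup(html, "html.parser")
    table = soup.find("table", {"id": TABLE_ID})
    if table is None:
        return []
    rows = []
    for tr in table.select("tbody tr"):
        cells = [td.get_text(strip=True) for td in tr.find_all("td")]
        link = tr.find("a", href=True)
        rows.append({"cells": cells, "href": link["href"] if link else None})
    return rows


# Hypothetical markup shaped like the rendered table, for a browser-free check.
SAMPLE_HTML = """
<table id="documentTable-1"><tbody>
  <tr><td>Interfine 629</td><td>English (United Kingdom)</td>
      <td><a href="https://example.com/eng.pdf">PDF</a></td></tr>
</tbody></table>
"""

for row in extract_rows(SAMPLE_HTML):
    print(row["cells"], row["href"])
```

With a browser available, extract_rows(fetch_rendered_html('https://www.international-pc.com/product/interfine-629')) would chain the two steps. Keeping the parsing separate from the browser step makes it possible to test the extraction on saved HTML without launching Chrome.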
