Scraping a Span tag without Class name and does not appear in all Elements

Question

I am web scraping a review page using Selenium in Python. I want to extract the rating of each review (ie. Extract 7 from 7/10 in a review). The HTML element constructs like this:

    
         
            
               
               7             # What I want to extract
               /10

The element does not have any class name, so I assume to extract it using the class user-rating under the span tag:

    rating = driver.find_elements_by_class_name('user-rating')

But how should I extract the span tag within another span tag? I cannot refer it to any class name.

In addition, not every review contains a rating, so when it scrapes to a review without rating, it prompts me the error:

    NoSuchElementException: Message: no such element: Unable to locate element: {"method":"css selector","selector":".rating-other-user-rating"} (Session info: chrome=87.0.4280.66)

This is what I have tried out so far:

    review = driver.find_elements_by_class_name("review")
    rating_ls = []
    
    for i in review:
        rating = i.find_element_by_class_name('rating-other-user-rating').text
        # If rating exists, append it to the list, otherwise append "N/A" 
        rating_ls.append(rating[0] if rating else "N/A")

I appreciate if anyone can help me with this. Thanks a lot in advance!

DonnyFlaw · Accepted Answer

Try to wait for elements (probably they added by JS code):

from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC

reviews = WebDriverWait(driver, 10).until(EC.presence_of_all_elements_located((By.CLASS_NAME, "review-container")))

for review in reviews:
    _rating = review.find_elements_by_class_name('rating-other-user-rating')
    rating = _rating[0].text if _rating else 'N/A' 
    _comment = review.find_elements_by_class_name('content')
    comment = _comment[0].text if _comment else 'N/A' 
    print(rating + ": " + comment)

Scraping a Span tag without Class name and does not appear in all Elements

Answers (2)

Outro

Related Questions