Python selenium gives me an empty string for text

I have an HTML page that contains 40 of the following div

<div class='movie-featured'>
    <div class="item analytics">
        <div class="movie-details">
            <div class="movie-rating-wrapper">
                <span class="movie-rating-summary">
                    <span>some text</span>
                </span>
            </div>
        </div>
    </div>
</div>

and I'm trying to get the text from this span some text rom inside each one of the 40 divs via: find_element_by_css_selector('span.moview-rating-summary').find_element_by_tag_name('span').text

Output:

['', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '6/10', '', '', '', '', '', '', '', '', '7.5/10', '', '', '', '', '']

As you can see, I only get text from few spans and not all of them.

I also tried: find_element_by_tag_name('span').get_attribute('textContent') and find_element_by_tag_name('span').get_attribute('innerHTML').

But still the same result

Any ideas how to fix that??

Code trials:

from selenium import webdriver
import time
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC
from selenium.webdriver.common.by import By
from selenium.common.exceptions import TimeoutException
browser = webdriver.Chrome()
delay = 10 
browser.get("www.example.com")


browser.execute_script("window.scrollTo(0,document.body.scrollHeight)")
time.sleep(2)
images =[]

myElem = WebDriverWait(browser, delay).until(EC.presence_of_element_located((By.CLASS_NAME, 'item-responsive')))


body = browser.find_element_by_class_name('movie-featured') # body of images container

imageItems = body.find_elements_by_css_selector('div.item.analytics')  #list of divs that hold movies images


for item in imageItems:
    
    rate = item.find_element_by_css_selector('span.moview-rating-summary').text

    images.append(rate)
    
print(images)
browser.close()

Thank you guys for all the help you gave. I fixed the problem by changing my code as follows:

body = browser.find_element_by_class_name('movie-featured')
rateDivs = body.find_elements_by_xpath('//div[@class="moview-rating-wrapper"]')
ratelist = []
for div in rateDivs:
    span = div.find_element_by_css_selector('span.moview-rating-summary')
    ratespan = span.find_element_by_tag_name('span')
    rate = ratespan.text
    if len(rate) > 0:
        ratelist.append(rate)
    else:
        continue
print(ratelist)

browser.close()

I really appreciate all the time you spent to help me.

Upvotes: 1

Answers (2)

undetected Selenium

Reputation: 193298

To extract the texts e.g. some text, from allof the  using Selenium and python you have to induce WebDriverWait for visibility_of_all_elements_located() and you can use either of the following Locator Strategies:

Using CSS_SELECTOR and get_attribute("innerHTML"):

print([my_elem.get_attribute("innerHTML") for my_elem in WebDriverWait(driver, 20).until(EC.visibility_of_all_elements_located((By.CSS_SELECTOR, "span.movie-rating-summary>span")))])

Using XPATH and text attribute:

print([my_elem.text for my_elem in WebDriverWait(driver, 20).until(EC.visibility_of_all_elements_located((By.XPATH, "//span[@class='movie-rating-summary']/span")))])

Note : You have to add the following imports :

from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.common.by import By
from selenium.webdriver.support import expected_conditions as EC

Outro

Link to useful documentation:

get_attribute() method Gets the given attribute or property of the element.
text attribute returns The text of the element.
Difference between text and innerHTML using Selenium

Upvotes: 1

TheLegend42

Reputation: 71

Try this:

driver.find_element_by_xpath('//span[@class="movie-rating-summary"]/span[1]')

Upvotes: 0

Python selenium gives me an empty string for <span> text<span>

Answers (2)

Outro

Related Questions

Python selenium gives me an empty string for &lt;span&gt; text&lt;span&gt;

Answers (2)

Outro

Related Questions

Python selenium gives me an empty string for <span> text<span>