Reputation: 1913
Hello, I have this XPath code and I want to extract the link and the data of each question.
<li class="qTile P-14 Bdbx-1g Bgc-w">
<div class="Lh-16 ">
<h3 id="20151012074222AAY5Tdd" class="qstn-title Fz-15 Fw-b Wow-bw"><a data-rapid_p="1" class="Clr-b" data-ylk="slk:qtitle" href="/question/index?qid=20151012074222AAY5Tdd">Google or Yahoo?</a></h3>
<div class="desc">
Both
</div>
<div class="long-desc Mah-130 Ovy-s D-n">
Both
</div>
<div class="Fz-12 Clr-888">
75 answers
<span class="Fz-14">·</span>
<a data-rapid_p="2" class="Clr-b" data-ylk="slk:cat" href="/dir/index/discover?sid=2115500141">Google</a>
<span class="Fz-14">·</span>
3 days ago
</div>
In this snippet only the data field is present; the XPath for taking the link of the question works well. I tried this XPath and it works fine in the browser, but when I use it with Selenium in Python I get an XPath error.
post_elems = self.driver.find_elements_by_xpath('//li[contains(@class,"qTile P-14 Bdbx-1g Bgc-w")]')
i = 0
for post in post_elems:
    data_of_question = post.find_element_by_xpath('.//div[contains(@class,"Fz-12 Clr-888")]/text()[last()]')
    url = post.find_element_by_xpath('.//h3/a[contains(@class,"Clr-b")]')
    url_accodare = url.get_attribute('href')
Upvotes: 1
Views: 140
Reputation: 474021
The problem is that an XPath expression in Selenium has to point to an element, not a text node. In other words, the
.//div[contains(@class,"Fz-12 Clr-888")]/text()[last()]
expression is illegal there, and you have to get the question date in a different way.
For instance, you can get the complete text of the element and use regular expressions to extract the part you are interested in. Example:
import re
value = post.find_element_by_xpath('.//div[contains(@class,"Fz-12 Clr-888")]').text
match = re.search(r"(\d+ days ago)", value)
print(match.group(1))
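A small variation on the same idea (my own sketch, not part of the original answer): the parts of that block are separated by "·" characters, so the relative date can also be taken as the last piece after splitting on them:
# Hypothetical variation: split the "75 answers · Google · 3 days ago" text
# on the "·" separators and keep the last piece.
value = post.find_element_by_xpath('.//div[contains(@class,"Fz-12 Clr-888")]').text
date_text = value.split('·')[-1].strip()
print(date_text)  # e.g. "3 days ago"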
Or, you can also grab the outerHTML of the element and get the text you need by parsing it with, for instance, BeautifulSoup:
from bs4 import BeautifulSoup

elm = post.find_element_by_xpath('.//div[contains(@class,"Fz-12 Clr-888")]')
data = elm.get_attribute("outerHTML")
soup = BeautifulSoup(data, "html.parser")  # explicit parser avoids the bs4 warning
print(soup.find_all(text=True)[-1])
There are, of course, other options for extracting the desired text node as well.
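For instance, one more sketch (not part of the original answer, and assuming the relative date is always the last text node of that div): let the browser return that text node directly via execute_script, where self.driver is the same WebDriver instance used in the question:
elm = post.find_element_by_xpath('.//div[contains(@class,"Fz-12 Clr-888")]')
# Ask the browser for the textContent of the div's last child node,
# which in the sample HTML is the "3 days ago" text node.
date_text = self.driver.execute_script("return arguments[0].lastChild.textContent;", elm)
print(date_text.strip())  # e.g. "3 days ago"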
Upvotes: 2