Not able to locate element in Selenium

Question

I'm trying to download a captcha image whose URL and content change every time the page loads. I understand that I can take a screenshot of the browser and crop it at the captcha image's location, but I'm not able to locate the captcha img element.

From the HTML source code I found this

// this script is used to generate the captcha

// when I click on src="/efs/servlet/efs/jsp-ns/captcha.jsp", it leads me to this

Insert title here

This line '' defines the captcha URL, but the number 't=1378993130057' changes dynamically.

I've seen the thread "Download image with selenium python", but I don't understand how the authors found the image locations, such as

img = browser.find_element_by_xpath('//*[@id="cryptogram"]')

and, for the Google captcha [http://www.google.com/recaptcha/demo/recaptcha]:

img = driver.find_element_by_xpath('//div[@id="recaptcha_image"]/img')

I'm using Python 2.6, with Selenium to browse the site.

Update

try:
    browser.save_screenshot('screenshot.png')
    img = browser.find_element_by_xpath('//body/img')
    src = img.get_attribute('src')
    loc = img.location

except Exception,e:
    print e

Output

Message: u'Unable to locate element: {"method":"xpath","selector":"//body/img"}' ; Stacktrace: 
    at FirefoxDriver.prototype.findElementInternal_ (file:///tmp/tmppjlmPW/extensions/fxdriver@googlecode.com/components/driver_component.js:8899)
    at FirefoxDriver.prototype.findElement (file:///tmp/tmppjlmPW/extensions/fxdriver@googlecode.com/components/driver_component.js:8908)
    at DelayedCommand.prototype.executeInternal_/h (file:///tmp/tmppjlmPW/extensions/fxdriver@googlecode.com/components/command_processor.js:10840)
    at DelayedCommand.prototype.executeInternal_ (file:///tmp/tmppjlmPW/extensions/fxdriver@googlecode.com/components/command_processor.js:10845)
    at DelayedCommand.prototype.execute/< (file:///tmp/tmppjlmPW/extensions/fxdriver@googlecode.com/components/command_processor.js:10787)

Update #2

from selenium import webdriver
import datetime
from selenium.webdriver.common.proxy import *


print '[+] Starts at '+ datetime.datetime.now().isoformat()

browser = webdriver.Firefox() 
browser.get("https://www.example.com") 


try:
    browser.save_screenshot('screenshot.png')
    img = browser.find_element_by_xpath('//body/img')
    src = img.get_attribute('src')
    loc = img.location

except Exception,e:
    print e


browser.delete_all_cookies()
browser.close()

print '[+] Done at ' + datetime.datetime.now().isoformat()

Any help is much appreciated.

alecxe · Accepted Answer

You can locate the img tag by XPath, read its src attribute, and then download the image via urlretrieve:

import urllib

img = browser.find_element_by_xpath('//body/img')
src = img.get_attribute("src")
urllib.urlretrieve(src, "captcha.png")
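
Note that urlretrieve issues a second HTTP request, outside the browser session. Since the question says the captcha's URL and content change on every load, the downloaded file may not match the image the browser is showing. Below is a minimal sketch of the screenshot-and-crop alternative mentioned in the question. It assumes Pillow (PIL) is installed, that the screenshot covers the element, and that screenshot pixels map one-to-one to page coordinates. crop_box and save_captcha are hypothetical helper names, and '//body/img' is simply the XPath used above. Recent Selenium 4 releases replace find_element_by_xpath with find_element(By.XPATH, ...).

```python
def crop_box(location, size):
    """Convert Selenium's element.location and element.size dicts into the
    (left, upper, right, lower) box that PIL's Image.crop expects."""
    left = int(location['x'])
    upper = int(location['y'])
    right = left + int(size['width'])
    lower = upper + int(size['height'])
    return (left, upper, right, lower)


def save_captcha(browser, xpath='//body/img', out_path='captcha.png'):
    """Hypothetical helper: crop the captcha out of a browser screenshot."""
    # Imported here so crop_box stays usable without Pillow installed.
    from PIL import Image

    # Same locator call as in the question (older Selenium API).
    img = browser.find_element_by_xpath(xpath)
    browser.save_screenshot('screenshot.png')

    # Assumption: no scrolling offset or device-pixel-ratio scaling.
    box = crop_box(img.location, img.size)
    Image.open('screenshot.png').crop(box).save(out_path)
    return box
```

With the browser object from the question, save_captcha(browser) would write captcha.png next to screenshot.png, without requesting the captcha URL a second time.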
