Selenium, phantomJS and Geb headless scraping

Question

I need to scrap data from web site on weekly basis. Data is visible only after click on the page(js function is called). Data is loaded in a table(which can be found by id). This script will be run on a server without browser support. This is my code with geb:

    @Grab("org.gebish:geb-core:0.13.1")
    @Grab("org.seleniumhq.selenium:selenium-firefox-driver:2.52.0")
    @Grab("org.seleniumhq.selenium:selenium-support:2.52.0")
    @GrabExclude('org.codehaus.groovy:groovy-all')  

    import geb.Browser

Browser.drive{
    // driver.webClient.javaScriptEnabled = true
    go "mysite"
    js.loadWeekData()
   println $("div.data-listing").text()
    }

I've searched a lot on this topic but nothing was working as headless scraping with js support. This is the record from Selenium IDE:

driver.findElement(By.linkText("Next")).click();

I was not able to make phantomJS to work with geb.

Edit 1 This is the error from phantom js: java.lang.NoClassDefFoundError: org/openqa/selenium/browserlaunchers/Proxies I've read about the problem with versions but I was not able to resolve it.

@Grab("org.gebish:geb-core:0.13.1")
@Grab("org.seleniumhq.selenium:selenium-firefox-driver:2.52.0")
@Grab("org.seleniumhq.selenium:selenium-support:2.52.0")
@Grab("com.codeborne:phantomjsdriver:1.3.0")
WebDriver driver = new PhantomJSDriver();

        // Load Google.com
        driver.get("http://www.google.com");
        // Locate the Search field on the Google page
        WebElement element = driver.findElement(By.name("q"));

In short I need to perform the first script in headless mode(if possible without installing Xvfb). Preferably groovy or java solution.

Selenium, phantomJS and Geb headless scraping

Answers (1)

Related Questions