Dr.Ripper

Reputation: 89

Selenium.NoSuchElementException with dynamic tables

Please help me with this problem!

At the moment, I am scraping a website using the Selenium Firefox driver in C#. For tables covering future dates, however, the data on this website is filled in dynamically.

While the structure of the table is exactly the same for past and future dates, the tables that are being updated during my Selenium call throw a "NoSuchElementException" for IWebElements that are clearly there.

These are the relevant XPaths copied from the tables: one from a past date, for which everything works fine, and one from a future date, for which the exception is thrown. As you can see, they are identical.

XPath 18-05-2015 (past date)

/html/body/div[1]/div/div[2]/div[5]/div[1]/div/div[1]/div[2]/div[1]/div[7]/div[1]/table/tbody/tr[1]/td[1]/div/a[2]

XPath 05-02-2016 (future date)

/html/body/div[1]/div/div[2]/div[5]/div[1]/div/div[1]/div[2]/div[1]/div[7]/div[1]/table/tbody/tr[1]/td[1]/div/a[2]

Using the FindElements(By.XPath(...)) function, I use two foreach loops to go through the tr and td elements from the XPath above to get some text from the a[2] element. The DOM shown in Firefox Firebug appears identical in both cases. The only difference I have observed between the two tables is that the one for the future date updates its values every few seconds (also resetting the table when viewed via Firebug). Here is the relevant piece of code, with an important comment.

            foreach (var tr in table.FindElements(By.XPath("div/table/tbody/tr")))
            {
                foreach (var td in tr.FindElements(By.XPath("td")))
                {
                    if(td.GetAttribute("innerHTML").Contains("some stuff"))
                    {
                        // This branch is always reached, so the condition is satisfied. x is the relevant value;
                        // it is assigned properly, yet the line below still throws the exception.
                        x = td.FindElement(By.XPath("div/a[2]")).GetAttribute("href").Split('/')[4];
                        bmID = getBookmakerID(bmName);
                    }
                    if(td.GetAttribute("class").Contains("some other stuff"))
                    {

                    }
                }
            }
Have any of you had similar problems before, and were you able to solve them?

Upvotes: 1

Views: 810

Answers (2)

Dr.Ripper

Reputation: 89

Thank you very much for helping. @Buaban - I have added the waits, but I am afraid that didn't change much. It did let the algorithm get further, but eventually it broke down.

In the end, we solved it by using a combination of the Selenium WebDriver and the HtmlAgilityPack. As the code is too specific to actually post (and I don't have it available at the moment), I will share the main philosophy, which is short:

Use the Selenium WebDriver to open and navigate the browser, i.e. to perform actions on the page

Use the HtmlAgilityPack to parse the page source and rip the desired web elements

In conclusion, this approach to handling self-refreshing pages has proven to be extremely stable (it hasn't failed once so far), extremely fast (since the HTML is parsed as a plain string) and flexible (as it uses specialized packages both to navigate the browser and to rip data from it).
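To make the split concrete, here is a minimal sketch of that two-package approach. The URL and the table XPath are placeholders, not the real site; the snippet assumes the Selenium WebDriver and HtmlAgilityPack NuGet packages are installed:

```csharp
using System;
using HtmlAgilityPack;
using OpenQA.Selenium;
using OpenQA.Selenium.Firefox;

class Scraper
{
    static void Main()
    {
        using (IWebDriver driver = new FirefoxDriver())
        {
            // 1) Selenium handles navigation (and any clicks/logins the page needs).
            driver.Navigate().GoToUrl("http://example.com/matches"); // placeholder URL

            // 2) Snapshot the page as a plain string. The live DOM may keep
            //    refreshing, but this string cannot change under us.
            string html = driver.PageSource;

            // 3) HtmlAgilityPack parses the frozen snapshot offline.
            var doc = new HtmlDocument();
            doc.LoadHtml(html);

            // Placeholder XPath; SelectNodes returns null when nothing matches.
            var links = doc.DocumentNode.SelectNodes("//table/tbody/tr/td/div/a[2]");
            if (links != null)
            {
                foreach (var link in links)
                {
                    string href = link.GetAttributeValue("href", "");
                    Console.WriteLine(href);
                }
            }
        }
    }
}
```

Because every element lookup happens on a frozen copy of the HTML, the self-refreshing table can no longer invalidate an element between the existence check and the read.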

Happy coding!

Upvotes: 2

Buaban

Reputation: 5137

Could you add a wait to every step where you call FindElement? See the example below:

using System;
using System.Collections.ObjectModel;
using OpenQA.Selenium;
using OpenQA.Selenium.Support.UI;

IWait<IWebElement> wait = new DefaultWait<IWebElement>(table);
wait.Timeout = TimeSpan.FromSeconds(5);
wait.PollingInterval = TimeSpan.FromMilliseconds(300);
By locator = By.XPath("div/table/tbody/tr");
ReadOnlyCollection<IWebElement> rows;

wait.Until(e => e.FindElements(locator).Count > 0);
rows = table.FindElements(locator);


foreach (var tr in rows)
{

    wait = new DefaultWait<IWebElement>(tr);
    wait.Timeout = TimeSpan.FromSeconds(5);
    wait.PollingInterval = TimeSpan.FromMilliseconds(300);
    locator = By.XPath("td");
    ReadOnlyCollection<IWebElement> cells;

    wait.Until(e => e.FindElements(locator).Count > 0);
    cells = tr.FindElements(locator);

    foreach (var td in cells)
    {
        if (td.GetAttribute("innerHTML").Contains("some stuff"))
        {
            // This branch is always reached, so the condition is satisfied. x is the relevant value;
            // it is assigned properly, yet the lookup below still throws the exception.
            wait = new DefaultWait<IWebElement>(td);
            wait.Timeout = TimeSpan.FromSeconds(5);
            wait.PollingInterval = TimeSpan.FromMilliseconds(300);
            locator = By.XPath("div/a[2]");
            IWebElement link2;

            wait.Until(e => e.FindElements(locator).Count > 0);
            try
            {
                link2 = td.FindElement(locator);
            }
            catch (NoSuchElementException ex)
            {
                throw new NoSuchElementException("Unable to find element, locator: \"" + locator + "\".", ex);
            }
            x = link2.GetAttribute("href").Split('/')[4];
            bmID = getBookmakerID(bmName);
        }
        if (td.GetAttribute("class").Contains("some other stuff"))
        {

        }
    }
}

If it still throws an error, you can easily debug the test in Visual Studio.

Upvotes: 2
