How can I fetch the html elements within a bounding box using playwright?

Question

I have a full page screenshot taken with playwright and a object detection model which returns some bounding box coordinates over it. I would like to fetch the corresponding html element's text that is contained within the bounding box.

So far, I am using Playwright's Python API like this:

with sync_playwright() as p:
     browser = p.webkit.launch()
     page = browser.new_page()
     page.goto(url)     
     element_text = page.evaluate("([x, y]) => document.elementFromPoint(x,y).textContent",
                                   [bbox_x_center, bbox_y_center])

But it seems that the coordinates further down in the page are not found. Is it possible that the lower elements are not in the viewport and I need to scroll down to find them? And in general is there an easier way to retrieve the element handles of interest within a bounding box from a screenshot?

How can I fetch the html elements within a bounding box using playwright?

Answers (1)

Related Questions