mechanize opening multiple pages

Question

I am using mechanize and beautifulsoup to create a python script to pull some data from a web page. The scraping works fine but the problem I am having is in moving to multiple pages. Is there a way to move between pages in mechanize in a loop?

Here's what I tried

Browser().follow_link(text_regex="Next")

but it goes to the next page which is fine but if the "Next" button doesn't exist it just dies. I'm not sure how to either make a better loop or really just check if the link exists before running the above follow link command.

Most of the examples and documentation I have found seems to only work on one page.

phooji · Accepted Answer

How does your code "just die"? If it throws an exception, you can catch it and do something to handle it propery (inside your loop as is):

try:
   Browser.follow_link(text_regex="Next")
except Exception:
   print "No more next button; terminating loop (but not dying mysteriously)"
   break

mechanize opening multiple pages

Answers (1)

Related Questions