taabouzeid
taabouzeid

Reputation: 939

How can I get title & scripts inside a webpage using webkit + gtk?


Here 's my code snippet

import gtk, webkit
window = gtk.Window()
browser = webkit.WebView()
url = "www.google.com"
browser.open(url)

Now I wanna get the web page title, script tags inside. So how can I do that ?

The documentation is not clear at these points and I only found documentation for Objective-C and I am trying to find my way there. Please if you know where can I get a better reference not necessarily for Python. C, C++ would be fine also.

Thanks

Upvotes: 1

Views: 1194

Answers (2)

Mew
Mew

Reputation: 1049

I think the following should work (I can't try it out right now):

def title_changed(widget, frame, title):
    print title

browser.connect('title-changed', title_changed)

There is some documentation here and here and two examples in the demo directory from the source tarball.

Upvotes: 1

Eric Fortin
Eric Fortin

Reputation: 7603

It is not bound to the technology used to retrieve the html. Once browser has opened it, just parse the html with beautiful soup or anything that supports XPath for example.

Upvotes: 0

Related Questions