Why defining intermediate instances to call a method in Python?

Question

Sorry for this newbie question guys, I am sure the answer is simple but I still don't get it.

Despite some reading, Object Oriented Programming still has darkness I do not understand. It is the second time I see that it is necessary to create a intermediate instance of a class to call a method. Check the example below from the urllib doc:

import urllib
opener = urllib.FancyURLopener({})
f = opener.open("http://www.python.org/")
f.read()

I don't get why it needs to create the opener instance to call the open method. I don't get why the following code doesn't work :

import urllib
f = urllib.FancyURLopener.open("http://www.python.org/")
f.read()

I get an error when trying to call urllib.FancyURLopener.open :

TypeError: unbound method open() must be called with FancyURLopener instance as first argument (got str instance instead)

Could you guys bring lights in this shadows ? Thanks a lot !

torek · Accepted Answer

You're not creating an "intermediate instance" of the class here, you're creating an instance (call it a "primary" instance if you like) of the class.

Here, FancyURLopener is the name of the class itself. The class is a type:

>>> import urllib
>>> type(urllib.FancyURLopener)

(the fact that it's a instead of tells you that this is one of Python2's "old style" classes, but that's a subject for another question entirely :-) ). Because it's a type, to actually use it, you generally have to make an object of that type. Kind of similar to: to use some integers, you have to make integers, not just refer to the int type.

A class-type has (i.e., lets you refer to) a bunch of "class methods", which are really just "functions that take a state-holder variable", which we normally spell self. So you create an instance of the type:

>>> x = urllib.FancyURLopener()

and now you can invoke functions with x.open(...) which really means: urllib.FancyURLopener.open(x, ...). This allows the code inside urllib.FancyURLopener to stash all kinds of state inside x. Want to make HTTP connections use persistence? Keep some state inside x as to whether the connection is still open. Want to manipulate some web-server cookies? Keep some state inside x to help track them down. Anything you (the writer of urllib) want, you can wedge it into x. When your functions are called you get the caller's x as your self, and self.whatever is the state stashed in x.

(Python actually exposes all this supposedly-hidden state to the caller, so people use _spam and __eggs member names to help keep them out of the way, but again that's getting away from the simple "why".)

Why defining intermediate instances to call a method in Python?

Answers (2)

Related Questions