Handling rss redirects with Python/urllib2

Question

Calling urrlib2.urlopen on a link to an article fetched from an RSS feed leads to the following error:

urllib2.HTTPError: HTTP Error 301: The HTTP server returned a redirect error tha t would lead to an infinite loop. The last 30x error message was: Moved Permanently

According to the documentation, urllib2 supports redirects.

On Java the problem was solved by just calling

HttpURLConnection.setFollowRedirects(true);

How can I solve it with Python?

UPDATE

The link I'm having problems with:

http://feeds.nytimes.com/click.phdo?i=8cd5af579b320b0bfd695ddcc344d96c

sleeplessnerd · Accepted Answer

Turns out you need to enable Cookies. The page redirects to itself after setting a cookie first. Because urllib2 does not handle cookies by default you have to do it yourself.

import urllib2
import urllib
from cookielib import CookieJar

cj = CookieJar()
opener = urllib2.build_opener(urllib2.HTTPCookieProcessor(cj))
p = opener.open("http://feeds.nytimes.com/click.phdo?i=8cd5af579b320b0bfd695ddcc344d96c")

print p.read()

Handling rss redirects with Python/urllib2

Answers (2)

Related Questions