hytromo
hytromo

Reputation: 1531

Transform URL string into normal string in Python (%20 to space etc)

Is there any way in Python to transform this %CE%B1%CE%BB%20 into this αλ which is its real representation?

Upvotes: 49

Views: 65046

Answers (3)

olmerg
olmerg

Reputation: 435

python 3 answer

import urllib 
urllib.parse.unquote('/El%20Ni%C3%B1o/')

'/El Niño/'

source

Upvotes: 4

Igor Chubin
Igor Chubin

Reputation: 64563

For python 2:

>>> import urllib2
>>> print urllib2.unquote("%CE%B1%CE%BB%20")
αλ 

For python 3:

>>> from urllib.parse import unquote
>>> print(unquote("%CE%B1%CE%BB%20"))
αλ

And here's code that works in all versions:

try:
    from urllib import unquote
except ImportError:
    from urllib.parse import unquote

print(unquote("%CE%B1%CE%BB%20"))

Upvotes: 101

Mark Byers
Mark Byers

Reputation: 838216

There are two encodings in play here. Your string has first been encoded as UTF-8, then each byte has been percent-encoded.

To get the original string back you need to first unquote it, and then decode it:

>>> import urllib
>>> s = '%CE%B1%CE%BB%20'
>>> result = urllib.unquote(s).decode('utf8')
>>> print result
αλ 

Note that you need a Unicode enabled console in order to display the value (if you get an error with the print statement, try running it in IDLE).

Upvotes: 11

Related Questions