Reputation: 53933
I want to concatenate a list of various Python objects into one string. The objects can be literally anything. I thought I could simply do this using the following code:
' '.join([str(x) for x in the_list])
but unfortunately that sometimes gives me a UnicodeEncodeError:
UnicodeEncodeError: 'ascii' codec can't encode character u'\xa0' in position 80: ordinal not in range(128)
in this SO answer I found someone who says that I need to use .encode('utf-8')
, so I changed my code to this:
' '.join([x.encode('utf-8') for x in the_list])
But if the objects are not strings or unicodes but for example int
s I get an AttributeError: 'int' object has no attribute 'encode'
. So this means I need to use some kind of if-statement to check what kind of type it is and how to convert it. But when should I use .encode('utf-8')
and when should I use str()
?
It would be even better if I could also do some kind of oneliner for this, but I wouldn't know how? Does anybody else know? All tips are welcome!
Upvotes: 7
Views: 11521
Reputation: 177901
Python 2.x use repr()
. Python 3.x use repr()
if you don't mind non-ASCII Unicode in the result, or ascii()
if you do:
>>> a=1 # integer
>>> class X: pass
...
>>> x=X() # class
>>> y='\u5000' # Unicode string
>>> z=b'\xa0' # non-ASCII byte string
>>> ' '.join(ascii(i) for i in (a,x,y,z))
"1 <__main__.X object at 0x0000000002974B38> '\\u5000' b'\\xa0'"
Example of differences between 2.X and 3.X repr()
, and 3.X ascii()
:
>>> # Python 3
>>> s = 'pingüino' # Unicode string
>>> s
'pingüino'
>>> repr(s)
"'pingüino'"
>>> print(repr(s))
'pingüino'
>>> ascii(s)
"'ping\\xfcino'"
>>> print(ascii(s))
'ping\xfcino'
>>> # Python 2
>>> s = u'pingüino'
>>> s
u'ping\xfcino'
>>> repr(s)
"u'ping\\xfcino'"
>>> print(repr(s))
u'ping\xfcino'
Upvotes: 6
Reputation: 6631
You can try join
ing with a unicode object instead..
u' '.join(unicode(x) for x in thelist)
Or what you had before will work fine in python3. Just be sure to:
For more details see this talk
Upvotes: 1
Reputation: 236
You could try combining the ternary operator with your current one-liner. Also join
works just fine with a generator, so I don't think you need to create the list. Something like
' '.join(x.encode('utf-8') if isinstance(x, basestring) else str(x)
for x in the_list)
Upvotes: 0