mysterious Python Pandas lambda function error

Question

I have a pandas dataframe and I have a column called 'email'. I have verified the dtype is object. It contains normally formatted emails such as xxx@yyy.com

When I do this:

$ df['emaillower'] = df['email'].apply(lambda x: x.lower())

I get this:

Traceback (most recent call last):

File "", line 1, in 
df['emaillower'] = df['email'].apply(lambda x: x.upper())

File "C:\ProgramData\Anaconda2\lib\site-packages\pandas\core\series.py", 
line 
2355, in apply
mapped = lib.map_infer(values, f, convert=convert_dtype)

File "pandas\_libs\src\inference.pyx", line 1569, in 
pandas._libs.lib.map_infer (pandas\_libs\lib.c:66440)

File "", line 1, in 
df['emaillower'] = df['email'].apply(lambda x: x.upper())

AttributeError: 'float' object has no attribute 'upper'

What is going on?

Sevy · Accepted Answer

One of the entries in the column 'email' is a float, not a string, and it doesn't know how to do upper() on a float. This is common when one entry is empty and is converted to NaN - this is read as a float and that's the source of your error. Something like this may fix the problem:

df['emaillower'] = df['email'].apply(lambda x: x.upper() if type(x) is str else 'empty')

Also want to note that you call the column emaillower but you are actually making it upper case - this might cause some confusion in the future

mysterious Python Pandas lambda function error

Answers (2)

Related Questions