iammrmehul
iammrmehul

Reputation: 820

python - fuzzywuzzy error - object of type float has no len

I am trying to use the fuzzywuzzy library to get similarity score between strings in 2 datasets using the fuzz.ratio function.

Although I am constantly getting the following error :

 File "title_matching.py", line 29, in <module>
    match = match_title(title, all_titles_list, 75)
  File "title_matching.py", line 12, in match_title
    score = fuzz.ratio(title, title2)
  File "/usr/local/lib/python3.7/site-packages/fuzzywuzzy/utils.py", line 38, in decorator
    return func(*args, **kwargs)
  File "/usr/local/lib/python3.7/site-packages/fuzzywuzzy/utils.py", line 29, in decorator
    return func(*args, **kwargs)
  File "/usr/local/lib/python3.7/site-packages/fuzzywuzzy/utils.py", line 45, in decorator
    if len(args[0]) == 0 or len(args[1]) == 0:
TypeError: object of type 'float' has no len()

Below is the module where I am using the library function :

def match_title(title, list_titles, min_score=0):
    # -1 score incase we don't get any matches
    max_score = -1
    # Returning empty name for no match as well
    max_name = ""
    # Iternating over all names in the other
    for title2 in list_titles:
        #Finding fuzzy match score
        score = fuzz.ratio(title, title2)
        # Checking if we are above our threshold and have a better score
        if (score > min_score) & (score > max_score):
            max_name = title2
            max_score = score
    return (max_name, max_score)

I have checked the values of title & list_titles by printing them and they are string and list of strings respectively. I have no idea why this is happening or how to fix it since the error is being generated in the library file.

Upvotes: 0

Views: 2911

Answers (2)

mpal09
mpal09

Reputation: 1

In my case, dropping null values also did not help as I encountered " object of type 'int' has no len" error after. Converting the dataframe column to str helped.

dataframe = dataframe.dropna()

dataframe["column"]=dataframe["column"].astype(str)

Upvotes: 0

DeepSpace
DeepSpace

Reputation: 81604

score = fuzz.ratio(title, title2)

Either title or title2 is a float and not a string.

from fuzzywuzzy import fuzz

print(fuzz.ratio('1', '2'))
# 0
print(fuzz.ratio(1.0, '2'))
  Traceback (most recent call last):
  File "main.py", line 3, in <module>
    print(fuzz.ratio(1.0, '2'))
  File "C:\Python37\lib\site-packages\fuzzywuzzy\utils.py", line 38, in decorator
    return func(*args, **kwargs)
  File "C:\Python37\lib\site-packages\fuzzywuzzy\utils.py", line 29, in decorator
    return func(*args, **kwargs)
  File "C:\Python37\lib\site-packages\fuzzywuzzy\utils.py", line 45, in decorator
    if len(args[0]) == 0 or len(args[1]) == 0:
TypeError: object of type 'float' has no len()

Upvotes: 1

Related Questions