Comparing Ranked List in Python

Question

I have 50 products. For each product, I want to identify the following four related products using similarity measures.

1 related the most
2 partially related
1 not related

I want to compare the ranked list generated by my model (predicted) with the ranked list specified by the domain experts (ground truth).

Through reading, I found that I may use rank correlation based approaches such as Kendall Tau/Spearmen to compare the ranked lists. However, I am not sure if these approaches are suitable as my number of samples is low (4). Please correct me if I am wrong.

Another approach is to use Jaccard similarity (set intersection) to quantify the similarity between two ranked list. Then, I may plot histogram from the setbased_list (see below).

for index, row in evaluate.iterrows():
    d= row['Id']
    y_pred = [3,2,1,0]
    y_true = [row['A'],row['B'],row['C'],row['D']]
    sim = jaccard_similarity_score(y_true, y_pred)
    setbased_list.append(sim)

Is my approach to the problem above correct?
What are other approaches that I may use if I want to take into consideration the positions of elements in the list (weight-based)?

Comparing Ranked List in Python

Answers (1)

Related Questions