Reputation: 4162
I have a pipeline that works perfectly and now I want to check its top-k accuracy. I can obviously do this the hard way with a loop, but how can I do the same using the built-in function?
import numpy as np
from sklearn.metrics import top_k_accuracy_score
from sklearn.model_selection import train_test_split

# x and y can be any features and labels. Please assume x is already defined.
y = df_whole['target'].values.ravel()  # 1-D labels, currently strings
set_y = set(y)  # unique classes
class_int_mapping = dict(zip(set_y, range(len(set_y))))  # e.g. car: 0, bus: 1, ...
y = np.array([class_int_mapping[i] for i in y])  # array (a list also works)
x_train, x_test, y_train, y_test = train_test_split(x, y, test_size=0.25, stratify=y)
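(Side note: the same string-to-integer mapping could presumably also be done with scikit-learn's LabelEncoder; a minimal sketch, equivalent to the dict above:)
from sklearn.preprocessing import LabelEncoder
le = LabelEncoder()
y = le.fit_transform(df_whole['target'].values.ravel())  # integer-encoded labels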
When I train and test my pipeline, it gives the desired results. Please assume any classification pipeline. When I do:
print(pipeline.predict_proba(x_train).shape, pipeline.predict_proba(x_test).shape)
>> (19794, 269) (6599, 269)
and when I do:
top_k_accuracy_score(y_test,pipeline.predict_proba(x_test), k = 5)
it gives me this error:
ValueError: Number of classes in 'y_true' (255) not equal to the number of classes in 'y_score' (269).
How can this be happening?
P.S.: For now, I am doing it like this:
probs = pipeline.predict_proba(x_test)
topn = np.argsort(probs, axis=1)[:, -5:]  # column indices of the 5 highest probabilities per row
top_k_acc_result = np.mean([y_test[k] in topn[k] for k in range(len(topn))])
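(A caveat on this manual version: it assumes column i of predict_proba corresponds to class label i. In general the column order is given by pipeline.classes_, so a safer sketch, assuming a fitted sklearn pipeline, would map the indices through it:)
probs = pipeline.predict_proba(x_test)
top5_idx = np.argsort(probs, axis=1)[:, -5:]   # column indices of the 5 largest probabilities
top5_labels = pipeline.classes_[top5_idx]      # map column indices to the actual class labels
manual_top5_acc = np.mean([y_test[k] in top5_labels[k] for k in range(len(top5_labels))])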
Upvotes: 0
Views: 1274
Reputation: 46978
Some of the classes the model was trained on do not appear in y_true, so the number of columns in the probability matrix does not tally with the number of distinct classes in y_true. You can pass the full label set with top_k_accuracy_score(..., labels=...).
For example:
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import top_k_accuracy_score
from sklearn.model_selection import train_test_split

X, Y = make_classification(n_samples=500, n_classes=6, n_informative=7, random_state=33)
x_train, x_test, y_train, y_test = train_test_split(X, Y, test_size=0.25, stratify=Y)

clf = RandomForestClassifier()
clf.fit(x_train, y_train)
It works well if we do:
top_k_accuracy_score(y_test,clf.predict_proba(x_test), k = 2)
If for some reason class 5 is missing from y_true, the same ValueError is raised:
ix = y_test != 5  # drop all rows whose true label is class 5
top_k_accuracy_score(y_test[ix], clf.predict_proba(x_test[ix, :]), k=2)
You can provide the full label set instead:
top_k_accuracy_score(y_test[ix], clf.predict_proba(x_test[ix, :]), k=2, labels=np.unique(Y))
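Applied to your pipeline, the equivalent call would presumably be the following (a sketch, assuming the final step of your fitted sklearn Pipeline is a classifier, so pipeline.classes_ holds the full set of integer labels it was trained on):
top_k_accuracy_score(y_test, pipeline.predict_proba(x_test), k=5, labels=pipeline.classes_)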
Upvotes: 1