Mav

Reputation: 117

Is there a way to implement a 2x2 confusion matrix for a multilabel classifier?

I'm interested in creating a 2x2 confusion matrix for a multilabel classification problem that shows only the total true/false positives and negatives.

I have a section of code that generates a full confusion matrix, but with 98 labels it's nearly impossible to read. I don't care much about having the full matrix, so a 2x2 showing only the four counts mentioned above would be ideal; I'm just not sure how to implement it.

Here's the code snippet, if it helps:

import numpy as np
import matplotlib.pyplot as plt
from sklearn.metrics import confusion_matrix, ConfusionMatrixDisplay, classification_report

predictions_d7 = model_d7.predict(x_test_d7)

# Collapse one-hot rows / probability outputs to class indices
y_pred = np.argmax(predictions_d7, axis=1)
y_test = np.argmax(Y_test_d7, axis=1)

print(y_test)
print(y_pred)

cm = confusion_matrix(y_test, y_pred)
disp = ConfusionMatrixDisplay(confusion_matrix=cm, display_labels=label_list)
fig, ax = plt.subplots(figsize=(20, 20))
disp.plot(ax=ax, values_format="d", cmap='gray')
disp.im_.colorbar.remove()
print(classification_report(y_test, y_pred))

Upvotes: 0

Views: 685

Answers (2)

user11989081

Reputation: 8663

You could calculate a 2x2 confusion matrix as follows:

import numpy as np
from sklearn.datasets import make_multilabel_classification
from sklearn.ensemble import RandomForestClassifier

def confusion_matrix(y_true, y_pred):
    # Count true positives, true negatives, false positives and false
    # negatives across all labels at once (micro-aggregated totals).
    tp = np.logical_and(y_pred == 1, y_true == 1).sum()
    tn = np.logical_and(y_pred == 0, y_true == 0).sum()
    fp = np.logical_and(y_pred == 1, y_true == 0).sum()
    fn = np.logical_and(y_pred == 0, y_true == 1).sum()

    return tp, tn, fp, fn

X, y = make_multilabel_classification(random_state=42)

clf = RandomForestClassifier(max_depth=3, random_state=42)
clf.fit(X, y)

y_pred = clf.predict(X)

tp, tn, fp, fn = confusion_matrix(y, y_pred)
print(tp, tn, fp, fn)
# 114 314 7 65
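For comparison, scikit-learn also ships sklearn.metrics.multilabel_confusion_matrix, which returns one 2x2 matrix per label; summing over the label axis collapses them into the same aggregate counts. A minimal sketch, reusing y and y_pred from above:

from sklearn.metrics import multilabel_confusion_matrix

# Shape (n_labels, 2, 2); each slice is [[tn, fp], [fn, tp]] for one label
mcm = multilabel_confusion_matrix(y, y_pred)

# Sum over the label axis to get a single aggregate 2x2 matrix
total = mcm.sum(axis=0)
print(total)  # [[tn, fp], [fn, tp]]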

Upvotes: 0

dvr

Reputation: 370

The reason you get a 2x2 matrix in the case you are hoping for is that there are precisely two labels. You can think of these as labels 1 and 2, or true and false; it doesn't matter.

However, try adding a third label and think about how you might compute "true positive". Is it even possible?

No; it must be a 3x3 matrix, since there are three possibilities for the true class and three for the predicted class, giving nine combinations in total. For example: it was class 1 and you predicted class 1, it was class 1 but you predicted class 2, and so on.

Perhaps you should use the nxn confusion matrix you receive and then use some common metrics to assess your model (accuracy, precision, recall, etc.). You can still compute these in n dimensions. See this Stack Exchange post for a description: https://stats.stackexchange.com/questions/91044/how-to-calculate-precision-and-recall-in-a-3-x-3-confusion-matrix
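As a rough illustration (the 3x3 matrix here is made up for demonstration), per-class precision and recall can be read off an nxn confusion matrix with a few numpy reductions:

import numpy as np

# Hypothetical 3x3 confusion matrix: rows = true class, columns = predicted class
cm = np.array([[50,  2,  3],
               [ 4, 40,  6],
               [ 5,  7, 30]])

tp = np.diag(cm)                  # correct predictions per class
precision = tp / cm.sum(axis=0)   # divide by column sums (total predicted per class)
recall = tp / cm.sum(axis=1)      # divide by row sums (total actual per class)
accuracy = tp.sum() / cm.sum()

print(precision, recall, accuracy)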

Upvotes: 2
