I'm not understanding .pred_class in classification (using logistic regression)

Question

I have a pretty simply problem where my outcome is binary and I am trying to use logistic regression (using tidymodels) to classify based on a few predictors (some of which are well-known as good predictors).

I coded the factor outcome as 0 and 1 (1=positive and that what I am mostly interested in).

When I run the predict function with both types="class" and types="prob" I get columns named: .pred_class, .pred_0, and .pred_1.

Then when, for example, plotting the ROC curve I am wondering whether I should use

roc1 <- roc_curve(data_test_pred, outcome, .pred_1)

or

roc1 <- roc_curve(data_test_pred, outcome, .pred_0).

The first (which I would have thought was correct) gives a bad ROC curve below the diagonal and the second gives a decent ROC curve.

So, I am just not understanding what is going on here and I'm not sure how to proceed.

I'm not understanding .pred_class in classification (using logistic regression)

Answers (1)

Related Questions

I&#39;m not understanding .pred_class in classification (using logistic regression)

Answers (1)

Related Questions

I'm not understanding .pred_class in classification (using logistic regression)