Logistic Regression: how to compare predicted value with a threshold and get the classification done

Question

I have this Credit Default dataset with head like this:

default student balance      income        default_Yes

No      No      729.526495   44361.625074   0 

No      Yes     817.180407   12106.134700   0 

No      No      1073.549164  31767.138947   0 

No      No      529.250605   35704.493935   0 

No      No      785.655883   38463.495879   0

I am trying to perform logistic regression for 'default_Yes' based on the 'balance' attribute and used the following function:

 from sklearn.cross_validation import train_test_split
 from sklearn import metrics
 X = cred_def[['balance']]
 Y = cred_def['default_Yes']
 X_train, X_test,Y_train,Y_test = train_test_split(X,Y,test_size=0.3,random_state=76)
 logist = LogisticRegression()
 logist.fit(X_train,Y_train)
 y_pred = logist.predict(X_test)


 def model(threshold):
     def_thresh = np.greater(y_pred, threshold).astype(int)
     acc_score = metrics.accuracy_score(Y_test, def_thresh)
     print(acc_score)
     plt.scatter(X_test.values,Y_test.values)
     plt.scatter(X_test.values,def_thresh)
     conf = metrics.confusion_matrix(Y_test, y_pred)
     print(conf)

The problem I am facing is: no matter what value of threshold I am passing to the function 'model', it's producing same output and not considering the value passed.

Logistic Regression: how to compare predicted value with a threshold and get the classification done

Answers (1)

Related Questions