Har

Reputation: 31

How does predict_proba work with cross-validation?

When using 5-fold cross-validation, 5 different models are trained. The selection of the final model can vary:

  1. the best model (by some criterion) out of the 5 fold-specific models, or
  2. the model created by training on the whole dataset.

I understand that cross-validation is used for model checking, not for model building. So when predict_proba is called on the model, how is this probability defined? Could you share some papers or articles that explain how prediction works with cross-validation in caret in R and in sklearn in Python?

Upvotes: 0

Views: 441

Answers (1)

Matt Hall

Reputation: 8152

The docs for sklearn.model_selection.cross_val_predict make it clear that you can specify the prediction method with the method argument, e.g. method='predict_proba'.

If you do this, it simply calls that method internally instead of predict. The result for each sample is the prediction made by the model for whose fold that sample was in the validation set, i.e. an out-of-fold estimate of the target.
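A minimal sketch of this, using a synthetic dataset and a logistic regression as illustrative assumptions (not from the answer itself):

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_predict

# Synthetic binary-classification data for illustration.
X, y = make_classification(n_samples=100, random_state=0)
clf = LogisticRegression(max_iter=1000)

# Each row of `proba` is produced by the fold model that held
# that row out as validation data, so these are out-of-fold
# probability estimates, not predictions from one final model.
proba = cross_val_predict(clf, X, y, cv=5, method='predict_proba')
print(proba.shape)  # one row per sample, one column per class
```

Note that no single fitted model is returned here; `cross_val_predict` only stitches together the held-out predictions from the 5 fold models.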

For what it's worth, I would not select the model from the best-scoring CV fold. Train the final model on all your data.
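That pattern might look like the following sketch, where cross-validation scores the model and the final fit uses all the data (dataset and estimator are illustrative assumptions):

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

X, y = make_classification(n_samples=100, random_state=0)
clf = LogisticRegression(max_iter=1000)

# Cross-validation estimates generalization performance only.
scores = cross_val_score(clf, X, y, cv=5)

# The final model is then trained on the whole dataset,
# not taken from the best-scoring fold.
clf.fit(X, y)
proba = clf.predict_proba(X[:1])
```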

Upvotes: 0
