How to simulate last_fit() using fit() in tidymodels?

Question

I would like to apply the random forest method to predict the wait time of patients at a hospital. I closely follow the instructions at https://www.tidymodels.org/start/case-study/ to tune my model. After obtaining my best model, I create the last_rf_workflow object as described in the link above.

last_rf_workflow <- 
  Data_rf_wflow %>% 
  update_model(last_rf_mod)

Then, I use the code below to fit the final model:

set.seed(345)
last_rf_fit <- 
  last_rf_workflow %>% 
  last_fit(data_split)

As described here the last_fit() function fits a model on the entire training set and calculates the predicted values for the testing set. These predictions can be accessed at collect_predictions(last_rf_fit).

However, when I fit the model to the entire training set and then use the predict() function, I get slightly different predictions:

set.seed(345)
last_rf_fit_2 <- 
  last_rf_workflow %>% 
  fit(training(data_split))

predict(last_rf_fit_2, testing(data_split))

I wonder if someone could help me understand why these two predictions are different. Thanks.

How to simulate last_fit() using fit() in tidymodels?

Answers (1)

Related Questions