King Dedede
King Dedede

Reputation: 1010

How can I resume a training job in Sagemaker script mode?

In non-script mode for Sagemaker training jobs with Tensorflow, I was able to specify a checkpoint path in S3 with checkpoint_path.

However, in script mode this parameter is disabled.

How can I start from most recent checkpoint for a Sagemaker Tensorflow training job in Script mode?

Upvotes: 1

Views: 379

Answers (1)

lauren
lauren

Reputation: 513

with script mode, the parameter you're looking for is model_dir

docs: https://sagemaker.readthedocs.io/en/stable/using_tf.html#adapting-your-local-tensorflow-script

Upvotes: 1

Related Questions