Reputation: 511
I am trying to run a training job using sagemaker sdk.
I set the base_job_name
as base-job-name
and model_dir as s3://my-bucket/model-output/
, The trained model, however, is at s3://my-bucket/model-output/base-job-name-2020-10-12-21-30-42-748/output
.
Can I do something to remove the date-time part from the base-job-name
folder? It is perfectly fine to overwrite files as a result.
I couldn't seem to locate any property in the documentation which can help me set this.
This is how I am creating the estimator
estimator = TensorFlow(
base_job_name='base-job-name',
entry_point='model.py',
source_dir=source_dir,
output_path='s3://my-bucket/model-output/',
model_dir='s3://my-bucket/model-output/',
instance_type='ml.m5.large',
instance_count=1,
role=my_role,
framework_version='2.2.0',
py_version='py37',
subnets=subnets,
security_group_ids=security_group_ids,
sagemaker_session=sagemaker_sess,
tags=tags
)
Upvotes: 1
Views: 2836
Reputation: 301
You are not able to remove the date and time stamp from the output name. The reason for this is because if you run the estimator code, and then call the .fit()
function upon it more than 1 time, you will be overwriting the output model data, event data, etc.
Upvotes: 1