/AWS1/CL_SGMSTOPPINGCONDITION
Specifies a limit to how long a job can run. When the job reaches the time limit, SageMaker ends the job. Use this API to cap costs.
To stop a training job, SageMaker sends the algorithm the SIGTERM signal, which delays job termination for 120 seconds. Algorithms can use this 120-second window to save the model artifacts, so the results of training are not lost.
The training algorithms provided by SageMaker automatically save the intermediate results of a model training job when possible. This attempt to save artifacts is best effort only, because the model might not be in a state from which it can be saved. For example, if training has just started, the model might not be ready to save. When saved, this intermediate data is a valid model artifact. You can use it to create a model with `CreateModel`.
The Neural Topic Model (NTM) currently does not support saving intermediate model artifacts. When training NTMs, make sure that the maximum runtime is sufficient for the training job to complete.
CONSTRUCTOR
IMPORTING
Optional arguments:
iv_maxruntimeinseconds TYPE /AWS1/SGMMAXRUNTIMEINSECONDS
The maximum length of time, in seconds, that a training or compilation job can run before it is stopped.
For compilation jobs, if the job does not complete during this time, a `TimeOut` error is generated. We recommend starting with 900 seconds and increasing as necessary based on your model.
For all other jobs, if the job does not complete during this time, SageMaker ends the job. When `RetryStrategy` is specified in the job request, `MaxRuntimeInSeconds` specifies the maximum time for all of the attempts in total, not each individual attempt. The default value is 1 day. The maximum value is 28 days.
The maximum time that a `TrainingJob` can run in total, including any time spent publishing metrics or archiving and uploading models after it has been stopped, is 30 days.
iv_maxwaittimeinseconds TYPE /AWS1/SGMMAXWAITTIMEINSECONDS
The maximum length of time, in seconds, that a managed Spot training job has to complete. It is the amount of time spent waiting for Spot capacity plus the amount of time the job can run. It must be equal to or greater than `MaxRuntimeInSeconds`. If the job does not complete during this time, SageMaker ends the job.
When `RetryStrategy` is specified in the job request, `MaxWaitTimeInSeconds` specifies the maximum time for all of the attempts in total, not each individual attempt.
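The relationship between the two limits can be sketched as follows. This is a minimal illustration, not a definitive implementation: the variable names and the values 3600 and 7200 are assumptions chosen for the example, and the local check simply mirrors the rule that `MaxWaitTimeInSeconds` must be equal to or greater than `MaxRuntimeInSeconds`.

```abap
" Hypothetical values: up to 1 hour of training inside a 2-hour overall window.
DATA lv_max_runtime TYPE /aws1/sgmmaxruntimeinseconds VALUE 3600.
DATA lv_max_wait    TYPE /aws1/sgmmaxwaittimeinseconds VALUE 7200.

" MaxWaitTimeInSeconds must be equal to or greater than MaxRuntimeInSeconds,
" so raise the wait time locally if it falls below the runtime limit.
IF lv_max_wait < lv_max_runtime.
  lv_max_wait = lv_max_runtime.
ENDIF.

" Build the stopping condition for a managed Spot training job.
DATA(lo_stopping_condition) = NEW /aws1/cl_sgmstoppingcondition(
  iv_maxruntimeinseconds  = lv_max_runtime
  iv_maxwaittimeinseconds = lv_max_wait ).
```

Because every constructor argument is optional, a job that does not use managed Spot training can omit `iv_maxwaittimeinseconds`.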
iv_maxpendingtimeinseconds TYPE /AWS1/SGMMAXPENDINGTIMEINSECS
The maximum length of time, in seconds, that a training or compilation job can be pending before it is stopped.
When working with training jobs that use capacity from training plans, not all `Pending` job states count against the `MaxPendingTimeInSeconds` limit. The following scenarios do not increment the `MaxPendingTimeInSeconds` counter:

- The plan is in a `Scheduled` state: Jobs queued (in `Pending` status) before a plan's start date (waiting for scheduled start time)
- Between capacity reservations: Jobs temporarily back to `Pending` status between two capacity reservation periods

`MaxPendingTimeInSeconds` only increments when jobs are actively waiting for capacity in an `Active` plan.
Queryable Attributes
MaxRuntimeInSeconds
The maximum length of time, in seconds, that a training or compilation job can run before it is stopped.
For compilation jobs, if the job does not complete during this time, a `TimeOut` error is generated. We recommend starting with 900 seconds and increasing as necessary based on your model.
For all other jobs, if the job does not complete during this time, SageMaker ends the job. When `RetryStrategy` is specified in the job request, `MaxRuntimeInSeconds` specifies the maximum time for all of the attempts in total, not each individual attempt. The default value is 1 day. The maximum value is 28 days.
The maximum time that a `TrainingJob` can run in total, including any time spent publishing metrics or archiving and uploading models after it has been stopped, is 30 days.
Accessible with the following methods
| Method | Description |
|---|---|
| `GET_MAXRUNTIMEINSECONDS()` | Getter for MAXRUNTIMEINSECONDS, with configurable default |
| `ASK_MAXRUNTIMEINSECONDS()` | Getter for MAXRUNTIMEINSECONDS w/ exceptions if field has no value |
| `HAS_MAXRUNTIMEINSECONDS()` | Determine if MAXRUNTIMEINSECONDS has a value |
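A minimal sketch of how the `HAS_` and `GET_` methods might be combined. The variable names and the value 3600 are illustrative assumptions, and the `WRITE` statements assume the code runs in a classic report.

```abap
" Hypothetical stopping condition with only the runtime limit set.
DATA(lo_stopping_condition) = NEW /aws1/cl_sgmstoppingcondition(
  iv_maxruntimeinseconds = 3600 ).

" Check whether the field has a value before reading it.
IF lo_stopping_condition->has_maxruntimeinseconds( ) = abap_true.
  DATA(lv_max_runtime) = lo_stopping_condition->get_maxruntimeinseconds( ).
  WRITE: / |MaxRuntimeInSeconds: { lv_max_runtime }|.
ELSE.
  WRITE: / 'MaxRuntimeInSeconds is not set.'.
ENDIF.
```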
MaxWaitTimeInSeconds
The maximum length of time, in seconds, that a managed Spot training job has to complete. It is the amount of time spent waiting for Spot capacity plus the amount of time the job can run. It must be equal to or greater than `MaxRuntimeInSeconds`. If the job does not complete during this time, SageMaker ends the job.
When `RetryStrategy` is specified in the job request, `MaxWaitTimeInSeconds` specifies the maximum time for all of the attempts in total, not each individual attempt.
Accessible with the following methods
| Method | Description |
|---|---|
| `GET_MAXWAITTIMEINSECONDS()` | Getter for MAXWAITTIMEINSECONDS, with configurable default |
| `ASK_MAXWAITTIMEINSECONDS()` | Getter for MAXWAITTIMEINSECONDS w/ exceptions if field has no value |
| `HAS_MAXWAITTIMEINSECONDS()` | Determine if MAXWAITTIMEINSECONDS has a value |
MaxPendingTimeInSeconds
The maximum length of time, in seconds, that a training or compilation job can be pending before it is stopped.
When working with training jobs that use capacity from training plans, not all `Pending` job states count against the `MaxPendingTimeInSeconds` limit. The following scenarios do not increment the `MaxPendingTimeInSeconds` counter:

- The plan is in a `Scheduled` state: Jobs queued (in `Pending` status) before a plan's start date (waiting for scheduled start time)
- Between capacity reservations: Jobs temporarily back to `Pending` status between two capacity reservation periods

`MaxPendingTimeInSeconds` only increments when jobs are actively waiting for capacity in an `Active` plan.
Accessible with the following methods
| Method | Description |
|---|---|
| `GET_MAXPENDINGTIMEINSECONDS()` | Getter for MAXPENDINGTIMEINSECONDS, with configurable default |
| `ASK_MAXPENDINGTIMEINSECONDS()` | Getter for MAXPENDINGTIMEINSECONDS w/ exceptions if field has no value |
| `HAS_MAXPENDINGTIMEINSECONDS()` | Determine if MAXPENDINGTIMEINSECONDS has a value |
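The `ASK_` variant can be sketched as below for a field that was never set. This page does not name the exception class that `ASK_MAXPENDINGTIMEINSECONDS()` raises, so the sketch catches the generic `cx_root` as an assumption; a real program would normally catch the more specific class. The variable names and the `WRITE` output (classic report) are also illustrative.

```abap
" Hypothetical stopping condition: the pending-time limit is deliberately not supplied.
DATA(lo_stopping_condition) = NEW /aws1/cl_sgmstoppingcondition(
  iv_maxruntimeinseconds = 3600 ).

TRY.
    " ASK_ is documented to raise an exception if the field has no value.
    DATA(lv_max_pending) = lo_stopping_condition->ask_maxpendingtimeinseconds( ).
    WRITE: / |MaxPendingTimeInSeconds: { lv_max_pending }|.
  CATCH cx_root INTO DATA(lo_exception).
    " Assumption: the generic root class is caught because the specific class is not listed here.
    WRITE: / lo_exception->get_text( ).
ENDTRY.
```

Using `ASK_` suits code paths where a missing value is an error. Where a missing value is expected, checking with `HAS_MAXPENDINGTIMEINSECONDS()` first avoids the exception handling.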