/AWS1/CL_SGMROLLINGUPDATEPLY¶
Specifies a rolling deployment strategy for updating a SageMaker endpoint.
CONSTRUCTOR
¶
IMPORTING¶
Required arguments:¶
io_maximumbatchsize
TYPE REF TO /AWS1/CL_SGMCAPACITYSIZE
/AWS1/CL_SGMCAPACITYSIZE
¶
Batch size for each rolling step to provision capacity and turn on traffic on the new endpoint fleet, and terminate capacity on the old endpoint fleet. Value must be between 5% to 50% of the variant's total instance count.
iv_waitintervalinseconds
TYPE /AWS1/SGMWAITINTERVALINSECONDS
/AWS1/SGMWAITINTERVALINSECONDS
¶
The length of the baking period, during which SageMaker monitors alarms for each batch on the new fleet.
Optional arguments:¶
iv_maxexectimeoutinseconds
TYPE /AWS1/SGMMAXEXECTMOUTINSECONDS
/AWS1/SGMMAXEXECTMOUTINSECONDS
¶
The time limit for the total deployment. Exceeding this limit causes a timeout.
io_rollbackmaximumbatchsize
TYPE REF TO /AWS1/CL_SGMCAPACITYSIZE
/AWS1/CL_SGMCAPACITYSIZE
¶
Batch size for rollback to the old endpoint fleet. Each rolling step to provision capacity and turn on traffic on the old endpoint fleet, and terminate capacity on the new endpoint fleet. If this field is absent, the default value will be set to 100% of total capacity which means to bring up the whole capacity of the old fleet at once during rollback.
Queryable Attributes¶
MaximumBatchSize¶
Batch size for each rolling step to provision capacity and turn on traffic on the new endpoint fleet, and terminate capacity on the old endpoint fleet. Value must be between 5% to 50% of the variant's total instance count.
Accessible with the following methods¶
Method | Description |
---|---|
GET_MAXIMUMBATCHSIZE() |
Getter for MAXIMUMBATCHSIZE |
WaitIntervalInSeconds¶
The length of the baking period, during which SageMaker monitors alarms for each batch on the new fleet.
Accessible with the following methods¶
Method | Description |
---|---|
GET_WAITINTERVALINSECONDS() |
Getter for WAITINTERVALINSECONDS, with configurable default |
ASK_WAITINTERVALINSECONDS() |
Getter for WAITINTERVALINSECONDS w/ exceptions if field has |
HAS_WAITINTERVALINSECONDS() |
Determine if WAITINTERVALINSECONDS has a value |
MaximumExecutionTimeoutInSeconds¶
The time limit for the total deployment. Exceeding this limit causes a timeout.
Accessible with the following methods¶
Method | Description |
---|---|
GET_MAXEXECTIMEOUTINSECONDS() |
Getter for MAXIMUMEXECTIMEOUTINSECONDS, with configurable de |
ASK_MAXEXECTIMEOUTINSECONDS() |
Getter for MAXIMUMEXECTIMEOUTINSECONDS w/ exceptions if fiel |
HAS_MAXEXECTIMEOUTINSECONDS() |
Determine if MAXIMUMEXECTIMEOUTINSECONDS has a value |
RollbackMaximumBatchSize¶
Batch size for rollback to the old endpoint fleet. Each rolling step to provision capacity and turn on traffic on the old endpoint fleet, and terminate capacity on the new endpoint fleet. If this field is absent, the default value will be set to 100% of total capacity which means to bring up the whole capacity of the old fleet at once during rollback.
Accessible with the following methods¶
Method | Description |
---|---|
GET_ROLLBACKMAXIMUMBATCHSIZE() |
Getter for ROLLBACKMAXIMUMBATCHSIZE |