RecommendationJobStoppingConditions
Specifies conditions for stopping a job. When a job reaches a stopping condition limit, SageMaker ends the job.
Contents
- FlatInvocations
-
Stops a load test when the number of invocations (TPS) peaks and flattens, which means that the instance has reached capacity. The default value is Stop. If you want the load test to continue after invocations have flattened, set the value to Continue.
Type: String
Valid Values: Continue | Stop
Required: No
- MaxInvocations
-
The maximum number of requests per minute expected for the endpoint.
Type: Integer
Required: No
- ModelLatencyThresholds
-
The interval of time taken by a model to respond as viewed from SageMaker. The interval includes the local communication time taken to send the request and to fetch the response from the container of a model and the time taken to complete the inference in the container.
Type: Array of ModelLatencyThreshold objects
Array Members: Fixed number of 1 item.
Required: No
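The three members above can be assembled into a single request structure. The following is a minimal sketch in Python, not an official SDK sample: `build_stopping_conditions` is a hypothetical helper introduced here for illustration, and the `Percentile` and `ValueInMilliseconds` keys are assumed from the ModelLatencyThreshold type, which this page does not describe. The numeric values are arbitrary examples.

```python
def build_stopping_conditions(max_invocations=None,
                              latency_threshold=None,
                              flat_invocations=None):
    """Build a RecommendationJobStoppingConditions dict (hypothetical helper).

    Every member is optional (Required: No), so unset members are omitted.
    """
    conditions = {}

    if flat_invocations is not None:
        # Valid Values: Continue | Stop
        if flat_invocations not in ("Continue", "Stop"):
            raise ValueError("FlatInvocations must be 'Continue' or 'Stop'")
        conditions["FlatInvocations"] = flat_invocations

    if max_invocations is not None:
        # Type: Integer (maximum requests per minute expected for the endpoint)
        conditions["MaxInvocations"] = int(max_invocations)

    if latency_threshold is not None:
        # Array Members: fixed number of 1 item, so wrap exactly one object.
        conditions["ModelLatencyThresholds"] = [dict(latency_threshold)]

    return conditions


# Example values are illustrative only. The threshold keys are an assumption
# based on the ModelLatencyThreshold type.
stopping_conditions = build_stopping_conditions(
    max_invocations=300,
    latency_threshold={"Percentile": "P95", "ValueInMilliseconds": 100},
    flat_invocations="Continue",
)
print(stopping_conditions)
```

A dict of this shape is what an SDK call would take wherever the API expects a RecommendationJobStoppingConditions object. Omitting `flat_invocations` leaves the member unset, in which case the documented default of Stop applies.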
See Also
For more information about using this API in one of the language-specific AWS SDKs, see the following: