CapacityLimit
Defines the capacity limit for a service environment. This structure specifies the maximum amount of resources that can be used by service jobs in the environment.
Contents
- capacityUnit
-
The unit of measure for the capacity limit. This defines how the maxCapacity value should be interpreted. For
SAGEMAKER_TRAINING
jobs, useNUM_INSTANCES
.Type: String
Required: No
- maxCapacity
-
The maximum capacity available for the service environment. This value represents the maximum amount of resources that can be allocated to service jobs.
For example,
maxCapacity=50
,capacityUnit=NUM_INSTANCES
. This indicates that the maximum number of instances that can be run on this service environment is 50. You could then run 5 SageMaker Training jobs that each use 10 instances. However, if you submit another job that requires 10 instances, it will wait in the queue.Type: Integer
Required: No
See Also
For more information about using this API in one of the language-specific AWS SDKs, see the following: