Interface CfnInferenceComponent.InferenceComponentCapacitySizeProperty
- All Superinterfaces:
software.amazon.jsii.JsiiSerializable
- All Known Implementing Classes:
CfnInferenceComponent.InferenceComponentCapacitySizeProperty.Jsii$Proxy
- Enclosing class:
CfnInferenceComponent
@Stability(Stable)
public static interface CfnInferenceComponent.InferenceComponentCapacitySizeProperty
extends software.amazon.jsii.JsiiSerializable
Specifies the type and size of the endpoint capacity to activate for a rolling deployment or a rollback strategy.
You can specify your batches as either of the following:
- A count of inference component copies
- The overall percentage or your fleet
For a rollback strategy, if you don't specify the fields in this object, or if you set the Value parameter to 100%, then SageMaker AI uses a blue/green rollback strategy and rolls all traffic back to the blue fleet.
Example:
// The code below shows an example of how to instantiate this type.
// The values are placeholders you should change.
import software.amazon.awscdk.services.sagemaker.*;
InferenceComponentCapacitySizeProperty inferenceComponentCapacitySizeProperty = InferenceComponentCapacitySizeProperty.builder()
.type("type")
.value(123)
.build();
- See Also:
-
Nested Class Summary
Nested ClassesModifier and TypeInterfaceDescriptionstatic final classstatic final classAn implementation forCfnInferenceComponent.InferenceComponentCapacitySizeProperty -
Method Summary
Methods inherited from interface software.amazon.jsii.JsiiSerializable
$jsii$toJson
-
Method Details
-
getType
Specifies the endpoint capacity type.- COPY_COUNT - The endpoint activates based on the number of inference component copies.
- CAPACITY_PERCENT - The endpoint activates based on the specified percentage of capacity.
- See Also:
-
getValue
Defines the capacity size, either as a number of inference component copies or a capacity percentage.- See Also:
-
builder
@Stability(Stable) static CfnInferenceComponent.InferenceComponentCapacitySizeProperty.Builder builder()
-