Interface ProductionVariant
- All Superinterfaces:
software.amazon.jsii.JsiiSerializable
- All Known Implementing Classes:
ProductionVariant.Jsii$Proxy
@Generated(value="jsii-pacmak/1.116.0 (build 0eddcff)",
date="2025-10-24T13:34:48.330Z")
@Stability(Stable)
public interface ProductionVariant
extends software.amazon.jsii.JsiiSerializable
Identifies a model that you want to host and the resources to deploy for hosting it.
Example:
// The code below shows an example of how to instantiate this type.
// The values are placeholders you should change.
import software.amazon.awscdk.services.ec2.*;
import software.amazon.awscdk.services.stepfunctions.tasks.*;
AcceleratorType acceleratorType;
InstanceType instanceType;
ProductionVariant productionVariant = ProductionVariant.builder()
        .instanceType(instanceType)
        .modelName("modelName")
        .variantName("variantName")
        // the properties below are optional
        .acceleratorType(acceleratorType)
        .initialInstanceCount(123)
        .initialVariantWeight(123)
        .build();
-
Nested Class Summary
Nested Classes
Modifier and Type | Interface | Description
static final class | ProductionVariant.Builder | A builder for ProductionVariant
static final class | ProductionVariant.Jsii$Proxy | An implementation for ProductionVariant
-
Method Summary
Modifier and Type | Method | Description
static ProductionVariant.Builder | builder() |
default AcceleratorType | getAcceleratorType() | The size of the Elastic Inference (EI) instance to use for the production variant.
default Number | getInitialInstanceCount() | Number of instances to launch initially.
default Number | getInitialVariantWeight() | Determines initial traffic distribution among all of the models that you specify in the endpoint configuration.
InstanceType | getInstanceType() | The ML compute instance type.
String | getModelName() | The name of the model that you want to host.
String | getVariantName() | The name of the production variant.
Methods inherited from interface software.amazon.jsii.JsiiSerializable
$jsii$toJson
-
Method Details
-
getInstanceType
The ML compute instance type.
-
getModelName
The name of the model that you want to host.
This is the name that you specified when creating the model.
-
getVariantName
The name of the production variant.
-
getAcceleratorType
The size of the Elastic Inference (EI) instance to use for the production variant.
Default: - None
-
getInitialInstanceCount
Number of instances to launch initially.
Default: - 1
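Because this property is optional and typed as Number, calling code may want to resolve the documented default itself. The following is a minimal sketch, assuming the getter returns null when the property was not set on the builder (an assumption about the generated binding, not something this page states). The class and method names are hypothetical and not part of the CDK.

```java
// Hypothetical helper, not part of the AWS CDK API.
final class InstanceCountDefault {
    // Documented default for initialInstanceCount on this page.
    static final int DEFAULT_INITIAL_INSTANCE_COUNT = 1;

    /**
     * Returns the configured count, or the documented default of 1
     * when the value is absent (assumed to be represented as null).
     */
    static int effectiveInstanceCount(Number configured) {
        return configured == null
                ? DEFAULT_INITIAL_INSTANCE_COUNT
                : configured.intValue();
    }

    public static void main(String[] args) {
        System.out.println(effectiveInstanceCount(null)); // property left unset
        System.out.println(effectiveInstanceCount(4));    // property set to 4
    }
}
```

With a real variant, the argument would be the result of getInitialInstanceCount().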
-
getInitialVariantWeight
Determines initial traffic distribution among all of the models that you specify in the endpoint configuration.
Default: - 1.0
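To make the weight concrete, here is a small sketch of how weights could translate into traffic fractions. It assumes the SageMaker convention that each variant receives its weight divided by the sum of all variant weights in the endpoint configuration. The class, method, and variant names are hypothetical and only illustrate the arithmetic; they are not part of the CDK.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, not part of the AWS CDK API.
final class VariantTrafficShare {
    /** Returns each variant's traffic fraction: its weight / sum of all weights. */
    static Map<String, Double> shares(Map<String, Double> weights) {
        double total = 0.0;
        for (double w : weights.values()) {
            if (w < 0) {
                throw new IllegalArgumentException("weight must be non-negative");
            }
            total += w;
        }
        if (total == 0.0) {
            throw new IllegalArgumentException("at least one weight must be positive");
        }
        Map<String, Double> result = new LinkedHashMap<>();
        for (Map.Entry<String, Double> e : weights.entrySet()) {
            result.put(e.getKey(), e.getValue() / total);
        }
        return result;
    }

    public static void main(String[] args) {
        Map<String, Double> weights = new LinkedHashMap<>();
        weights.put("variantA", 1.0); // the documented default weight
        weights.put("variantB", 3.0);
        // Under the stated assumption, variantA gets 1/4 and variantB 3/4 of traffic.
        shares(weights).forEach((name, share) ->
                System.out.println(name + " -> " + share));
    }
}
```

Under this assumption, two variants that both keep the default weight of 1.0 would split traffic evenly.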
-
builder
- Returns:
- a ProductionVariant.Builder of ProductionVariant
-