Interface InvocationsScalingProps
- All Superinterfaces:
BaseTargetTrackingProps,software.amazon.jsii.JsiiSerializable
- All Known Implementing Classes:
InvocationsScalingProps.Jsii$Proxy
@Generated(value="jsii-pacmak/1.116.0 (build 0eddcff)",
date="2025-10-24T13:34:50.435Z")
@Stability(Experimental)
public interface InvocationsScalingProps
extends software.amazon.jsii.JsiiSerializable, BaseTargetTrackingProps
(experimental) Properties for enabling SageMaker Endpoint utilization tracking.
Example:
import software.amazon.awscdk.services.sagemaker.alpha.*;
Model model;
String variantName = "my-variant";
EndpointConfig endpointConfig = EndpointConfig.Builder.create(this, "EndpointConfig")
.instanceProductionVariants(List.of(InstanceProductionVariantProps.builder()
.model(model)
.variantName(variantName)
.build()))
.build();
Endpoint endpoint = Endpoint.Builder.create(this, "Endpoint").endpointConfig(endpointConfig).build();
IEndpointInstanceProductionVariant productionVariant = endpoint.findInstanceProductionVariant(variantName);
ScalableInstanceCount instanceCount = productionVariant.autoScaleInstanceCount(EnableScalingProps.builder()
.maxCapacity(3)
.build());
instanceCount.scaleOnInvocations("LimitRPS", InvocationsScalingProps.builder()
.maxRequestsPerSecond(30)
.build());
-
Nested Class Summary
Nested ClassesModifier and TypeInterfaceDescriptionstatic final classA builder forInvocationsScalingPropsstatic final classAn implementation forInvocationsScalingProps -
Method Summary
Methods inherited from interface software.amazon.awscdk.services.applicationautoscaling.BaseTargetTrackingProps
getDisableScaleIn, getPolicyName, getScaleInCooldown, getScaleOutCooldownMethods inherited from interface software.amazon.jsii.JsiiSerializable
$jsii$toJson
-
Method Details
-
getMaxRequestsPerSecond
(experimental) Max RPS per instance used for calculating the target SageMaker variant invocation per instance.More documentation available here: https://docs.aws.amazon.com/sagemaker/latest/dg/endpoint-scaling-loadtest.html
-
getSafetyFactor
(experimental) Safety factor for calculating the target SageMaker variant invocation per instance.More documentation available here: https://docs.aws.amazon.com/sagemaker/latest/dg/endpoint-scaling-loadtest.html
Default: 0.5
-
builder
- Returns:
- a
InvocationsScalingProps.BuilderofInvocationsScalingProps
-