本文為英文版的機器翻譯版本,如內容有任何歧義或不一致之處,概以英文版為準。
使用 CloudFormation 建立擴展政策
下列範例示範如何使用 在端點上設定模型自動擴展 CloudFormation。
Endpoint: Type: "AWS::SageMaker::Endpoint" Properties: EndpointName:yourEndpointNameEndpointConfigName:yourEndpointConfigNameScalingTarget: Type: "AWS::ApplicationAutoScaling::ScalableTarget" Properties: MaxCapacity:10MinCapacity:2ResourceId: endpoint/my-endpoint/variant/my-variantRoleARN:arnScalableDimension: sagemaker:variant:DesiredInstanceCount ServiceNamespace: sagemaker ScalingPolicy: Type: "AWS::ApplicationAutoScaling::ScalingPolicy" Properties: PolicyName:my-scaling-policyPolicyType: TargetTrackingScaling ScalingTargetId: Ref: ScalingTarget TargetTrackingScalingPolicyConfiguration: TargetValue:70.0ScaleInCooldown:600ScaleOutCooldown:30PredefinedMetricSpecification: PredefinedMetricType: SageMakerVariantInvocationsPerInstance
如需詳細資訊,請參閱《Application Auto Scaling 使用者指南》中的使用 建立 Application Auto Scaling 資源 AWS CloudFormation。 Auto Scaling