interface InferenceComponentCapacitySizeProperty
| Language | Type name |
|---|---|
.NET | Amazon.CDK.Mixins.Preview.AWS.SageMaker.Mixins.CfnInferenceComponentPropsMixin.InferenceComponentCapacitySizeProperty |
Go | github.com/aws/aws-cdk-go/awscdkmixinspreview/v2/awssagemaker/mixins#CfnInferenceComponentPropsMixin_InferenceComponentCapacitySizeProperty |
Java | software.amazon.awscdk.mixins.preview.services.sagemaker.mixins.CfnInferenceComponentPropsMixin.InferenceComponentCapacitySizeProperty |
Python | aws_cdk.mixins_preview.aws_sagemaker.mixins.CfnInferenceComponentPropsMixin.InferenceComponentCapacitySizeProperty |
TypeScript | @aws-cdk/mixins-preview » aws_sagemaker » mixins » CfnInferenceComponentPropsMixin » InferenceComponentCapacitySizeProperty |
Specifies the type and size of the endpoint capacity to activate for a rolling deployment or a rollback strategy.
You can specify your batches as either of the following:
- A count of inference component copies
- The overall percentage or your fleet
For a rollback strategy, if you don't specify the fields in this object, or if you set the Value parameter to 100%, then SageMaker AI uses a blue/green rollback strategy and rolls all traffic back to the blue fleet.
Example
// The code below shows an example of how to instantiate this type.
// The values are placeholders you should change.
import { mixins as sagemaker_mixins } from '@aws-cdk/mixins-preview/aws-sagemaker';
const inferenceComponentCapacitySizeProperty: sagemaker_mixins.CfnInferenceComponentPropsMixin.InferenceComponentCapacitySizeProperty = {
type: 'type',
value: 123,
};
Properties
| Name | Type | Description |
|---|---|---|
| type? | string | Specifies the endpoint capacity type. |
| value? | number | Defines the capacity size, either as a number of inference component copies or a capacity percentage. |
type?
Type:
string
(optional)
Specifies the endpoint capacity type.
- COPY_COUNT - The endpoint activates based on the number of inference component copies.
- CAPACITY_PERCENT - The endpoint activates based on the specified percentage of capacity.
value?
Type:
number
(optional)
Defines the capacity size, either as a number of inference component copies or a capacity percentage.

.NET
Go
Java
Python
TypeScript