interface ServerlessConfigProperty
| Language | Type name |
|---|---|
.NET | Amazon.CDK.Mixins.Preview.AWS.SageMaker.Mixins.CfnEndpointConfigPropsMixin.ServerlessConfigProperty |
Go | github.com/aws/aws-cdk-go/awscdkmixinspreview/v2/awssagemaker/mixins#CfnEndpointConfigPropsMixin_ServerlessConfigProperty |
Java | software.amazon.awscdk.mixins.preview.services.sagemaker.mixins.CfnEndpointConfigPropsMixin.ServerlessConfigProperty |
Python | aws_cdk.mixins_preview.aws_sagemaker.mixins.CfnEndpointConfigPropsMixin.ServerlessConfigProperty |
TypeScript | @aws-cdk/mixins-preview » aws_sagemaker » mixins » CfnEndpointConfigPropsMixin » ServerlessConfigProperty |
Specifies the serverless configuration for an endpoint variant.
Example
// The code below shows an example of how to instantiate this type.
// The values are placeholders you should change.
import { mixins as sagemaker_mixins } from '@aws-cdk/mixins-preview/aws-sagemaker';
const serverlessConfigProperty: sagemaker_mixins.CfnEndpointConfigPropsMixin.ServerlessConfigProperty = {
maxConcurrency: 123,
memorySizeInMb: 123,
provisionedConcurrency: 123,
};
Properties
| Name | Type | Description |
|---|---|---|
| max | number | The maximum number of concurrent invocations your serverless endpoint can process. |
| memory | number | The memory size of your serverless endpoint. |
| provisioned | number | The amount of provisioned concurrency to allocate for the serverless endpoint. |
maxConcurrency?
Type:
number
(optional)
The maximum number of concurrent invocations your serverless endpoint can process.
memorySizeInMb?
Type:
number
(optional)
The memory size of your serverless endpoint.
Valid values are in 1 GB increments: 1024 MB, 2048 MB, 3072 MB, 4096 MB, 5120 MB, or 6144 MB.
provisionedConcurrency?
Type:
number
(optional)
The amount of provisioned concurrency to allocate for the serverless endpoint.
Should be less than or equal to MaxConcurrency .
This field is not supported for serverless endpoint recommendations for Inference Recommender jobs. For more information about creating an Inference Recommender job, see CreateInferenceRecommendationsJobs .

.NET
Go
Java
Python
TypeScript