interface ServerlessConfigProperty
| Language | Type name |
|---|---|
| .NET | Amazon.CDK.AWS.Sagemaker.CfnEndpointConfig.ServerlessConfigProperty |
| Go | github.com/aws/aws-cdk-go/awscdk/v2/awssagemaker#CfnEndpointConfig_ServerlessConfigProperty |
| Java | software.amazon.awscdk.services.sagemaker.CfnEndpointConfig.ServerlessConfigProperty |
| Python | aws_cdk.aws_sagemaker.CfnEndpointConfig.ServerlessConfigProperty |
| TypeScript | aws-cdk-lib » aws_sagemaker » CfnEndpointConfig » ServerlessConfigProperty |
Specifies the serverless configuration for an endpoint variant.
Example
// The code below shows an example of how to instantiate this type.
// The values are placeholders you should change.
import { aws_sagemaker as sagemaker } from 'aws-cdk-lib';
const serverlessConfigProperty: sagemaker.CfnEndpointConfig.ServerlessConfigProperty = {
  maxConcurrency: 123,
  memorySizeInMb: 123,
  // the properties below are optional
  provisionedConcurrency: 123,
};
Properties
| Name | Type | Description |
|---|---|---|
| maxConcurrency | number | The maximum number of concurrent invocations your serverless endpoint can process. |
| memorySizeInMb | number | The memory size of your serverless endpoint. |
| provisionedConcurrency? | number | The amount of provisioned concurrency to allocate for the serverless endpoint. |
maxConcurrency
Type: number
The maximum number of concurrent invocations your serverless endpoint can process.
memorySizeInMb
Type: number
The memory size of your serverless endpoint.
Valid values are in 1 GB increments: 1024 MB, 2048 MB, 3072 MB, 4096 MB, 5120 MB, or 6144 MB.
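The constraint above can be checked before synthesis. The following is a minimal sketch, not part of aws-cdk-lib: the constant `VALID_MEMORY_SIZES_MB` and the helper `isValidMemorySizeInMb` are hypothetical names introduced here, and the allowed values are copied from the list above.

```typescript
// Allowed values copied from the description above: 1 GB steps from 1024 MB to 6144 MB.
const VALID_MEMORY_SIZES_MB: readonly number[] = [1024, 2048, 3072, 4096, 5120, 6144];

// Hypothetical helper (not provided by aws-cdk-lib): returns true only when
// the value is one of the documented memory sizes.
function isValidMemorySizeInMb(value: number): boolean {
  return VALID_MEMORY_SIZES_MB.indexOf(value) !== -1;
}
```

A caller could run this check on `memorySizeInMb` before building the property object, so that an unsupported value such as 1500 is caught locally and not at deployment time.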
provisionedConcurrency?
Type: number (optional)
The amount of provisioned concurrency to allocate for the serverless endpoint.
Should be less than or equal to MaxConcurrency.
This field is not supported for serverless endpoint recommendations for Inference Recommender jobs. For more information about creating an Inference Recommender job, see CreateInferenceRecommendationsJobs.
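The relationship between the optional provisioned concurrency and the maximum concurrency can be sketched as a small pre-flight check. This is an illustration under stated assumptions: `ServerlessConfigLike` is a local stand-in that mirrors the properties documented on this page (so the snippet has no dependencies), and `checkProvisionedConcurrency` is a hypothetical helper, not an aws-cdk-lib API.

```typescript
// Local stand-in mirroring the properties documented on this page.
interface ServerlessConfigLike {
  maxConcurrency: number;
  memorySizeInMb: number;
  provisionedConcurrency?: number;
}

// Hypothetical helper: returns a list of problems, empty when the
// provisioned concurrency is absent or does not exceed maxConcurrency.
function checkProvisionedConcurrency(config: ServerlessConfigLike): string[] {
  const problems: string[] = [];
  const provisioned = config.provisionedConcurrency;
  if (provisioned !== undefined && provisioned > config.maxConcurrency) {
    problems.push(
      "provisionedConcurrency (" + provisioned +
      ") exceeds maxConcurrency (" + config.maxConcurrency + ")"
    );
  }
  return problems;
}
```

Returning a list of messages, not throwing, lets a caller collect several configuration problems and report them together.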
