interface AsyncInferenceClientConfigProperty
| Language | Type name |
|---|---|
| .NET | Amazon.CDK.AWS.Sagemaker.CfnEndpointConfig.AsyncInferenceClientConfigProperty |
| Go | github.com/aws/aws-cdk-go/awscdk/v2/awssagemaker#CfnEndpointConfig_AsyncInferenceClientConfigProperty |
| Java | software.amazon.awscdk.services.sagemaker.CfnEndpointConfig.AsyncInferenceClientConfigProperty |
| Python | aws_cdk.aws_sagemaker.CfnEndpointConfig.AsyncInferenceClientConfigProperty |
| TypeScript | aws-cdk-lib » aws_sagemaker » CfnEndpointConfig » AsyncInferenceClientConfigProperty |
Configures the behavior of the client used by SageMaker to interact with the model container during asynchronous inference.
Example
// The code below shows an example of how to instantiate this type.
// The values are placeholders you should change.
import { aws_sagemaker as sagemaker } from 'aws-cdk-lib';
const asyncInferenceClientConfigProperty: sagemaker.CfnEndpointConfig.AsyncInferenceClientConfigProperty = {
  maxConcurrentInvocationsPerInstance: 123,
};
Properties
| Name | Type | Description |
|---|---|---|
| maxConcurrentInvocationsPerInstance? | number | The maximum number of concurrent requests sent by the SageMaker client to the model container. |
maxConcurrentInvocationsPerInstance?
Type: number (optional)
The maximum number of concurrent requests sent by the SageMaker client to the model container.
If no value is provided, SageMaker will choose an optimal value for you.
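Because the property is optional, one way to handle it is to omit it entirely when no limit is wanted, so that SageMaker chooses the value. The sketch below illustrates that pattern. It assumes a local stand-in interface (`AsyncInferenceClientConfig`) with the same shape as the CDK type, so that it runs without `aws-cdk-lib` installed, and a hypothetical helper `buildClientConfig` that is not part of the CDK. The positive-integer check is also an assumption; consult the SageMaker API reference for the exact accepted range.

```typescript
// Local stand-in mirroring the shape of
// sagemaker.CfnEndpointConfig.AsyncInferenceClientConfigProperty.
// It exists only so this sketch runs without aws-cdk-lib installed.
interface AsyncInferenceClientConfig {
  readonly maxConcurrentInvocationsPerInstance?: number;
}

// Hypothetical helper (not part of the CDK): builds the client config and
// leaves the property out when no limit is requested, so that SageMaker
// chooses the value itself.
function buildClientConfig(maxConcurrent?: number): AsyncInferenceClientConfig {
  if (maxConcurrent === undefined) {
    return {};
  }
  // Assumption: a concurrency limit is only meaningful as a positive integer.
  if (!Number.isInteger(maxConcurrent) || maxConcurrent < 1) {
    throw new RangeError(
      `maxConcurrentInvocationsPerInstance must be a positive integer, got ${maxConcurrent}`,
    );
  }
  return { maxConcurrentInvocationsPerInstance: maxConcurrent };
}

const explicitLimit = buildClientConfig(4);
const serviceChosen = buildClientConfig();

console.log(JSON.stringify(explicitLimit)); // {"maxConcurrentInvocationsPerInstance":4}
console.log(JSON.stringify(serviceChosen)); // {}
```

In a CDK app, an object of this shape would typically be passed where an `AsyncInferenceClientConfigProperty` is expected, since the property name and type match.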
