Interface ApplicationInferenceProfileProps
- All Superinterfaces:
software.amazon.jsii.JsiiSerializable
- All Known Implementing Classes:
ApplicationInferenceProfileProps.Jsii$Proxy
Example:
// Create a cross-region inference profile
CrossRegionInferenceProfile crossRegionProfile = CrossRegionInferenceProfile.fromConfig(CrossRegionInferenceProfileProps.builder()
.geoRegion(CrossRegionInferenceProfileRegion.US)
.model(BedrockFoundationModel.ANTHROPIC_CLAUDE_3_5_SONNET_V2_0)
.build());
// Create an application inference profile across regions
ApplicationInferenceProfile appProfile = ApplicationInferenceProfile.Builder.create(this, "MyMultiRegionProfile")
.applicationInferenceProfileName("claude-35-sonnet-v2-multi-region")
.modelSource(crossRegionProfile)
.description("Multi-region application profile for cost tracking")
.build();
-
Nested Class Summary
Nested ClassesModifier and TypeInterfaceDescriptionstatic final classA builder forApplicationInferenceProfilePropsstatic final classAn implementation forApplicationInferenceProfileProps -
Method Summary
Modifier and TypeMethodDescriptionbuilder()(experimental) The name of the application inference profile.default String(experimental) Description of the inference profile.(experimental) The model source for this inference profile.getTags()(experimental) A list of tags associated with the inference profile.Methods inherited from interface software.amazon.jsii.JsiiSerializable
$jsii$toJson
-
Method Details
-
getApplicationInferenceProfileName
(experimental) The name of the application inference profile.This name will be used to identify the inference profile in the AWS console and APIs.
- Required: Yes
- Maximum length: 64 characters
- Pattern:
^([0-9a-zA-Z:.][ _-]?)+$
- See Also:
-
getModelSource
(experimental) The model source for this inference profile.To create an application inference profile for one Region, specify a foundation model. Usage and costs for requests made to that Region with that model will be tracked.
To create an application inference profile for multiple Regions, specify a cross region (system-defined) inference profile. The inference profile will route requests to the Regions defined in the cross region (system-defined) inference profile that you choose. Usage and costs for requests made to the Regions in the inference profile will be tracked.
-
getDescription
(experimental) Description of the inference profile. Provides additional context about the purpose and usage of this inference profile.- Maximum length: 200 characters when provided
- Pattern:
^([0-9a-zA-Z:.][ _-]?)+$
Default: - No description is provided
- See Also:
-
getTags
(experimental) A list of tags associated with the inference profile.Tags help you organize and categorize your AWS resources.
Default: - No tags are applied
-
builder
-