Class: Aws::SageMaker::Types::InferenceComponentSpecification
- Inherits:
-
Struct
- Object
- Struct
- Aws::SageMaker::Types::InferenceComponentSpecification
- Defined in:
- gems/aws-sdk-sagemaker/lib/aws-sdk-sagemaker/types.rb
Overview
Details about the resources to deploy with this inference component, including the model, container, and compute resources.
Constant Summary collapse
- SENSITIVE =
[]
Instance Attribute Summary collapse
-
#base_inference_component_name ⇒ String
The name of an existing inference component that is to contain the inference component that you're creating with your request.
-
#compute_resource_requirements ⇒ Types::InferenceComponentComputeResourceRequirements
The compute resources allocated to run the model, plus any adapter models, that you assign to the inference component.
-
#container ⇒ Types::InferenceComponentContainerSpecification
Defines a container that provides the runtime environment for a model that you deploy with an inference component.
-
#data_cache_config ⇒ Types::InferenceComponentDataCacheConfig
Settings that affect how the inference component caches data.
-
#model_name ⇒ String
The name of an existing SageMaker AI model object in your account that you want to deploy with the inference component.
-
#startup_parameters ⇒ Types::InferenceComponentStartupParameters
Settings that take effect while the model container starts up.
Instance Attribute Details
#base_inference_component_name ⇒ String
The name of an existing inference component that is to contain the inference component that you're creating with your request.
Specify this parameter only if your request is meant to create an adapter inference component. An adapter inference component contains the path to an adapter model. The purpose of the adapter model is to tailor the inference output of a base foundation model, which is hosted by the base inference component. The adapter inference component uses the compute resources that you assigned to the base inference component.
When you create an adapter inference component, use the Container
parameter to specify the location of the adapter artifacts. In the
parameter value, use the ArtifactUrl parameter of the
InferenceComponentContainerSpecification data type.
Before you can create an adapter inference component, you must have an existing inference component that contains the foundation model that you want to adapt.
27750 27751 27752 27753 27754 27755 27756 27757 27758 27759 |
# File 'gems/aws-sdk-sagemaker/lib/aws-sdk-sagemaker/types.rb', line 27750 class InferenceComponentSpecification < Struct.new( :model_name, :container, :startup_parameters, :compute_resource_requirements, :base_inference_component_name, :data_cache_config) SENSITIVE = [] include Aws::Structure end |
#compute_resource_requirements ⇒ Types::InferenceComponentComputeResourceRequirements
The compute resources allocated to run the model, plus any adapter models, that you assign to the inference component.
Omit this parameter if your request is meant to create an adapter inference component. An adapter inference component is loaded by a base inference component, and it uses the compute resources of the base inference component.
27750 27751 27752 27753 27754 27755 27756 27757 27758 27759 |
# File 'gems/aws-sdk-sagemaker/lib/aws-sdk-sagemaker/types.rb', line 27750 class InferenceComponentSpecification < Struct.new( :model_name, :container, :startup_parameters, :compute_resource_requirements, :base_inference_component_name, :data_cache_config) SENSITIVE = [] include Aws::Structure end |
#container ⇒ Types::InferenceComponentContainerSpecification
Defines a container that provides the runtime environment for a model that you deploy with an inference component.
27750 27751 27752 27753 27754 27755 27756 27757 27758 27759 |
# File 'gems/aws-sdk-sagemaker/lib/aws-sdk-sagemaker/types.rb', line 27750 class InferenceComponentSpecification < Struct.new( :model_name, :container, :startup_parameters, :compute_resource_requirements, :base_inference_component_name, :data_cache_config) SENSITIVE = [] include Aws::Structure end |
#data_cache_config ⇒ Types::InferenceComponentDataCacheConfig
Settings that affect how the inference component caches data.
27750 27751 27752 27753 27754 27755 27756 27757 27758 27759 |
# File 'gems/aws-sdk-sagemaker/lib/aws-sdk-sagemaker/types.rb', line 27750 class InferenceComponentSpecification < Struct.new( :model_name, :container, :startup_parameters, :compute_resource_requirements, :base_inference_component_name, :data_cache_config) SENSITIVE = [] include Aws::Structure end |
#model_name ⇒ String
The name of an existing SageMaker AI model object in your account that you want to deploy with the inference component.
27750 27751 27752 27753 27754 27755 27756 27757 27758 27759 |
# File 'gems/aws-sdk-sagemaker/lib/aws-sdk-sagemaker/types.rb', line 27750 class InferenceComponentSpecification < Struct.new( :model_name, :container, :startup_parameters, :compute_resource_requirements, :base_inference_component_name, :data_cache_config) SENSITIVE = [] include Aws::Structure end |
#startup_parameters ⇒ Types::InferenceComponentStartupParameters
Settings that take effect while the model container starts up.
27750 27751 27752 27753 27754 27755 27756 27757 27758 27759 |
# File 'gems/aws-sdk-sagemaker/lib/aws-sdk-sagemaker/types.rb', line 27750 class InferenceComponentSpecification < Struct.new( :model_name, :container, :startup_parameters, :compute_resource_requirements, :base_inference_component_name, :data_cache_config) SENSITIVE = [] include Aws::Structure end |