DescribeInferenceComponent
Returns information about an inference component.
Request Syntax
{
   "InferenceComponentName": "string"
}
    
      Request Parameters
For information about the parameters that are common to all actions, see Common Parameters.
The request accepts the following data in JSON format.
- InferenceComponentName
 - 
               
The name of the inference component.
Type: String
Length Constraints: Minimum length of 0. Maximum length of 63.
Pattern:
[a-zA-Z0-9]([\-a-zA-Z0-9]*[a-zA-Z0-9])?Required: Yes
 
Response Syntax
{
   "CreationTime": number,
   "EndpointArn": "string",
   "EndpointName": "string",
   "FailureReason": "string",
   "InferenceComponentArn": "string",
   "InferenceComponentName": "string",
   "InferenceComponentStatus": "string",
   "LastDeploymentConfig": { 
      "AutoRollbackConfiguration": { 
         "Alarms": [ 
            { 
               "AlarmName": "string"
            }
         ]
      },
      "RollingUpdatePolicy": { 
         "MaximumBatchSize": { 
            "Type": "string",
            "Value": number
         },
         "MaximumExecutionTimeoutInSeconds": number,
         "RollbackMaximumBatchSize": { 
            "Type": "string",
            "Value": number
         },
         "WaitIntervalInSeconds": number
      }
   },
   "LastModifiedTime": number,
   "RuntimeConfig": { 
      "CurrentCopyCount": number,
      "DesiredCopyCount": number
   },
   "Specification": { 
      "BaseInferenceComponentName": "string",
      "ComputeResourceRequirements": { 
         "MaxMemoryRequiredInMb": number,
         "MinMemoryRequiredInMb": number,
         "NumberOfAcceleratorDevicesRequired": number,
         "NumberOfCpuCoresRequired": number
      },
      "Container": { 
         "ArtifactUrl": "string",
         "DeployedImage": { 
            "ResolutionTime": number,
            "ResolvedImage": "string",
            "SpecifiedImage": "string"
         },
         "Environment": { 
            "string" : "string" 
         }
      },
      "DataCacheConfig": { 
         "EnableCaching": boolean
      },
      "ModelName": "string",
      "StartupParameters": { 
         "ContainerStartupHealthCheckTimeoutInSeconds": number,
         "ModelDataDownloadTimeoutInSeconds": number
      }
   },
   "VariantName": "string"
}
    
      Response Elements
If the action is successful, the service sends back an HTTP 200 response.
The following data is returned in JSON format by the service.
- CreationTime
 - 
               
The time when the inference component was created.
Type: Timestamp
 - EndpointArn
 - 
               
The Amazon Resource Name (ARN) of the endpoint that hosts the inference component.
Type: String
Length Constraints: Minimum length of 20. Maximum length of 2048.
Pattern:
arn:aws[a-z\-]*:sagemaker:[a-z0-9\-]*:[0-9]{12}:endpoint/.* - EndpointName
 - 
               
The name of the endpoint that hosts the inference component.
Type: String
Length Constraints: Minimum length of 0. Maximum length of 63.
Pattern:
[a-zA-Z0-9](-*[a-zA-Z0-9]){0,62} - FailureReason
 - 
               
If the inference component status is
Failed, the reason for the failure.Type: String
Length Constraints: Minimum length of 0. Maximum length of 1024.
 - InferenceComponentArn
 - 
               
The Amazon Resource Name (ARN) of the inference component.
Type: String
Length Constraints: Minimum length of 20. Maximum length of 2048.
 - InferenceComponentName
 - 
               
The name of the inference component.
Type: String
Length Constraints: Minimum length of 0. Maximum length of 63.
Pattern:
[a-zA-Z0-9]([\-a-zA-Z0-9]*[a-zA-Z0-9])? - InferenceComponentStatus
 - 
               
The status of the inference component.
Type: String
Valid Values:
InService | Creating | Updating | Failed | Deleting - LastDeploymentConfig
 - 
               
The deployment and rollback settings that you assigned to the inference component.
Type: InferenceComponentDeploymentConfig object
 - LastModifiedTime
 - 
               
The time when the inference component was last updated.
Type: Timestamp
 - RuntimeConfig
 - 
               
Details about the runtime settings for the model that is deployed with the inference component.
Type: InferenceComponentRuntimeConfigSummary object
 - Specification
 - 
               
Details about the resources that are deployed with this inference component.
Type: InferenceComponentSpecificationSummary object
 - VariantName
 - 
               
The name of the production variant that hosts the inference component.
Type: String
Length Constraints: Minimum length of 0. Maximum length of 63.
Pattern:
[a-zA-Z0-9](-*[a-zA-Z0-9]){0,62} 
Errors
For information about the errors that are common to all actions, see Common Errors.
See Also
For more information about using this API in one of the language-specific AWS SDKs, see the following: