BatchReplaceClusterNodesError - Amazon SageMaker

BatchReplaceClusterNodesError

Represents an error encountered when replacing a node in a SageMaker HyperPod cluster.

Contents

ErrorCode

The error code associated with the error encountered when replacing a node.

Possible values:

  • InstanceIdNotFound: The instance does not exist in the specified cluster.

  • InvalidInstanceStatus: The instance is in a state that does not allow replacement. Wait for the instance to finish any ongoing changes before retrying.

  • InstanceIdInUse: Another operation is already in progress for this node. Wait for the operation to complete before retrying.

  • InternalServerError: An internal error occurred while processing this node.

Type: String

Valid Values: InstanceIdNotFound | InvalidInstanceStatus | InstanceIdInUse | InternalServerError

Required: Yes

Message

A human-readable message describing the error encountered when replacing a node.

Type: String

Required: Yes

NodeId

The EC2 instance ID of the node that encountered an error during the replacement operation.

Type: String

Length Constraints: Minimum length of 1. Maximum length of 256.

Pattern: i-[a-f0-9]{8}(?:[a-f0-9]{9})?

Required: Yes

See Also

For more information about using this API in one of the language-specific AWS SDKs, see the following: