ListCustomModelDeployments
Lists custom model deployments in your account. You can filter the results by creation time, name, status, and associated model. Use this operation to manage and monitor your custom model deployments.
We recommend using pagination to ensure that the operation returns quickly and successfully.
The following actions are related to the ListCustomModelDeployments operation:
Request Syntax
GET /model-customization/custom-model-deployments?createdAfter=createdAfter
&createdBefore=createdBefore
&maxResults=maxResults
&modelArnEquals=modelArnEquals
&nameContains=nameContains
&nextToken=nextToken
&sortBy=sortBy
&sortOrder=sortOrder
&statusEquals=statusEquals
HTTP/1.1
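As a minimal sketch of how the request line above is assembled, the following builds the path and query string from only the parameters a caller supplies. The helper name `build_list_path` is hypothetical, and the sketch leaves out the host, authentication, and request signing that a real call needs.

```python
from urllib.parse import urlencode

# Path taken from the Request Syntax above.
BASE_PATH = "/model-customization/custom-model-deployments"


def build_list_path(**params):
    """Return the request path, including only parameters that were supplied.

    Hypothetical helper for illustration; it does not sign or send anything.
    """
    # Skip parameters left as None and sort keys for a stable, readable URL.
    supplied = {key: value for key, value in sorted(params.items()) if value is not None}
    query = urlencode(supplied)
    return f"{BASE_PATH}?{query}" if query else BASE_PATH


print(build_list_path(maxResults=50, statusEquals="Active", nameContains=None))
```

Omitted parameters are dropped rather than sent as empty strings, since every URI parameter in this operation is a filter or paging control.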
URI Request Parameters
The request uses the following URI parameters.
- createdAfter
Filters deployments created after the specified date and time.
- createdBefore
Filters deployments created before the specified date and time.
- maxResults
The maximum number of results to return in a single call.
Valid Range: Minimum value of 1. Maximum value of 1000.
- modelArnEquals
Filters deployments by the Amazon Resource Name (ARN) of the associated custom model.
Length Constraints: Minimum length of 20. Maximum length of 1011.
Pattern: arn:aws(-[^:]+)?:bedrock:[a-z0-9-]{1,20}:[0-9]{12}:custom-model/(imported|[a-z0-9-]{1,63}[.]{1}[a-z0-9-]{1,63}([a-z0-9-]{1,63}[.]){0,2}[a-z0-9-]{1,63}([:][a-z0-9-]{1,63}){0,2})/[a-z0-9]{12}
- nameContains
Filters deployments whose names contain the specified string.
Length Constraints: Minimum length of 1. Maximum length of 63.
Pattern: ([0-9a-zA-Z][_-]?){1,63}
- nextToken
The token for the next set of results. Use this token to retrieve additional results when the response is truncated.
Length Constraints: Minimum length of 1. Maximum length of 2048.
Pattern: \S*
- sortBy
The field to sort the results by. The only supported value is CreationTime.
Valid Values: CreationTime
- sortOrder
The sort order for the results. Valid values are Ascending and Descending. Default is Descending.
Valid Values: Ascending | Descending
- statusEquals
Filters deployments by status. Valid values are Creating, Active, and Failed.
Valid Values: Creating | Active | Failed
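The constraints listed for the URI parameters can be checked on the client before a request is sent. The sketch below is one way to do that, assuming the Valid Values lines are authoritative for the enumerations; `validate_params` is a hypothetical helper, and it covers only some of the documented parameters.

```python
import re

# Patterns and enumerations copied from the parameter descriptions above.
NAME_PATTERN = re.compile(r"([0-9a-zA-Z][_-]?){1,63}")
TOKEN_PATTERN = re.compile(r"\S*")
VALID_SORT_BY = {"CreationTime"}
VALID_SORT_ORDER = {"Ascending", "Descending"}
VALID_STATUS = {"Creating", "Active", "Failed"}


def validate_params(params):
    """Return a list of human-readable problems; an empty list means no problem found."""
    errors = []

    max_results = params.get("maxResults")
    if max_results is not None and not 1 <= max_results <= 1000:
        errors.append("maxResults must be between 1 and 1000")

    name = params.get("nameContains")
    if name is not None and not (1 <= len(name) <= 63 and NAME_PATTERN.fullmatch(name)):
        errors.append("nameContains must be 1-63 characters matching the documented pattern")

    token = params.get("nextToken")
    if token is not None and not (1 <= len(token) <= 2048 and TOKEN_PATTERN.fullmatch(token)):
        errors.append("nextToken must be 1-2048 non-whitespace characters")

    # Enumerated parameters: compare against the documented Valid Values.
    for key, allowed in (
        ("sortBy", VALID_SORT_BY),
        ("sortOrder", VALID_SORT_ORDER),
        ("statusEquals", VALID_STATUS),
    ):
        value = params.get(key)
        if value is not None and value not in allowed:
            errors.append(f"{key} must be one of {sorted(allowed)}")

    return errors


print(validate_params({"maxResults": 0, "statusEquals": "Running"}))
```

Client-side checks like these only mirror the documented rules; the service remains the final authority and returns ValidationException for input it rejects.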
Request Body
The request does not have a request body.
Response Syntax
HTTP/1.1 200
Content-type: application/json
{
"modelDeploymentSummaries": [
{
"createdAt": "string",
"customModelDeploymentArn": "string",
"customModelDeploymentName": "string",
"failureMessage": "string",
"lastUpdatedAt": "string",
"modelArn": "string",
"status": "string"
}
],
"nextToken": "string"
}
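To illustrate the response shape, the following sketch parses a response body into simple records. The class `DeploymentSummary` and the function `parse_response` are hypothetical names, the sample values are invented for the demonstration, and treating `failureMessage` and `lastUpdatedAt` as optional is an assumption rather than something this page states.

```python
import json
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class DeploymentSummary:
    arn: str
    name: str
    model_arn: str
    status: str
    created_at: str
    # Assumption: these two fields may be absent from a summary.
    last_updated_at: Optional[str] = None
    failure_message: Optional[str] = None


def parse_response(body: str) -> Tuple[List[DeploymentSummary], Optional[str]]:
    """Parse a JSON response body into summaries plus the next-page token."""
    data = json.loads(body)
    summaries = [
        DeploymentSummary(
            arn=item["customModelDeploymentArn"],
            name=item["customModelDeploymentName"],
            model_arn=item["modelArn"],
            status=item["status"],
            created_at=item["createdAt"],
            last_updated_at=item.get("lastUpdatedAt"),
            failure_message=item.get("failureMessage"),
        )
        for item in data.get("modelDeploymentSummaries", [])
    ]
    return summaries, data.get("nextToken")


# Invented sample body that follows the Response Syntax above.
sample = json.dumps({
    "modelDeploymentSummaries": [{
        "createdAt": "2025-01-01T00:00:00Z",
        "customModelDeploymentArn": "example-deployment-arn",
        "customModelDeploymentName": "example-deployment",
        "modelArn": "example-model-arn",
        "status": "Active",
    }],
    "nextToken": "example-token",
})
summaries, next_token = parse_response(sample)
print(summaries[0].name, summaries[0].status, next_token)
```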
Response Elements
If the action is successful, the service sends back an HTTP 200 response.
The following data is returned in JSON format by the service.
- modelDeploymentSummaries
A list of custom model deployment summaries.
Type: Array of CustomModelDeploymentSummary objects
- nextToken
The token for the next set of results. This value is null when there are no more results to return.
Type: String
Length Constraints: Minimum length of 1. Maximum length of 2048.
Pattern: \S*
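The `nextToken` behavior described above lends itself to a simple loop: request a page, yield its summaries, and continue until no token comes back. The sketch below assumes a caller-supplied `fetch_page` function (hypothetical) that takes a dict of request parameters and returns the parsed JSON response; a stand-in with canned pages is used so the example runs without network access.

```python
def iter_deployments(fetch_page, **filters):
    """Yield every deployment summary across all pages.

    fetch_page is a hypothetical callable: params dict in, response dict out.
    """
    token = None
    while True:
        params = dict(filters)
        if token:
            # Pass the token from the previous response to get the next page.
            params["nextToken"] = token
        page = fetch_page(params)
        yield from page.get("modelDeploymentSummaries", [])
        token = page.get("nextToken")
        if not token:
            # A null or missing token means there are no more results.
            break


# Canned pages standing in for real responses.
pages = {
    None: {"modelDeploymentSummaries": [{"customModelDeploymentName": "a"}], "nextToken": "t1"},
    "t1": {"modelDeploymentSummaries": [{"customModelDeploymentName": "b"}], "nextToken": None},
}


def fake_fetch(params):
    return pages[params.get("nextToken")]


names = [d["customModelDeploymentName"] for d in iter_deployments(fake_fetch, maxResults=1)]
print(names)
```

Writing the loop as a generator lets a caller stop early without fetching the remaining pages.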
Errors
For information about the errors that are common to all actions, see Common Errors.
- AccessDeniedException
The request is denied because of missing access permissions.
HTTP Status Code: 403
- InternalServerException
An internal server error occurred. Retry your request.
HTTP Status Code: 500
- ThrottlingException
The number of requests exceeds the limit. Resubmit your request later.
HTTP Status Code: 429
- ValidationException
Input validation failed. Check your request parameters and retry the request.
HTTP Status Code: 400
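The error descriptions above suggest two groups: errors worth retrying unchanged (500 and 429) and errors that need a corrected request or permissions first (400 and 403). The sketch below encodes that reading; `should_retry` and `backoff_delays` are hypothetical helpers, and the exponential backoff schedule is an assumption, not a value documented for this operation.

```python
# Status codes from the Errors list whose descriptions say to retry or resubmit.
RETRYABLE_STATUS_CODES = {429, 500}


def should_retry(status_code):
    """Return True when the same request can reasonably be sent again."""
    return status_code in RETRYABLE_STATUS_CODES


def backoff_delays(attempts, base=0.5, cap=8.0):
    """Return a capped exponential delay schedule in seconds.

    The base and cap values are illustrative assumptions.
    """
    return [min(cap, base * (2 ** i)) for i in range(attempts)]


for code in (400, 403, 429, 500):
    print(code, should_retry(code))
print(backoff_delays(4))
```

Adding random jitter to each delay is a common refinement when many clients may be throttled at once; it is left out here to keep the schedule deterministic.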
See Also
For more information about using this API in one of the language-specific AWS SDKs, see the following: