Quotas for Amazon Bedrock AgentCore - Amazon Bedrock AgentCore

Quotas for Amazon Bedrock AgentCore

Your AWS account has default quotas, formerly referred to as limits, for each AWS service. Unless otherwise noted, each quota is Region-specific. You can request increases for some quotas; other quotas cannot be increased.

To request a quota increase, contact AWS support.

AgentCore Runtime Service Quotas

When working with AgentCore Runtime, you need to be aware of the service limits that apply to your account. These limits help ensure service stability and availability for all users.

Resource allocation limits

The following table describes the resource allocation limits for AgentCore Runtime:

Resource allocation limits

| Limit | Default value | Adjustable | Notes |
|---|---|---|---|
| Active session workloads per account | 1,000 in US East (N. Virginia) and US West (Oregon); 500 in other AWS Regions | Yes | Can be increased via support ticket |
| Total agents per account | 1,000 | Yes | Can be increased via support ticket |
| Versions per agent | 1,000 | Yes | Inactive versions deleted after 45 days |
| Endpoints (aliases) per agent | 10 | Yes | Can be increased via support ticket |
| Maximum size for a Docker image in an AgentCore Runtime | 1 GB | No | |
| Maximum hardware allocation per session | 2 vCPU / 8 GB | No | The maximum memory/CPU usage and allocation per Runtime session |

Invocation limits

The following table describes the invocation limits for AgentCore Runtime:

Invocation limits

| Limit | Value | Adjustable | Notes |
|---|---|---|---|
| Request timeout | 15 minutes | No | Maximum time for synchronous requests |
| Maximum payload size | 100 MB | No | Maximum size for request/response payloads |
| Streaming chunk size | 10 MB | No | Maximum size for individual chunks |
| Streaming maximum duration | 60 minutes | No | Maximum time for streaming connections |
| Asynchronous job maximum duration | 8 hours | No | Maximum execution time for asynchronous jobs |
| Invocations per second | 25 per endpoint | Yes | Rate limit for API calls |
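
The payload and chunk limits above lend themselves to a client-side check before a response is streamed. The following Python sketch is one way to do that and is not part of any AgentCore SDK: the function name `split_into_chunks` and the constants are hypothetical, and treating 1 MB as 1024 * 1024 bytes is an assumption because the table does not define the unit.

```python
from typing import Iterator

# Values from the invocation limits table. Treating 1 MB as 1024 * 1024
# bytes is an assumption; the table does not define the unit.
MB = 1024 * 1024
MAX_PAYLOAD_BYTES = 100 * MB
MAX_CHUNK_BYTES = 10 * MB


def split_into_chunks(payload: bytes, chunk_size: int = MAX_CHUNK_BYTES) -> Iterator[bytes]:
    """Yield consecutive slices of payload, each at most chunk_size bytes."""
    if len(payload) > MAX_PAYLOAD_BYTES:
        raise ValueError("payload exceeds the 100 MB maximum payload size")
    if not 0 < chunk_size <= MAX_CHUNK_BYTES:
        raise ValueError("chunk_size must be between 1 byte and 10 MB")
    for start in range(0, len(payload), chunk_size):
        yield payload[start:start + chunk_size]
```

Because this is a generator, the size checks run when the first chunk is requested rather than when the function is called.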

Throttling limits

The following table lists the API rate limits for AgentCore Runtime. Requests that exceed these rates are throttled:

Throttling limits

| Limit | Value | Adjustable | Notes |
|---|---|---|---|
| InvokeAgentRuntime API rate, per agent, per account | 25 TPS | Yes | Transactions per second |
| New sessions created rate, per endpoint | 10 TPM | No | Transactions per minute |
| CreateAgentRuntime API rate | 5 TPS | Yes | Transactions per second |
| CreateAgentRuntimeEndpoint API rate | 5 TPS | Yes | Transactions per second |
| GetAgentRuntime API rate | 50 TPS | Yes | Transactions per second |
| GetAgentRuntimeEndpoint API rate | 50 TPS | Yes | Transactions per second |
| UpdateAgentRuntime API rate | 5 TPS | Yes | Transactions per second |
| UpdateAgentRuntimeEndpoint API rate | 5 TPS | Yes | Transactions per second |
| DeleteAgentRuntime API rate | 5 TPS | Yes | Transactions per second |
| DeleteAgentRuntimeEndpoint API rate | 5 TPS | Yes | Transactions per second |
| ListAgentRuntimes API rate | 5 TPS | Yes | Transactions per second |
| ListAgentRuntimeEndpoints API rate | 5 TPS | Yes | Transactions per second |
| ListAgentRuntimeVersions API rate | 5 TPS | Yes | Transactions per second |
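
One common way to stay under a transactions-per-second limit is a client-side token bucket. The sketch below is a generic illustration, not an AgentCore API: the `TokenBucket` class is hypothetical, and the burst capacity is a client-side choice because this page does not describe how the service treats bursts.

```python
import time
from typing import Callable


class TokenBucket:
    """Client-side limiter that allows `rate` calls per second on average."""

    def __init__(self, rate: float, capacity: float,
                 clock: Callable[[], float] = time.monotonic):
        self.rate = rate          # tokens added per second, e.g. 25 for InvokeAgentRuntime
        self.capacity = capacity  # largest burst this client allows itself
        self.clock = clock        # injectable so the limiter can be tested without sleeping
        self.tokens = capacity
        self.last = clock()

    def try_acquire(self) -> bool:
        """Take one token if available; return False when the caller should wait."""
        now = self.clock()
        # Refill in proportion to the time elapsed, capped at the capacity.
        self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= 1:
            self.tokens -= 1
            return True
        return False
```

A caller would create one bucket per limited operation, for example `TokenBucket(rate=25, capacity=25)` for InvokeAgentRuntime, and delay or queue a request whenever `try_acquire()` returns False.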

Lifetime session lifecycle parameters

The following table describes the lifetime session lifecycle parameters for AgentCore Runtime:

Lifetime session lifecycle parameters

| Phase | Timeout | Adjustable | Notes |
|---|---|---|---|
| Idle session timeout | 15 minutes of inactivity | Yes, through the idleRuntimeSessionTimeout API parameter in the LifecycleConfiguration data type | When this limit is reached, the execution environment is terminated and a new one is created for the session |
| Maximum session duration | 8 hours | Yes, through the maxLifetime API parameter in the LifecycleConfiguration data type | |
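
The defaults above can be written out as a LifecycleConfiguration value. The sketch below assumes that both parameters are given in seconds, which this page does not state, so confirm the unit in the API reference before relying on it. The helper name `lifecycle_configuration` and its idle-versus-lifetime sanity check are illustrative additions, not documented rules.

```python
from datetime import timedelta


def lifecycle_configuration(idle: timedelta = timedelta(minutes=15),
                            max_lifetime: timedelta = timedelta(hours=8)) -> dict:
    """Build a LifecycleConfiguration-shaped dict (values in seconds: an assumption)."""
    # Sanity check for this sketch only; not a rule taken from the documentation.
    if idle > max_lifetime:
        raise ValueError("idle timeout should not exceed the maximum session lifetime")
    return {
        "idleRuntimeSessionTimeout": int(idle.total_seconds()),
        "maxLifetime": int(max_lifetime.total_seconds()),
    }
```

With the default arguments, 15 minutes becomes 900 seconds and 8 hours becomes 28,800 seconds.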

AgentCore Memory Service Quotas

The following table describes the limits for AgentCore Memory:

AgentCore Memory limits

| Limit | Value | Adjustable | Notes |
|---|---|---|---|
| Maximum number of AgentCore Memory resources per AWS Region in an AWS account | 50 | Yes | |
| Maximum number of memory strategies per AgentCore Memory resource | 6 | No | |
| Minimum EventExpirationDuration days in a CreateEvent operation | 7 | No | |
| Maximum EventExpirationDuration days in a CreateEvent operation | 365 | No | |
| Maximum prompt size (AppendToPrompt) for custom memory strategy (Extraction/Consolidation) | 30 KB | No | |
| Maximum number of messages per CreateEvent operation | 100 | No | |
| Maximum message size in a CreateEvent operation | 9 KB | No | |
| Maximum event size in a CreateEvent operation | 10 MB | No | |
| Maximum CreateEvent requests | 10 per second | Yes | The maximum number of CreateEvent requests per second that you can perform in this AWS account in the current AWS Region. |
| Maximum CreateEvent requests per actor, per session, including conversational payloads | 0.25 per second | No | The maximum number of CreateEvent requests per second, per actor, per session, including conversational payloads, that you can perform in this AWS account in the current AWS Region. |
| Maximum CreateEvent requests per actor, per session, not including conversational payloads | 10 per second | No | The maximum number of CreateEvent requests per second, per actor, per session, not including conversational payloads, that you can perform in this AWS account in the current AWS Region. |
| Maximum RetrieveMemoryRecords requests | 5 per second | Yes | The maximum number of RetrieveMemoryRecords requests per second that you can perform in this AWS account in the current AWS Region. |
| Maximum requests for all other AgentCore Memory APIs | 20 per second | Yes | The maximum transactions per second (TPS) that can be processed in this AWS account in the current AWS Region for all other AgentCore Memory APIs. |
| Maximum number of tokens per minute for long-term memory extraction | 150,000 | Yes | The maximum number of tokens per minute that can be processed for long-term memory extraction for built-in strategies in this AWS account in the current AWS Region. You can monitor token use through the Amazon CloudWatch metric named TokenCount in the Bedrock-AgentCore namespace. You can request an increase to this limit through the Service Quotas console. |
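
Several of the CreateEvent limits above can be checked before a request is sent. The following sketch is a hypothetical pre-flight check, not an AgentCore API: the function name `check_event` is invented for illustration, 1 KB is assumed to be 1024 bytes, and message size is assumed to be measured in UTF-8 bytes. The 10 MB event size limit is left out because this page does not say how an event is serialized.

```python
from typing import List

KB = 1024  # assumption: 1 KB = 1024 bytes
MAX_MESSAGES_PER_EVENT = 100
MAX_MESSAGE_BYTES = 9 * KB
MIN_EXPIRATION_DAYS = 7
MAX_EXPIRATION_DAYS = 365

# 0.25 requests per second (with conversational payloads) works out to one
# request every 4 seconds for a given actor and session.
MIN_SECONDS_BETWEEN_CONVERSATIONAL_EVENTS = 1 / 0.25


def check_event(messages: List[str], expiration_days: int) -> List[str]:
    """Return a list of problems found; an empty list means none were found."""
    problems = []
    if not MIN_EXPIRATION_DAYS <= expiration_days <= MAX_EXPIRATION_DAYS:
        problems.append(f"expiration of {expiration_days} days is outside 7-365")
    if len(messages) > MAX_MESSAGES_PER_EVENT:
        problems.append(f"{len(messages)} messages exceeds the maximum of 100")
    # Size is measured as UTF-8 bytes, which is an assumption of this sketch.
    oversized = [i for i, m in enumerate(messages)
                 if len(m.encode("utf-8")) > MAX_MESSAGE_BYTES]
    if oversized:
        problems.append(f"messages at indexes {oversized} exceed 9 KB")
    return problems
```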

AgentCore Identity Service Quotas

When working with AgentCore Identity, you need to be aware of the service limits that apply to your account. These limits help ensure service stability and availability for all users.

AgentCore Identity limits

| Limit | Value | Adjustable | Notes |
|---|---|---|---|
| Workload identities | 1,000 | No | The maximum number of workload identities that you can create in this account in the current Region. |
| Resource OAuth2 credential providers | 50 | No | The maximum number of OAuth2 credential providers for egress resources that you can create in this account in the current Region. |
| Resource API key credential providers | 50 | No | The maximum number of API key credential providers for egress resources that you can create in this account in the current Region. |

AgentCore Gateway Service Quotas

This section provides information about Amazon Bedrock AgentCore Gateway endpoints and service limits.

Endpoints

Amazon Bedrock AgentCore Gateway provides AWS Region-specific endpoints for management operations and runtime access.

The Amazon Bedrock AgentCore Gateway control plane endpoints use the following format, where <region> is any supported AWS Region:

bedrock-agentcore-control.<region>.amazonaws.com

The AgentCore Gateway URLs for runtime access have the following format:

https://{gateway-Id}.gateway.bedrock-agentcore.{Region}.amazonaws.com

Where:

  • {gateway-Id} is the unique identifier for your gateway

  • {Region} is the AWS Region where your gateway is deployed

Gateway ARNs have the following format:

arn:${Partition}:bedrock-agentcore:${Region}:${Account}:gateway/${gateway-Id}

The AgentCore service principal is: bedrock-agentcore.amazonaws.com
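
The endpoint, URL, and ARN formats above are plain string templates, so they can be assembled with small helpers. The function names below are hypothetical, the gateway ID, Region, and account ID in the usage note are placeholder values, and the default partition of `aws` is an assumption for the standard commercial Regions.

```python
def control_plane_endpoint(region: str) -> str:
    """Host name for gateway management operations in the given Region."""
    return f"bedrock-agentcore-control.{region}.amazonaws.com"


def gateway_url(gateway_id: str, region: str) -> str:
    """Runtime access URL for a gateway, following the format shown above."""
    return f"https://{gateway_id}.gateway.bedrock-agentcore.{region}.amazonaws.com"


def gateway_arn(gateway_id: str, region: str, account: str,
                partition: str = "aws") -> str:
    """ARN for a gateway; the default partition is an assumption."""
    return f"arn:{partition}:bedrock-agentcore:{region}:{account}:gateway/{gateway_id}"
```

For example, `gateway_url("gw-example", "us-east-1")` produces `https://gw-example.gateway.bedrock-agentcore.us-east-1.amazonaws.com`.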

Service quotas

Amazon Bedrock AgentCore Gateway has the following service quotas. You can request increases for some quotas using the Service Quotas console.

Amazon Bedrock AgentCore Gateway service quotas

| Quota | Default value | Adjustable |
|---|---|---|
| Number of gateways per account | 1,000 | Yes |
| Number of targets per gateway | 100 | Yes |
| Number of tools per target | 1,000 | Yes |
| Timeout for a gateway invocation | 5 minutes | Yes |
| Maximum inline schema size | 1 MB | Yes |
| Maximum S3 payload schema size | 10 MB | Yes |
| Tool name character limit | 256 characters | Yes |
| CreateGateway API rate | 5 transactions per second | Yes |
| UpdateGateway API rate | 5 transactions per second | Yes |
| GetGateway API rate | 10 transactions per second | Yes |
| ListGateways API rate | 10 transactions per second | Yes |
| DeleteGateway API rate | 5 transactions per second | Yes |
| CreateGatewayTarget API rate | 5 transactions per second | Yes |
| UpdateGatewayTarget API rate | 5 transactions per second | Yes |
| GetGatewayTarget API rate | 10 transactions per second | Yes |
| ListGatewayTargets API rate | 10 transactions per second | Yes |
| DeleteGatewayTarget API rate | 5 transactions per second | Yes |
| Concurrent target operations (total of Create/Update/DeleteTarget) on same gateway | 5 | Yes |
| tool-call/tool-list rate at gateway level | 50 concurrent connections | Yes |
| tool-call/tool-list rate at account level | 50 concurrent connections | Yes |
| Search-based tool-call rate | 25 transactions per minute | Yes |
| Maximum tool-call/tool-list/tool-search payload size | 6 MB | Yes |

For more information about service quotas and how to request increases, see Requesting a quota increase in the Service Quotas User Guide.
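
As an illustration of how the default tool quotas might be enforced before a target is registered, the following sketch checks a list of tool names against the per-target count and the name length limit. The function `check_tools` is hypothetical and uses only the default values from the table; an account with increased quotas would need different constants.

```python
from typing import List

# Default values from the Gateway service quotas table.
MAX_TOOLS_PER_TARGET = 1000
MAX_TOOL_NAME_CHARS = 256


def check_tools(tool_names: List[str]) -> List[str]:
    """Return a list of problems found; an empty list means none were found."""
    problems = []
    if len(tool_names) > MAX_TOOLS_PER_TARGET:
        problems.append(f"{len(tool_names)} tools exceeds the default of 1,000 per target")
    for name in tool_names:
        if len(name) > MAX_TOOL_NAME_CHARS:
            # Show only the start of the name to keep the message readable.
            problems.append(f"tool name starting '{name[:20]}' has {len(name)} characters")
    return problems
```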

AgentCore Browser Service Quotas

The Browser tool has the following service quotas and considerations that apply to your account.

Browser service quotas

| Quota | Default value | Adjustable | Notes |
|---|---|---|---|
| Concurrent active sessions per account for browser and CI | 1,000 | Yes | Can be increased via support ticket |
| Total Browser tool configurations per account | 100 | Yes | Can be increased via support ticket |
| Hardware configuration per session | 1 vCPU / 4 GB | No | The maximum memory/CPU usage and configuration per session |

Browser Invocation Limits

The following table describes the invocation limits for the Browser tool:

Browser invocation limits

| Limit | Value | Adjustable | Notes |
|---|---|---|---|
| Automation stream limit per session | 1 | No | Maximum number of automation streams per session |
| Live view stream limit per session | 1 | No | Maximum number of live view streams per session |
| Asynchronous command maximum duration | 8 hours | No | Maximum execution time for asynchronous commands |
| Disk size | 10 GB | No | Maximum disk space available per session |

AgentCore Code Interpreter Service Quotas

The Code Interpreter tool has the following service quotas and considerations that apply to your account.

Code Interpreter service quotas

| Quota | Default value | Adjustable | Notes |
|---|---|---|---|
| Concurrent active sessions per account | 500 | Yes | Can be increased via support ticket |
| Total Code Interpreter tool configurations per account | 100 | Yes | Can be increased via support ticket |
| Hardware configuration per session | 2 vCPU / 8 GB | No | The maximum memory/CPU usage and configuration per session |

Code Interpreter Invocation Limits

The following table describes the invocation limits for the Code Interpreter tool:

Code Interpreter invocation limits

| Limit | Value | Adjustable | Notes |
|---|---|---|---|
| Request timeout | 15 minutes | No | Maximum time for synchronous requests |
| Maximum payload size | 100 MB | No | Maximum size for request/response payloads |
| Invocations per second | 3 | No | Rate limit for API calls per session |
| Asynchronous command maximum duration | 8 hours | No | Maximum execution time for asynchronous commands |
| Disk size | 10 GB | No | Maximum disk space available per session |
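
A limit of 3 invocations per second per session means that evenly paced calls are at least one third of a second apart. The sketch below computes such a schedule. The function name `send_times` is hypothetical, and even spacing is only one client-side pacing strategy, not something the service requires.

```python
from typing import List

INVOCATIONS_PER_SECOND = 3  # per session, from the table above


def send_times(count: int, start: float = 0.0,
               rate: float = INVOCATIONS_PER_SECOND) -> List[float]:
    """Return `count` send times, in seconds, spaced 1/rate apart."""
    if rate <= 0:
        raise ValueError("rate must be positive")
    return [start + i / rate for i in range(count)]
```

A client could sleep until each returned time before issuing the next call in a session, so that no one-second window contains more than three invocations.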