Request structure Request parameters Response structure

Using the Invoke API

The Invoke API provides direct access to Amazon Nova models with full control over the request and response format. Unlike the Converse API which abstracts model-specific details, the Invoke API allows you to work directly with the model's native request and response structures.

The Invoke API is useful when you need:

Direct control over request formatting
Access to model-specific response fields
Custom handling of binary data encoding

Note

The Invoke API supports the same features as the Converse API except for document input modality, which is specific to the Converse API.

Request structure

An Invoke API request requires the model ID and a JSON request body:


import boto3
import json

bedrock = boto3.client('bedrock-runtime', region_name='us-east-1')

request_body = {
    'messages': [
        {
            'role': 'user',
            'content': [{'text': 'What is machine learning?'}]
        }
    ],
    'inferenceConfig': {
        'maxTokens': 512,
        'temperature': 0.7
    }
}

response = bedrock.invoke_model(
    modelId='us.amazon.nova-2-lite-v1:0',
    body=json.dumps(request_body)
)

response_body = json.loads(response['body'].read())
content_list = response_body["output"]["message"]["content"]
# Extract the first text block
text_block = next((item for item in content_list if "text" in item), None)
if text_block is not None:
    print(text_block["text"])

Request parameters

The Invoke API supports the following key parameters:

messages: Array of conversation messages with role and content
system: Optional system prompt for context and instructions
inferenceConfig: Parameters controlling model output (temperature, maxTokens, topP, topK, stopSequences, reasoningConfig)
toolConfig: Tool specifications and tool choice for function calling

Response structure

The response includes the model's output and metadata:


{
    "output": {
        "message": {
            "role": "assistant",
            "content": [
                {
                    "text": "Machine learning is a subset of artificial intelligence..."
                }
            ]
        }
    },
    "stopReason": "end_turn",
    "usage": {
        "inputTokens": 15,
        "outputTokens": 128,
        "totalTokens": 143
    }
}

Possible stopReason values:

end_turn: Logical end of the turn was reached
max_tokens: Output reached the maxTokens limit
content_filtered: Output violated Amazon Responsible AI policy
malformed_model_output: Model produced invalid output
malformed_tool_use: Model produced invalid tool use output
service_unavailable: Built-in tool service could not be reached
invalid_query: Query sent to built-in tool was invalid
max_tool_invocations: Built-in tool did not produce valid result after retries

Warning Javascript is disabled or is unavailable in your browser.

To use the Amazon Web Services Documentation, Javascript must be enabled. Please refer to your browser's Help pages for instructions.

Document Conventions

Using the Converse API

Streaming responses