All Superinterfaces:: software.amazon.jsii.JsiiSerializable

All Known Implementing Classes:: LlmAsAJudgeOptions.Jsii$Proxy

@Generated(value="jsii-pacmak/1.133.0 (build 0f43e37)", date="2026-07-02T13:32:34.162Z") @Stability(Stable) public interface LlmAsAJudgeOptions extends software.amazon.jsii.JsiiSerializable

Options for configuring an LLM-as-a-Judge custom evaluator.

Uses a foundation model to assess agent performance based on custom instructions and a rating scale.

Example:

 // Create a custom LLM-as-a-Judge evaluator
 Evaluator evaluator = Evaluator.Builder.create(this, "MyEvaluator")
         .evaluatorName("my_custom_evaluator")
         .level(EvaluationLevel.SESSION)
         .evaluatorConfig(EvaluatorConfig.llmAsAJudge(LlmAsAJudgeOptions.builder()
                 .instructions("Evaluate whether the agent response is helpful and accurate.")
                 .modelId("us.anthropic.claude-sonnet-4-6")
                 .ratingScale(EvaluatorRatingScale.categorical(List.of(CategoricalRatingOption.builder().label("Good").definition("The response is helpful and accurate.").build(), CategoricalRatingOption.builder().label("Bad").definition("The response is not helpful or contains errors.").build())))
                 .build()))
         .build();
 // Use the custom evaluator in an online evaluation configuration
 // Use the custom evaluator in an online evaluation configuration
 OnlineEvaluationConfig.Builder.create(this, "MyEvaluation")
         .onlineEvaluationConfigName("my_evaluation")
         .evaluators(List.of(EvaluatorSelector.builtin(BuiltinEvaluator.HELPFULNESS), EvaluatorSelector.custom(evaluator)))
         .dataSource(DataSourceConfig.fromCloudWatchLogs(CloudWatchLogsDataSourceConfig.builder()
                 .logGroupNames(List.of("/aws/bedrock-agentcore/my-agent"))
                 .serviceNames(List.of("my-agent.default"))
                 .build()))
         .build();

Nested Class Summary

Nested Classes

Modifier and Type

Interface

Description

static final class

LlmAsAJudgeOptions.Builder

A builder for LlmAsAJudgeOptions

static final class

LlmAsAJudgeOptions.Jsii$Proxy

An implementation for LlmAsAJudgeOptions
Method Summary

Modifier and Type

Method

Description

static LlmAsAJudgeOptions.Builder

builder()

default Map<String,Object>

getAdditionalModelRequestFields()

Additional model-specific request fields.

default EvaluatorInferenceConfig

getInferenceConfig()

Optional inference configuration parameters that control model behavior during evaluation.

String

getInstructions()

The evaluation instructions that guide the language model in assessing agent performance.

String

getModelId()

The identifier of the Amazon Bedrock model to use for evaluation.

EvaluatorRatingScale

getRatingScale()

The rating scale that defines how the evaluator should score agent performance.

Methods inherited from interface software.amazon.jsii.JsiiSerializable
$jsii$toJson

Method Details
- getInstructions
  
  @Stability(Stable) @NotNull String getInstructions()
  
  The evaluation instructions that guide the language model in assessing agent performance.
  These instructions define the evaluation criteria, context, and expected behavior. Instructions must contain placeholders appropriate for the evaluation level (e.g., {context}, {available_tools} for SESSION level).
  Note: Evaluators using reference-input placeholders (e.g., {expected_tool_trajectory}, {assertions}, {expected_response}) are only compatible with on-demand evaluation, not online evaluation.
  See Also:
  
  https://docs.aws.amazon.com/bedrock-agentcore/latest/devguide/custom-evaluators.html
- getModelId
  
  @Stability(Stable) @NotNull String getModelId()
  
  The identifier of the Amazon Bedrock model to use for evaluation.
  Accepts standard model IDs (e.g., 'anthropic.claude-sonnet-4-6') and cross-region inference profile IDs with region prefixes (e.g., 'us.anthropic.claude-sonnet-4-6', 'eu.anthropic.claude-sonnet-4-6').
- getRatingScale
  
  @Stability(Stable) @NotNull EvaluatorRatingScale getRatingScale()
  
  The rating scale that defines how the evaluator should score agent performance.
- getAdditionalModelRequestFields
  
  @Stability(Stable) @Nullable default Map<String,Object> getAdditionalModelRequestFields()
  
  Additional model-specific request fields.
  Default: - No additional fields
- getInferenceConfig
  
  @Stability(Stable) @Nullable default EvaluatorInferenceConfig getInferenceConfig()
  
  Optional inference configuration parameters that control model behavior during evaluation.
  When not specified, the foundation model uses its own default values for maxTokens, temperature, and topP.
  Default: - The foundation model's default inference parameters are used
  See Also:
  
  https://docs.aws.amazon.com/bedrock-agentcore/latest/devguide/custom-evaluators.html
- builder
  
  @Stability(Stable) static LlmAsAJudgeOptions.Builder builder()
  
  Returns:
  
  a LlmAsAJudgeOptions.Builder of LlmAsAJudgeOptions

Interface LlmAsAJudgeOptions

Nested Class Summary

Method Summary

Methods inherited from interface software.amazon.jsii.JsiiSerializable

Method Details

getInstructions

getModelId

getRatingScale

getAdditionalModelRequestFields

getInferenceConfig

builder