GraderConfig

sealed class GraderConfig

Configuration for the grader used in reinforcement fine-tuning to evaluate model responses and provide reward signals.

Inheritors

Types

Link copied to clipboard

Configuration for using an AWS Lambda function as the grader for evaluating model responses and provide reward signals in reinforcement fine-tuning.

Link copied to clipboard

Functions

Link copied to clipboard
Link copied to clipboard