Builder

class Builder

Properties

Configuration for the grader that evaluates model responses and provides reward signals during reinforcement fine-tuning (RFT).

Hyperparameters that control the reinforcement fine-tuning process, including learning rate, batch size, and epoch count.
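
The two properties above are typically set through a builder before the configuration object is constructed. This page does not show the actual property names or types, so the following is only a minimal sketch of the pattern: every identifier in it (`GraderConfigSketch`, `HyperParametersSketch`, `RftSettingsSketch`, `SketchBuilder`, and all field names) is a hypothetical stand-in, not the library's API.

```kotlin
// Hypothetical stand-in for the grader configuration described above.
data class GraderConfigSketch(val graderId: String)

// Hypothetical stand-in for the hyperparameters described above.
data class HyperParametersSketch(
    val learningRate: Double,
    val batchSize: Int,
    val epochCount: Int,
)

// Hypothetical immutable result produced by the builder.
data class RftSettingsSketch(
    val graderConfig: GraderConfigSketch?,
    val hyperParameters: HyperParametersSketch?,
)

// Hypothetical builder with one mutable property per documented setting.
class SketchBuilder {
    var graderConfig: GraderConfigSketch? = null
    var hyperParameters: HyperParametersSketch? = null

    // Copies the current property values into an immutable settings object.
    fun build(): RftSettingsSketch = RftSettingsSketch(graderConfig, hyperParameters)
}

fun main() {
    // Populate the builder with apply, then freeze it with build().
    val settings = SketchBuilder().apply {
        graderConfig = GraderConfigSketch(graderId = "example-grader")
        hyperParameters = HyperParametersSketch(
            learningRate = 1e-5,
            batchSize = 8,
            epochCount = 2,
        )
    }.build()
    println(settings)
}
```

Both properties are nullable in this sketch so that a caller can leave either one unset; whether the real class permits that is not stated on this page.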

Functions