graderConfig

Configuration for the grader that evaluates model responses and provides the reward signal during reinforcement fine-tuning (RFT).
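As a rough illustration of the role a grader plays, the sketch below scores each model response against a reference answer and returns a numeric reward. It does not show the actual schema of graderConfig or any real service API. The function name `exact_match_grader`, its parameters, and the 0.0 to 1.0 reward range are all assumptions made for this example.

```python
def exact_match_grader(response: str, reference: str) -> float:
    """Hypothetical grader: reward 1.0 for a normalized exact match, else 0.0."""

    def normalize(text: str) -> str:
        # Lowercase and collapse whitespace so trivial formatting
        # differences do not change the reward.
        return " ".join(text.lower().split())

    return 1.0 if normalize(response) == normalize(reference) else 0.0


# Score a small batch of candidate responses against one reference answer.
candidates = ["paris", " Paris ", "London"]
rewards = [exact_match_grader(c, "Paris") for c in candidates]
```

In an RFT loop, rewards like these would steer training toward responses the grader scores highly. Graders used in practice are usually more nuanced than exact matching.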