instructions

The evaluation instructions that guide the language model in assessing agent performance, including criteria and evaluation guidelines.