AWS services or capabilities described in AWS Documentation may vary by region/location. Click Getting Started with Amazon AWS to see specific differences applicable to the China (Beijing) Region.
Hyperparameters for controlling the reinforcement fine-tuning training process, including learning settings and evaluation intervals.
Namespace: Amazon.Bedrock.Model
Assembly: AWSSDK.Bedrock.dll
Version: 3.x.y.z
public class RFTHyperParameters
The RFTHyperParameters type exposes the following members
| Name | Description | |
|---|---|---|
|
RFTHyperParameters() |
| Name | Type | Description | |
|---|---|---|---|
|
BatchSize | System.Int32 |
Gets and sets the property BatchSize. Number of training samples processed in each batch during reinforcement fine-tuning (RFT) training. Larger batches may improve training stability. |
|
EpochCount | System.Int32 |
Gets and sets the property EpochCount. Number of training epochs to run during reinforcement fine-tuning. Higher values may improve performance but increase training time. |
|
EvalInterval | System.Int32 |
Gets and sets the property EvalInterval. Interval between evaluation runs during RFT training, measured in training steps. More frequent evaluation provides better monitoring. |
|
InferenceMaxTokens | System.Int32 |
Gets and sets the property InferenceMaxTokens. Maximum number of tokens the model can generate in response to each prompt during RFT training. |
|
LearningRate | System.Single |
Gets and sets the property LearningRate. Learning rate for the reinforcement fine-tuning. Controls how quickly the model adapts to reward signals. |
|
MaxPromptLength | System.Int32 |
Gets and sets the property MaxPromptLength. Maximum length of input prompts during RFT training, measured in tokens. Longer prompts allow more context but increase memory usage and training-time. |
|
ReasoningEffort | Amazon.Bedrock.ReasoningEffort |
Gets and sets the property ReasoningEffort. Level of reasoning effort applied during RFT training. Higher values may improve response quality but increase training time. |
|
TrainingSamplePerPrompt | System.Int32 |
Gets and sets the property TrainingSamplePerPrompt. Number of response samples generated per prompt during RFT training. More samples provide better reward signal estimation. |
.NET:
Supported in: 8.0 and newer, Core 3.1
.NET Standard:
Supported in: 2.0
.NET Framework:
Supported in: 4.5 and newer, 3.5