ModelSpeculativeDecodingConfig
Settings for the model speculative decoding technique that's applied by a model optimization job.
Types
Properties
The speculative decoding technique to apply during model optimization.
The location of the training data to use for speculative decoding. The data must be formatted as ShareGPT, OpenAI Completions, or OpenAI Chat Completions. The input can also be unencrypted captured data from a SageMaker endpoint, as long as the endpoint uses one of these formats.
Functions
inline fun copy(block: ModelSpeculativeDecodingConfig.Builder.() -> Unit = {}): ModelSpeculativeDecodingConfig
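The `copy` signature above takes a builder lambda with an empty default, which suggests the usual copy-and-override pattern: start from the current values and change only what the block sets. The sketch below illustrates that pattern with a small stand-in class so it runs without the SDK. `SpeculativeDecodingConfigSketch`, its `technique` and `trainingDataUri` properties, `toBuilder()`, and the string values are all hypothetical; they are not taken from this page and may not match the real class.

```kotlin
// Hypothetical stand-in, NOT the real SDK type. Property names and types
// are assumptions made only to illustrate the builder-lambda copy pattern.
class SpeculativeDecodingConfigSketch private constructor(builder: Builder) {
    val technique: String? = builder.technique
    val trainingDataUri: String? = builder.trainingDataUri

    class Builder {
        var technique: String? = null
        var trainingDataUri: String? = null
        fun build(): SpeculativeDecodingConfigSketch = SpeculativeDecodingConfigSketch(this)
    }

    // Seeds a new builder with the current values.
    fun toBuilder(): Builder = Builder().also {
        it.technique = technique
        it.trainingDataUri = trainingDataUri
    }

    // Same shape as the documented signature: the block overrides selected
    // fields, and an empty block yields an equivalent copy.
    inline fun copy(block: Builder.() -> Unit = {}): SpeculativeDecodingConfigSketch =
        toBuilder().apply(block).build()
}

fun main() {
    val original = SpeculativeDecodingConfigSketch.Builder().apply {
        technique = "EXAMPLE_TECHNIQUE"                    // placeholder value
        trainingDataUri = "s3://example-bucket/train-v1/"  // placeholder location
    }.build()

    // Override only the training data location; technique carries over.
    val updated = original.copy { trainingDataUri = "s3://example-bucket/train-v2/" }

    println(updated.technique)
    println(updated.trainingDataUri)
    println(original.trainingDataUri) // the original instance is left unchanged
}
```

Because the stand-in exposes only read-only properties and `copy` always builds a new instance, existing references to the original configuration are unaffected by the override.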