Customizing Amazon Nova models on SageMaker AI
Note
This documentation is for Amazon Nova Version 1. Amazon Nova 2 is now available with new models and enhanced capabilities. For information on how to customize Amazon Nova 2, visit Customizing Amazon Nova 2 models.
You can customize Amazon Nova models, including the enhanced Amazon Nova 2.0 models, through recipes and train them on SageMaker. These recipes support techniques such as supervised fine-tuning (SFT), Direct Preference Optimization (DPO), and Reinforcement Fine-Tuning (RFT), with both full-rank and low-rank adaptation (LoRA) options.
The end-to-end customization workflow involves stages like model training, model evaluation, and deployment for inference. This model customization approach on SageMaker provides greater flexibility and control to fine-tune its supported Amazon Nova models, optimize hyperparameters with precision, and implement techniques such as LoRA parameter-efficient fine-tuning (PEFT), full-rank SFT, DPO, RFT, Continued Pre-Training (CPT), Proximal Policy Optimization (PPO), etc.
SageMaker offers two environments for customizing Amazon Nova models.
-
SageMaker training jobs provides a fully managed environment for customizing Amazon Nova models where you don't need to create or maintain any clusters. The service automatically handles all infrastructure provisioning, scaling, and resource management, allowing you to focus solely on configuring your training parameters and submitting your job. You can customize Nova models on SageMaker training jobs with techniques like Parameter Efficient Fine-tuning (PEFT), Full rank fine tuning, Direct Preference Optimization (DPO), and Reinforcement Fine-Tuning (RFT). For more information, see Amazon Nova customization on SageMaker Training Jobs.
Note
If you provide a KMS key to your Amazon Nova model customization training job for encryption in the Amazon-owned output S3 bucket:
-
You must provide the same KMS key when calling subsequent iterative training jobs, or when calling the Amazon Bedrock CreateCustomModel API leveraging the encrypted model.
-
The identity calling the
CreateTrainingJobAPI (rather than the execution role) must have permissions toCreateGrant,RetireGrant,Encrypt, andGenerateDataKeyas defined in KMS key policy.
-
-
SageMaker HyperPod offers a specialized environment to train Amazon Nova models by requiring you to create and manage EKS clusters with restricted instance groups (RIGs). This environment gives you flexibility in configuring your training environment with specialized GPU instances and integrated Amazon FSx for Lustre storage, making it particularly well-suited for advanced distributed training scenarios and ongoing model development. For more information, see Amazon Nova customization on SageMaker HyperPod .