Best practices for the Amazon Nova Sonic system prompt
Note
This documentation is for Amazon Nova Version 1. For the Amazon Nova 2 Speech-to-Speech prompt engineering guide, visit Voice conversation prompts.
When you craft a system prompt for Amazon Nova Sonic, keep in mind that speech-based interactions differ from text-based ones. Many prompting best practices for Amazon Nova understanding models also apply to Amazon Nova Sonic, but they must be adapted to the dynamics of spoken conversation. For instance, a user's typed response is a reliable source of input. A corresponding spoken interaction, however, may carry limited context and require some back-and-forth to gather more information before the conversation can move forward. Additionally, prompts that encourage long-form outputs can result in a poor experience, because users must spend time listening to find the answer they need.
As outlined in the Amazon Nova prompting guidance, prompt engineering optimizes input to improve model output quality. For Amazon Nova Sonic, these principles must be tailored for conversational speech patterns.
Consider the following details when writing your system prompts:
Clarity and precision
Instructions that are clear in text may need to be reformulated for speech contexts. To prevent mishaps, make sure your prompt tells the model to confirm its understanding with the user before it takes action through tools.
| Task | Traditional prompt | Speech-optimized prompt |
|---|---|---|
| First, verify a user's identity information and reservation number | | |
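As a rough illustration of the confirm-before-acting guidance, the following sketch assembles a speech-oriented system prompt as plain text. The helper name `build_speech_system_prompt`, the role, the tool actions, and the prompt wording are all hypothetical; they are not taken from this guide and are not part of any Amazon Nova Sonic API.

```python
# Hypothetical helper: it only builds a string and calls no Amazon Nova Sonic API.
def build_speech_system_prompt(role: str, tool_actions: list[str]) -> str:
    """Assemble a short, speech-oriented system prompt as plain text."""
    lines = [
        f"You are {role}. Speak in short, natural sentences.",
        "Ask for one piece of information at a time.",
    ]
    for action in tool_actions:
        # Add one explicit confirm-before-acting rule per tool-backed action.
        lines.append(
            f"Before you {action}, repeat the details back to the user "
            "and wait for them to confirm."
        )
    return "\n".join(lines)


# Example values are placeholders chosen for illustration.
prompt = build_speech_system_prompt(
    role="a friendly airline support agent",
    tool_actions=["look up a reservation", "change a booking"],
)
print(prompt)
```

Keeping each confirmation rule as its own sentence makes it easy to review which tool-backed actions are guarded. The exact wording that works best is something to validate through testing.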
Conversational flow
Prioritize natural dialogue flow over formal instructional structures.
| Task | Traditional prompt | Speech-optimized prompt |
|---|---|---|
| Troubleshoot Wi-Fi connectivity issues | | |
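One way to review a draft prompt for formal instructional structure is a simple check that flags numbered steps and bullets. The heuristic below is an assumption of this sketch, not a rule from this guide: it treats lines that open with `1.`, `2)`, `-`, `*`, or `Step 3:` as text-style structure worth rephrasing. The function name and both sample prompts are hypothetical.

```python
import re

# Assumed heuristic: a line that opens with a number, a bullet, or "Step N:"
# reads like a formal, text-style instruction.
_FORMAL_LINE = re.compile(r"^\s*(?:\d+[.)]|[-*•]|step\s+\d+\s*[:.])\s+", re.IGNORECASE)


def formal_structure_lines(prompt: str) -> list[str]:
    """Return the lines of a prompt that look like numbered steps or bullets."""
    return [line.strip() for line in prompt.splitlines() if _FORMAL_LINE.match(line)]


# Both prompts are made-up samples for illustration.
traditional = (
    "Troubleshoot Wi-Fi:\n"
    "1. Ask for the router model.\n"
    "2. Ask the user to restart the router.\n"
    "Step 3: Escalate if the issue persists."
)
conversational = (
    "Help the user fix their Wi-Fi. Start by asking what they see, "
    "then suggest one thing to try at a time."
)

print(formal_structure_lines(traditional))
print(formal_structure_lines(conversational))
```

A flagged line is not necessarily wrong. The check only highlights places where a conversational rewording might be worth trying.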
Memory constraints
Remember that spoken interactions have different memory dynamics compared to text. For example, listeners can't "refer back" to previous text as easily when it's spoken.
| Task | Traditional prompt | Speech-optimized prompt |
|---|---|---|
| Explain a return policy | | |
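Because listeners can't scroll back, it can help to have the system prompt limit how much information the model shares per turn and to ask for recaps. The following sketch shows one possible way to build such a prompt section. The function name `memory_friendly_section`, the instruction wording, and the sample policy details are all hypothetical and invented for illustration.

```python
# Hypothetical helper: it builds prompt text only and calls no Amazon Nova Sonic API.
def memory_friendly_section(topic: str, key_points: list[str], per_turn: int = 2) -> str:
    """Build a prompt section that paces spoken information and asks for recaps."""
    if per_turn < 1:
        raise ValueError("per_turn must be at least 1")
    rules = [
        f"When you explain {topic}, share at most {per_turn} points per turn.",
        "After each turn, briefly recap what you just said and ask whether to continue.",
        "Offer to repeat any point, because the user cannot scroll back.",
    ]
    # List the reference points one per line, without numbering or bullets.
    return "\n".join(rules + ["Key points, one per line:"] + key_points)


# Placeholder policy details, invented for illustration.
section = memory_friendly_section(
    topic="the return policy",
    key_points=[
        "Items can be returned within a set window after delivery.",
        "Items must be unused and in their original packaging.",
        "Refunds go back to the original payment method.",
    ],
)
print(section)
```

The `per_turn` limit is a starting point to tune by listening to real conversations, not a recommended value.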