

# Cost-aware prompting
<a name="gencost03"></a>


| GENCOST03: How do you engineer prompts to optimize cost? | 
| --- | 
|   | 

Prompts are engineered to optimize workload cost as well as workload performance.

**Topics**
+ [GENCOST03-BP01 Optimize prompt token length](gencost03-bp01.md)
+ [GENCOST03-BP02 Control model response length](gencost03-bp02.md)
+ [GENCOST03-BP03 Implement prompt caching to reduce token costs](gencost03-bp03.md)
+ [GENCOST03-BP04 Annotate user input to enable cost-aware content filtering](gencost03-bp04.md)
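
The levers named in the first three topics above (prompt length, response length, and prompt caching) can be related through a rough per-request cost estimate. The following is a minimal sketch, not a definitive implementation: the `TokenPricing` class, the `estimate_request_cost` function, and every rate shown are hypothetical and expressed in abstract cost units, not real prices for any model or provider. It also assumes that cached input tokens are billed at a separate, lower rate, which may not match how your provider bills.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TokenPricing:
    """Hypothetical rates in abstract cost units per 1,000 tokens (not real prices)."""
    input_per_1k: float
    output_per_1k: float
    cached_input_per_1k: float


def estimate_request_cost(
    pricing: TokenPricing,
    input_tokens: int,
    output_tokens: int,
    cached_input_tokens: int = 0,
) -> float:
    """Estimate the cost of one request from its token counts.

    cached_input_tokens is the portion of input_tokens assumed to be
    served from a prompt cache and billed at the cached rate.
    """
    if min(input_tokens, output_tokens, cached_input_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_input_tokens > input_tokens:
        raise ValueError("cached_input_tokens cannot exceed input_tokens")

    uncached_tokens = input_tokens - cached_input_tokens
    total = (
        uncached_tokens * pricing.input_per_1k
        + cached_input_tokens * pricing.cached_input_per_1k
        + output_tokens * pricing.output_per_1k
    )
    return total / 1000


# Placeholder rates chosen only to make the arithmetic easy to follow.
pricing = TokenPricing(input_per_1k=1.0, output_per_1k=4.0, cached_input_per_1k=0.1)

# Baseline: long prompt, unconstrained response, no caching.
baseline = estimate_request_cost(pricing, input_tokens=2000, output_tokens=500)

# Optimized: shorter prompt, capped response, shared prefix served from cache.
optimized = estimate_request_cost(
    pricing, input_tokens=1200, output_tokens=300, cached_input_tokens=800
)

print(f"baseline={baseline:.2f} optimized={optimized:.2f}")
```

Substituting your provider's published rates and token counts measured from real traffic turns this into a quick way to compare prompt variants before committing to one.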