View a markdown version of this page

OPS08-BP02 Define workload metrics - AWS Well-Architected Framework (2023-04-10)

OPS08-BP02 Define workload metrics

Define metrics that measure the health of the workload. Workload health is measured by the achievement of business outcomes (KPIs) and the state of workload components and applications. Examples of KPIs are abandoned shopping carts, orders placed, cost, price, and allocated workload expense. While you may collect telemetry from multiple components, select a subset that provides insight into the overall workload health. Adjust workload metrics over time as business needs change.

Desired outcome:

  • You have identified metrics that validate the achievement of KPIs that reflect business outcomes.

  • You have metrics that show a consistent view of workload health.

  • Workload metrics are evaluated periodically as business needs change.

Common anti-patterns:

  • You are monitoring all the applications in your workload but are unable to determine if your workload is achieving business outcomes.

  • You have defined workload metrics but they are not associated to any business KPIs.

Benefits of establishing this best practice:

  • You can measure your workload against the achievement of business outcomes.

  • You know if your workload is in a healthy state or needs intervention.

Level of risk exposed if this best practice is not established: High

Implementation guidance

The goal of this best practice is that you can answer the following question: is my workload healthy? Workload health is determined by the achievement of business outcomes and the state of applications and components in the workload. Work backwards from business KPIs to identify metrics. Identify key metrics from components and applications. Periodically review workload metrics as business needs change.

Customer example

Workload health is determined at AnyCompany Retail by a collection of application and component metrics. Starting with business KPIs, they identify metrics like order rate that can show they are achieving business outcomes. They also include key application metrics like page response and component metrics like open database connections. On a quarterly basis, they re-evaluate workload metrics to make sure they are still valid in determining workload health.

Implementation steps

  1. Starting with business KPIs, identify metrics that show you are achieving business outcomes. If there are KPIs that do not have metrics, instrument your workload with additional metrics for any missing business KPIs.

    1. You can publish custom metrics from your applications to Amazon CloudWatch.

    2. The AWS Distro for OpenTelemetry can collect metrics from existing applications and be used to add new metrics.

    3. Customers with Enterprise Support can request the Building a Monitoring Strategy Workshop from their Technical Account Manager. This workshop will help you build an observability strategy for your workload.

  2. Identify metrics for applications and components in the workload. What are key metrics that show the health of individual components and applications? Applications and components may emit many different metrics, but choose one to three key metrics that show their overall health.

  3. Implement a mechanism to evaluate workload metrics periodically. When business KPIs change, work with stakeholders to update workload metrics. As your workload components and applications evolve, adjust your workload metrics.

Level of effort for the implementation plan: Medium. Adding metrics for business KPIs to applications may require moderate effort.

Resources

Related best practices:

Related documents:

Related videos:

Related examples:

Related services: