Dashboard organization and usage - Amazon GameLift Servers

Dashboard organization and usage

View your metrics on comprehensive dashboards in Amazon Managed Grafana. The available dashboards depend on your fleet type:

Dashboard availability by fleet type

The following table shows which dashboards are available for each fleet type:

Dashboard Fleet Type Description
EC2 Fleet Overview EC2 Fleet Displays information on concurrent players (CCU), instances and player capacity
Instances Overview EC2 Fleet Displays average CPU, memory, and network utilization across all fleet instances
Instance Performance EC2 Fleet Displays detailed metrics (CPU, memory, disk, network) for an individual instance
Container Fleet Overview Container Fleet Displays average resource utilization of all containers in a managed container fleet
Container Performance Container Fleet Displays detailed metrics of individual containers within a specific ECS task
Server Performance Both Displays the network, memory and runtime performance of a specified game server process (separate versions for EC2 and Container fleets)

Managed EC2 Fleets:

  • EC2 Fleet Overview provides high-level fleet capacity and scaling insights.

  • Use Instances Overview and Instance Performance dashboards for host-level monitoring.

  • Metrics collected via hostmetrics receiver for system-level visibility.

  • Focus on EC2 instance resource utilization and performance.

  • Server Performance (EC2) monitors game server application metrics independent of underlying infrastructure.

Managed Container Fleets:

  • Use Container Fleet Overview and Container Performance dashboards for ECS task and container-level monitoring.

  • Metrics collected via ECS Container Receiver for containerized workload visibility.

  • Focus on task-level aggregation and container resource isolation.

  • Server Performance (Container) monitors game server application metrics independent of underlying infrastructure.

EC2 Fleet Overview dashboard

This dashboard provides a high-level overview of your fleet's utilization and capacity globally and by location. It features graphs showing counts for game server stops, starts, and crashes, as well as the percentage of healthy game servers. You can filter by FleetID and Location.

Fleet Overview metrics

The following table shows the metrics available on the Fleet Overview dashboard:

Dashboard Metric name Definition
Global CCU and Capacity CCU Shows the number of concurrent users in all the game servers over all the instances in all locations
Global CCU Shows the number of concurrent users in all the game servers over all the instances globally
Active Instances Shows the total number of instances in the fleet that are active
Active Processes Shows the number of active game server processes that are ready to host a game session
Game Server Starts Shows the number of game sessions that started across the fleet
Global CCU and Capacity Healthy Game Servers Shows the average percentage of game servers that report healthy to Amazon GameLift Servers across the fleet
Crashed Game Sessions Shows the game session IDs of the crashed game sessions. Click the link of game session ARN to navigate to the Server Performance dashboard
Location CCU Location Players Shows the number of concurrent players in a location, including all the game servers over all the instances in the location
Location Capacity Shows capacity utilization (%) in a location, and the percentage of game servers in use in the location
Note

CCU metrics require implementation in your game server code. These metrics are not automatically collected and must be implemented and reported by your application.

Instances Overview dashboard

This dashboard provides aggregated host-level metrics across all instances in your fleet. Current averages show overall health of the instances. When performance degrades, check CPU usage, memory consumption, network and disk consumption for bottlenecks. You can filter by FleetID and Location.

Instances Overview metrics

The following table shows the metrics available on the Instances Overview dashboard:

Dashboard Metric name Definition
Instance Summary Average CPU Usage Instance summary tracks average resource usage by instances. Average percentage of CPU resources being used across all fleet instances. High utilization patterns require checking individual instance metrics and resource distribution
Peak CPU Usage Highest CPU utilization recorded across any instance in the fleet during the time period
CPU Usage by Instance Individual CPU utilization for each instance in the fleet, enabling identification of resource hotspots
Instance Summary Average Memory Usage Average percentage of RAM being used across all fleet instances. High utilization patterns require checking individual instance metrics and resource distribution like memory consumption
Peak Memory Usage Highest memory utilization recorded across any instance in the fleet during the time period
Memory Usage by Instance Individual memory utilization for each instance in the fleet, helping identify memory-intensive workloads
Network Summary Network I/O (Bytes) Network summary reflects average instance connectivity. Average network traffic volume (sent and received) across all fleet instances. For latency or connectivity issues, check individual instance metrics like Network I/O and Packet rates
Network I/O (Packets) Average network packet rate (sent and received) across all fleet instances
Network Usage by Instance Individual network utilization for each instance, useful for identifying network bottlenecks
Disk Summary Disk I/O Operations Disk summary indicates average disk performance of the instances. Average disk read/write operations across all fleet instances. Slow response times suggest examining Disk operations, I/O wait times, and Pending operations at the individual instance level
Disk I/O Throughput Average disk read/write throughput across all fleet instances
Disk Usage by Instance Individual disk utilization for each instance, helping identify storage performance issues
Note

Instance-level metrics are collected via the hostmetrics receiver and provide system-level visibility into your fleet's infrastructure performance. Use this dashboard to identify overall fleet health trends and drill down to individual instances when performance issues are detected.

Instance Performance dashboard

This dashboard provides detailed performance metrics for individual instances. Current averages show overall instance health. When performance degrades, check CPU usage, memory consumption, and file system consumption for bottlenecks. It features "Top N Memory Consuming Game Sessions" and "Top N CPU Consuming Game Sessions" tables that help identify which processes contribute the most to instance resource consumption. Clicking on Game Session links enables deeper investigation of detailed metrics. You can filter by specific Instance ID.

Instance Performance metrics

The following table shows the metrics available on the Instance Performance dashboard:

Dashboard Metric name Definition
Load Summary Top N Memory Consuming Game Sessions Load summary tracks resource usage by game servers. Ranked list of game sessions consuming the most memory resources on this instance. High utilization patterns require checking individual session metrics and resource distribution. Clicking on Game Session links enables deeper investigation of detailed metrics
Top N CPU Consuming Game Sessions Ranked list of game sessions consuming the most CPU resources on this instance. High utilization patterns require checking individual session metrics and resource distribution like CPU and memory. Clicking on Game Session links enables deeper investigation of detailed metrics
CPU Usage per Game Session Individual CPU utilization breakdown showing resource consumption by each active game session
Memory Usage per Game Session Individual memory utilization breakdown showing resource consumption by each active game session
Current Averages Instance CPU Usage Overall CPU utilization for the selected instance over time
Instance Memory Usage Overall memory utilization for the selected instance over time
Instance File System Usage File system consumption for the selected instance, useful for identifying storage capacity issues
Network Instance Network I/O Network reflects instance connectivity. Network traffic volume and packet rates for the selected instance. For latency or connectivity issues, investigate Network I/O, Packet rates, and Error counts
Network Connections Number of active network connections on the selected instance
Network Error Counts Network error statistics for identifying connectivity problems
Disk Disk I/O Operations Disk indicates disk performance. Disk read/write activity and utilization for the selected instance. Slow response times suggest examining Disk operations, I/O wait times, and Pending operations
Disk I/O Wait Times Time spent waiting for disk operations to complete
Pending Disk Operations Number of disk operations waiting to be processed
Note

The Top N Memory and CPU Consuming Game Sessions tables are essential for identifying performance bottlenecks and resource-intensive processes that may impact overall instance performance. These rankings enable quick identification of problematic game sessions for further investigation.

Container Fleet Overview dashboard

This dashboard provides a high-level overview of your container fleet's resource utilization and capacity. It displays average resource utilization of all containers in a managed container fleet, including CPU reservation, memory utilization, and container group status. You can filter by FleetID and Location.

Container Fleet Overview metrics

The following table shows the metrics available on the Container Fleet Overview dashboard:

Dashboard Metric name Definition
Container Group Status Active Game Server Container Groups Container groups that are currently active and ready to host game sessions
Idle Game Server Container Groups Container groups that are active but not currently hosting game sessions
Pending Game Server Container Groups Container groups that are in the process of starting up
Terminating Game Server Container Groups Container groups that are in the process of shutting down
Resource Utilization Container CPU Utilization Average CPU utilization across all containers in the fleet
Container Memory Utilization Average memory utilization across all containers in the fleet
Container CPU Reservation Percentage of CPU resources reserved by containers across the fleet
Network Activity Container Network In Average network bytes received by containers across the fleet
Container Network Out Average network bytes sent by containers across the fleet
Note

Container fleet metrics are collected via ECS Container Receiver and provide containerized workload visibility with focus on task-level aggregation and container resource isolation.

Container Performance dashboard

This dashboard provides detailed performance metrics for individual containers within specific ECS tasks. It displays detailed metrics of individual containers including CPU utilization, memory usage, network I/O, and storage performance. You can filter by specific Container ID or ECS Task.

Container Performance metrics

The following table shows the metrics available on the Container Performance dashboard:

Dashboard Metric name Definition
Resource Performance Container CPU Utilization CPU utilization for the selected container over time
Container Memory Utilization Memory utilization for the selected container over time
Container Memory Reservation Percentage of memory resources reserved by the selected container
Network Performance Container Network I/O Network traffic volume (bytes in/out) for the selected container
Network Connections Number of active network connections for the selected container
Storage Performance Container Storage Read/Write Storage read and write activity for the selected container
Container Storage I/O Operations Number of storage I/O operations performed by the selected container
Note

Container performance metrics provide detailed visibility into individual container resource consumption and performance characteristics within ECS tasks.

Server Performance dashboard

The Server Performance dashboard shows metrics related to server timings, network activity, memory, and CPU usage for individual game sessions. You can filter by Game Session ID and export metrics directly to Amazon CloudWatch or Amazon Managed Grafana.

Server Performance metrics

The following table shows the metrics available on the Server Performance dashboard:

Dashboard Metric name Definition
Server Timings Server Delta Time Difference in time between the current server tick and the previous server tick. It's a measure of the consistency of the server's tick rate
Server Tick Rate Shows the number of times per second the server is processing updates
Server Tick Time The amount of time it takes for the server to process a single tick or update
Server World Tick Time The amount of time it takes for the server to update the game world with each tick
Network Connections The total number of network connections the server has established
Network I/O (Bytes) The total number of bytes being sent and received by the server over the network
Network I/O (Packets) The total number of network packets being sent and received by the server
Packet Loss The percentage of network packets that are being lost during transmission
Memory Memory Usage (Units) The total amount of memory being consumed by the server processes
Physical Memory Usage (%) The percentage of the server's total physical memory that is currently being utilized
CPU Usage CPU Usage (%) The percentage of CPU resources being utilized by the game server process