기계 번역으로 제공되는 번역입니다. 제공된 번역과 원본 영어의 내용이 상충하는 경우에는 영어 버전이 우선합니다.

# CloudWatch 지표를 사용하여 Amazon Managed Service for Prometheus 리소스 모니터링
<a name="AMP-CW-usage-metrics"></a>

Amazon Managed Service for Prometheus는 CloudWatch에 사용량 지표를 제공합니다. 이러한 지표는 워크스페이스 사용률에 대한 가시성을 제공합니다. 판매 지표는 CloudWatch의 `AWS/Usage` 및 `AWS/Prometheus` 네임스페이스에서 찾을 수 있습니다. 이러한 지표는 CloudWatch에서 무료로 사용할 수 있습니다. 사용량 지표에 대한 자세한 내용은 [CloudWatch 사용량 지표](https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/CloudWatch-Usage-Metrics.html)를 참조하세요.


| CloudWatch 지표 명칭 | 리소스 이름 | CloudWatch 네임스페이스 | 설명 | 
| --- | --- | --- | --- | 
| ResourceCount\* | CreateAlertManagerAlertsTPS | `AWS/Usage` | 워크스페이스별로 초당 수행할 수 있는 최대 `CreateAlertManagerAlerts` API 작업 수입니다. | 
| ResourceCount\* | DeleteAlertManagerSilencesTPS | `AWS/Usage` | 워크스페이스별로 초당 수행할 수 있는 최대 `DeleteAlertManagerSilences` API 작업 수입니다. | 
| ResourceCount\* | GetAlertManagerSilenceTPS | `AWS/Usage` | 워크스페이스별로 초당 수행할 수 있는 최대 `GetAlertManagerSilence` API 작업 수입니다. | 
| ResourceCount\* | GetAlertManagerStatusTPS | `AWS/Usage` | 워크스페이스별로 초당 수행할 수 있는 최대 `GetAlertManagerStatus` API 작업 수입니다. | 
| ResourceCount\* | GetLabelsTPS | `AWS/Usage` | 워크스페이스별로 초당 수행할 수 있는 최대 `GetLabels` API 작업 수입니다. | 
| ResourceCount\* | GetMetricMetadataTPS | `AWS/Usage` | 워크스페이스별로 초당 수행할 수 있는 최대 `GetMetricMetadata` API 작업 수입니다. | 
| ResourceCount\* | GetSeriesTPS | `AWS/Usage` | 워크스페이스별로 초당 수행할 수 있는 최대 `GetSeries` API 작업 수입니다. | 
| ResourceCount | InhibitionRulesInAlertManagerDefinition | `AWS/Usage` | 알림 관리자 정의 파일의 최대 금지 규칙 수입니다. | 
| ResourceCount\* | ListAlertManagerAlertGroupInfosTPS | `AWS/Usage` | 워크스페이스별로 초당 수행할 수 있는 최대 `ListAlertManagerAlertGroupInfos` API 작업 수입니다. | 
| ResourceCount\* | ListAlertManagerAlertGroupsTPS | `AWS/Usage` | 워크스페이스별로 초당 수행할 수 있는 최대 `ListAlertManagerAlertGroups` API 작업 수입니다. | 
| ResourceCount\* | ListAlertManagerAlertsTPS | `AWS/Usage` | 워크스페이스별로 초당 수행할 수 있는 최대 `ListAlertManagerAlerts` API 작업 수입니다. | 
| ResourceCount\* | ListAlertManagerReceiversTPS | `AWS/Usage` | 워크스페이스별로 초당 수행할 수 있는 최대 `ListAlertManagerReceivers` API 작업 수입니다. | 
| ResourceCount\* | ListAlertManagerSilencesTPS | `AWS/Usage` | 워크스페이스별로 초당 수행할 수 있는 최대 `ListAlertManagerSilences` API 작업 수입니다. | 
| ResourceCount\* | ListAlertsTPS | `AWS/Usage` | 워크스페이스별로 초당 수행할 수 있는 최대 `ListAlerts` API 작업 수입니다. | 
| ResourceCount\* | ListRulesTPS | `AWS/Usage` | 워크스페이스별로 초당 수행할 수 있는 최대 `ListRules` API 작업 수입니다. | 
| ResourceCount\* | PutAlertManagerSilencesTPS | `AWS/Usage` | 워크스페이스별로 초당 수행할 수 있는 최대 `PutAlertManagerSilences` API 작업 수입니다. | 
| ResourceCount | HAReplicaGroupCount | `AWS/Usage` | 고가용성 복제본 그룹 수 | 
| ResourceCount\* | QueryMetricsTPS | `AWS/Usage` | 초당 쿼리 작업 수 | 
| ResourceCount\* | RemoteWriteTPS | `AWS/Usage` | 초당 원격 쓰기 작업 수 | 
| ResourceCount | ActiveAlerts | `AWS/Usage` | 워크스페이스당 활성 알림 수<br />단위: 개<br />유효한 통계: 평균, 최소, 최대 | 
| ResourceCount | ActiveSeries | `AWS/Usage` | 워크스페이스당 활성 시리즈 수<br />단위: 개<br />유효한 통계: 평균, 최소, 최대 | 
| ResourceCount | AlertAggregationGroupSize | `AWS/Usage` | 알림 관리자 정의 파일에 있는 알림 집계 그룹의 최대 크기입니다. `group_by`의 각 레이블 값 조합으로 집계 그룹이 생성됩니다. | 
| ResourceCount | AlertManagerDefinitionSizeBytes | `AWS/Usage` | 알림 관리자 정의 파일의 최대 크기(바이트)입니다. | 
| ResourceCount | AllSilences | `AWS/Usage` | 워크스페이스당 최대 무음 수(만료, 활성 및 보류 중인 무음 포함)입니다. | 
| ResourceCount | IngestionRate | `AWS/Usage` | 샘플 수집 속도<br />단위: 초당 개수<br />유효한 통계: 평균, 최소, 최대 | 
| ResourceCount | RuleEvaluationInterval | `AWS/Usage` | 최소 규칙 평가 간격입니다. | 
| ResourceCount | RuleGroupNamespaceDefinitionSizeBytes | `AWS/Usage` | 규칙 그룹 네임스페이스 정의 파일의 최대 크기(바이트)입니다. | 
| ResourceCount | TemplatesInAlertManagerDefinition | `AWS/Usage` | 알림 관리자 정의 파일의 최대 템플릿 수입니다. | 
| ResourceCount | WorkspaceCount | `AWS/Usage` | 계정당 리전별 최대 워크스페이스 수입니다. | 
| ResourceCount | SizeOfAlerts | `AWS/Usage` | 워크스페이스의 모든 알림의 총 크기, 바이트<br />단위: 바이트<br />유효한 통계: 평균, 최소, 최대 | 
| ResourceCount | SuppressedAlerts | `AWS/Usage` | WorkSpace당 숨김 상태 알림 수 알림은 무음 또는 금지로 억제할 수 있습니다.<br />단위: 개<br />유효한 통계: 평균, 최소, 최대 | 
| ResourceCount | UnprocessedAlerts | `AWS/Usage` | WorkSpace당 처리되지 않은 상태인 알림의 수 AlertManager에서 알림을 수신하면 해당 경고는 처리되지 않은 상태가 되지만 다음 집계 그룹 평가를 기다리고 있습니다.<br />단위: 개<br />유효한 통계: 평균, 최소, 최대 | 
| ResourceCount | AllAlerts | `AWS/Usage` | 워크스페이스당 모든 상태의 알림 수<br />단위: 개<br />유효한 통계: 평균, 최소, 최대 | 
| ResourceCount | AllRules | `AWS/Usage` | 워크스페이스당 모든 상태의 규칙 수<br />단위: 개<br />유효한 통계: 평균, 최소, 최대 | 
| ActiveSeriesPerLabelSet |  - | `AWS/Prometheus` | 각 사용자 정의 레이블 세트의 현재 활성 시리즈 사용량입니다.<br />단위: 개<br />유효한 통계: Average, Minimum, Maximum, Sum | 
| ActiveSeriesLimitPerLabelSet |  - | `AWS/Prometheus` | 각 사용자 정의 레이블 세트의 현재 활성 시리즈 제한 값입니다.<br />단위: 개<br />유효한 통계: Average, Minimum, Maximum, Sum | 
| AlertManagerAlertsReceived |  - | `AWS/Prometheus` | 알림 관리자가 수신한 총 성공 알림 수<br />단위: 개<br />유효한 통계: Average, Minimum, Maximum, Sum | 
| AlertManagerNotificationsFailed |  - | `AWS/Prometheus` | 실패한 알림 전송 수<br />단위: 개<br />유효한 통계: Average, Minimum, Maximum, Sum | 
| AlertManagerNotificationsThrottled |  - | `AWS/Prometheus` | 병목 현상이 발생한 알림 수<br />단위: 개<br />유효한 통계: Average, Minimum, Maximum, Sum | 
| AnomalyDetectors | WorkspaceId | `AWS/Prometheus` | 지정된 워크스페이스에 대한 총 이상 탐지기 수<br />단위: 개<br />유효한 통계: 평균, 최소, 최대 | 
| AnomalyDetectorEvaluations | WorkspaceId, AnomalyDetectorId | `AWS/Prometheus` | 총 이상 탐지기 평가 수<br />단위: 개<br />유효한 통계: Average, Minimum, Maximum, Sum | 
| AnomalyDetectorEvaluationFailures | WorkspaceId, AnomalyDetectorId | `AWS/Prometheus` | 간격 내 이상 탐지기 실패 횟수<br />단위: 개<br />유효한 통계: Average, Minimum, Maximum, Sum | 
| AnomalyDetectorLastEvaluationDuration | WorkspaceId, AnomalyDetectorId | `AWS/Prometheus` | 이상 탐지기의 마지막 평가 기간<br />단위: 초<br />유효한 통계: Average, Minimum, Maximum, Sum | 
| AnomalyDetectorMissedEvaluations | WorkspaceId, AnomalyDetectorId | `AWS/Prometheus` | 간격 동안 누락된 이상 탐지기 평가 횟수<br />단위: 개<br />유효한 통계: Average, Minimum, Maximum, Sum | 
| DiscardedSamples\*\* |  - | `AWS/Prometheus` | 이유별 폐기된 샘플 수<br />단위: 개<br />유효한 통계: Average, Minimum, Maximum, Sum | 
| DiscardedSeries\*\* |  - | `AWS/Prometheus` | 이유별 폐기된 샘플이 포함된 시리즈 수<br />단위: 개<br />유효한 통계: Average, Minimum, Maximum, Sum | 
| DiscardedSamplesPerLabelSet |  - | `AWS/Prometheus` | 사용자 정의 레이블 세트별 폐기된 샘플 수<br />단위: 개<br />유효한 통계: Average, Minimum, Maximum, Sum | 
| DiscardedSeriesPerLabelSet |  - | `AWS/Prometheus` | 각 사용자 정의 레이블 세트에 대해 폐기된 샘플을 포함하는 시리즈 수<br />단위: 개<br />유효한 통계: Average, Minimum, Maximum, Sum | 
| IngestionRatePerLabelSet |  - | `AWS/Prometheus` | 사용자 정의 레이블 세트별 수집 속도<br />단위: 개<br />유효한 통계: Average, Minimum, Maximum, Sum | 
| QuerySamplesProcessed |  - | `AWS/Prometheus` | 처리된 쿼리 샘플 수<br />단위: 개<br />유효한 통계: Average, Minimum, Maximum, Sum | 
| RuleEvaluations |  - | `AWS/Prometheus` | 총 규칙 평가 수<br />단위: 개<br />유효한 통계: Average, Minimum, Maximum, Sum | 
| RuleEvaluationFailures |  - | `AWS/Prometheus` | 해당 간격 내의 규칙 평가 실패 횟수<br />단위: 개<br />유효한 통계: Average, Minimum, Maximum, Sum | 
| RuleGroupIterationsMissed |  - | `AWS/Prometheus` | 해당 간격 동안 누락된 규칙 그룹 반복 횟수<br />단위: 개<br />유효한 통계: Average, Minimum, Maximum, Sum | 
| RuleGroupLastEvaluationDuration |  - | `AWS/Prometheus` | 규칙 그룹의 마지막 평가 기간<br />단위: 초<br />유효한 통계: Average, Minimum, Maximum, Sum | 

\*TPS 지표는 매분 생성되며 해당 1분 동안의 초당 평균입니다. 짧은 버스트 기간은 TPS 지표에 반영되지 않습니다.

\*\*샘플이 폐기되는 몇 가지 이유는 다음과 같습니다. 아래의 모든 이유가 DiscardedSeries 지표에 표시되는 것은 아닙니다.


|  이유  |  의미  | 
| --- | --- | 
| greater\_than\_max\_sample\_age | 1시간이 지난 샘플은 폐기합니다. | 
| new-value-for-timestamp | 이전 샘플과 동일한 타임스탬프를 사용하지만 다른 값을 가진 중복된 샘플이 전송되었습니다. | 
| per\_labelset\_series\_limit | 레이블 세트당 총 활성 시리즈 수 제한에 도달했습니다. | 
| per\_metric\_series\_limit | 지표별 활성 시리즈 제한에 도달했습니다. | 
| per\_user\_series\_limit | 총 활성 시리즈 수 제한에 도달했습니다. | 
| rate\_limited | 수집 속도가 제한되었습니다. | 
| sample-out-of-order | 샘플이 잘못된 순서로 전송되어 처리할 수 없습니다. | 
| label\_value\_too\_long | 레이블 값이 허용된 문자 제한보다 깁니다. | 
| max\_label\_names\_per\_series | 지표별 레이블 이름에 도달했습니다. | 
| missing\_metric\_name | 지표 이름은 제공되지 않습니다. | 
| metric\_name\_invalid | 잘못된 지표 이름이 제공되었습니다. | 
| label\_invalid | 잘못된 레이블이 제공되었습니다. | 
| duplicate\_label\_names | 중복된 레이블 이름이 제공되었습니다. | 

**참고**  
존재하지 않거나 누락된 지표는 해당 지표의 값이 0인 것과 같습니다.

**참고**  
`RuleGroupIterationsMissed`, `RuleEvaluations`, `RuleEvaluationFailures`, `RuleGroupLastEvaluationDuration`에는 다음과 같은 구조의 `RuleGroup` 차원이 있습니다.  
{{RuleGroupNamespace}}, {{RuleGroup}}

## Prometheus 판매 지표에 CloudWatch 경보 설정
<a name="AMP-CW-examples"></a>

CloudWatch 경보를 사용하여 Prometheus 리소스 사용을 모니터링할 수 있습니다.

**Prometheus의 **ActiveSeries** 수에 대한 경보를 설정하려면**

1. **그래프로 표시된 지표** 탭을 선택하고 **ActiveSeries** 레이블이 나올 때까지 아래로 스크롤합니다.

   **그래프로 표시된 지표** 보기에서는 현재 수집 중인 지표만 표시됩니다.

1. **작업** 열에서 **알림** 아이콘을 선택합니다.

1. **지표 및 조건 지정**에서 **조건 값** 필드에 임곗값 조건을 입력하고 **다음**을 선택합니다.

1. **작업 구성**에서 기존 SNS 주제를 선택하거나 알림을 보낼 새 SNS 주제를 생성합니다.

1. **이름 및 설명 추가**에서 경보 이름과 설명(선택 사항)을 추가합니다.

1. **경보 생성**을 선택하세요.