Difference between revisions of "PEC-OU/Current/CXCPEGuide/LMMetrics"

Latest revision as of 14:23, February 7, 2022

This topic is part of the manual Outbound (CX Contact) Private Edition Guide for version Current of Outbound (CX Contact).

Metrics[edit source]

Metric and description	Metric details	Indicator of
cxc_list_manager_executed_jobs_count Total executed jobs count.	Unit: Type: Counter Label: n/a Sample value: 42
cxc_list_manager_running_jobs_count Running jobs count.	Unit: Type: Gauge Label: n/a Sample value: 4.2
cxc_list_manager_rejected_jobs_count Rejected jobs count.	Unit: Type: Counter Label: n/a Sample value: 42
cxc_list_manager_jobs_duration Job duration, in milliseconds.	Unit: Type: Histogram Label: n/a Sample value: [1, 2, 3]
cxc_list_manager_responses_summary Response time, in milliseconds.	Unit: Type: Summary Label: "'method', 'path', 'status'" Sample value: 42
cxc_list_manager_healthy_instance Healthy instance.	Unit: Type: Gauge Label: n/a Sample value: 4.2
cxc_list_manager_downloaded_compliance_files_count Count of downloaded compliance files.	Unit: Type: Counter Label: n/a Sample value: 42
cxc_list_manager_contacts_lists_created_count Count of created Contacts Lists.	Unit: Type: Counter Label: "'ccid','tenant_name'" Sample value: 42
cxc_list_manager_import_contacts_requests_processed_count Count of created Contacts Lists.	Unit: Type: Counter Label: "'ccid','tenant_name'" Sample value: 42
cxc_list_manager_import_contacts_requests_failed_count Count of created Contacts Lists.	Unit: Type: Counter Label: "'ccid','tenant_name'" Sample value: 42

Alerts[edit source]

The following alerts are defined for List Manager.

Alert	Severity	Description	Threshold
CXC-LM-LatencyHigh	HIGH	Triggered when the latency for list manager is above the defined threshold	5000ms for 5m
cxc_list_manager_too_many_errors_from_auth	HIGH	Triggered when there are too many error responses from the auth service (list manager) for more than the specified time threshold.	1m
CXC-CPUUsage	HIGH	Triggered when the CPU utilization of a pod is beyond the threshold	300% for 5m
CXC-MemoryUsage	HIGH	Triggered when the memory utilization of a pod is beyond the threshold.	70% for 5m
CXC-PodNotReadyCount	HIGH	Triggered when the number of pods ready for a CX Contact deployment is less than or equal to the threshold.	1 for 5m
CXC-PodRestartsCount	HIGH	Triggered when the restart count for a pod is beyond the threshold.	1 for 5m
CXC-MemoryUsagePD	HIGH	Triggered when the memory usage of a pod is above the critical threshold.	90% for 5m
CXC-PodRestartsCountPD	HIGH	Triggered when the restart count is beyond the critical threshold.	5 for 5m
CXC-PodsNotReadyPD	HIGH	Triggered when there are no pods ready for CX Contact deployment.	0 for 1m

@@ Line 6: / Line 6: @@
 |Endpoint=/metrics
 |MetricsUpdateInterval=15 seconds
+|MetricsDefined=Yes
 |PEMetric={{PEMetric
 |Metric=cxc_list_manager_executed_jobs_count
@@ Line 68: / Line 69: @@
 }}
 |AlertsDefined=Yes
+|PEAlert={{PEAlert
+|Alert=CXC-LM-LatencyHigh
+|Severity=HIGH
+|AlertDescription=Triggered when the latency for list manager is above the defined threshold
+|Threshold=5000ms for 5m
+}}{{PEAlert
+|Alert=cxc_list_manager_too_many_errors_from_auth
+|Severity=HIGH
+|AlertDescription=Triggered when there are too many error responses from the auth service (list manager) for more than the specified time threshold.
+|Threshold=1m
+}}{{PEAlert
+|Alert=CXC-CPUUsage
+|Severity=HIGH
+|AlertDescription=Triggered when the CPU utilization of a pod is beyond the threshold
+|Threshold=300% for 5m
+}}{{PEAlert
+|Alert=CXC-MemoryUsage
+|Severity=HIGH
+|AlertDescription=Triggered when the memory utilization of a pod is beyond the threshold.
+|Threshold=70% for 5m
+}}{{PEAlert
+|Alert=CXC-PodNotReadyCount
+|Severity=HIGH
+|AlertDescription=Triggered when the number of pods ready for a CX Contact deployment is less than or equal to the threshold.
+|Threshold=1 for 5m
+}}{{PEAlert
+|Alert=CXC-PodRestartsCount
+|Severity=HIGH
+|AlertDescription=Triggered when the restart count for a pod is beyond the threshold.
+|Threshold=1 for 5m
+}}{{PEAlert
+|Alert=CXC-MemoryUsagePD
+|Severity=HIGH
+|AlertDescription=Triggered when the memory usage of a pod is above the critical threshold.
+|Threshold=90% for 5m
+}}{{PEAlert
+|Alert=CXC-PodRestartsCountPD
+|Severity=HIGH
+|AlertDescription=Triggered when the restart count is beyond the critical threshold.
+|Threshold=5 for 5m
+}}{{PEAlert
+|Alert=CXC-PodsNotReadyPD
+|Severity=HIGH
+|AlertDescription=Triggered when there are no pods ready for CX Contact deployment.
+|Threshold=0 for 1m
+}}
 }}

Outbound (CX Contact) Private Edition Guide

Overview

Configure and deploy

Upgrade, roll back, or uninstall

Observability

Difference between revisions of "PEC-OU/Current/CXCPEGuide/LMMetrics"

Latest revision as of 14:23, February 7, 2022

Contents

Metrics[edit source]

Alerts[edit source]