List Manager metrics and alerts
Find the metrics LM exposes and the alerts defined for LM.
Service | CRD or annotations? | Port | Endpoint/Selector | Metrics update interval |
---|---|---|---|---|
List Manager | ServiceMonitor | 3105 | /metrics | 15 seconds |
See details about:
Metrics[edit source]
Metric and description | Metric details | Indicator of |
---|---|---|
cxc_ Total executed jobs count. |
Unit: Type: Counter |
|
cxc_ Running jobs count. |
Unit: Type: Gauge |
|
cxc_ Rejected jobs count. |
Unit: Type: Counter |
|
cxc_ Job duration, in milliseconds. |
Unit: Type: Histogram |
|
cxc_ Response time, in milliseconds. |
Unit: Type: Summary |
|
cxc_ Healthy instance. |
Unit: Type: Gauge |
|
cxc_ Count of downloaded compliance files. |
Unit: Type: Counter |
|
cxc_ Count of created Contacts Lists. |
Unit: Type: Counter |
|
cxc_ Count of created Contacts Lists. |
Unit: Type: Counter |
|
cxc_ Count of created Contacts Lists. |
Unit: Type: Counter |
Alerts[edit source]
The following alerts are defined for List Manager.
Alert | Severity | Description | Based on | Threshold |
---|---|---|---|---|
CXC-LM-LatencyHigh | HIGH | Triggered when the latency for list manager is above the defined threshold | 5000ms for 5m
| |
cxc_list_manager_too_many_errors_from_auth | HIGH | Triggered when there are too many error responses from the auth service (list manager) for more than the specified time threshold. | 1m
| |
CXC-CPUUsage | HIGH | Triggered when the CPU utilization of a pod is beyond the threshold | 300% for 5m
| |
CXC-MemoryUsage | HIGH | Triggered when the memory utilization of a pod is beyond the threshold. | 70% for 5m
| |
CXC-PodNotReadyCount | HIGH | Triggered when the number of pods ready for a CX Contact deployment is less than or equal to the threshold. | 1 for 5m
| |
CXC-PodRestartsCount | HIGH | Triggered when the restart count for a pod is beyond the threshold. | 1 for 5m
| |
CXC-MemoryUsagePD | HIGH | Triggered when the memory usage of a pod is above the critical threshold. | 90% for 5m
| |
CXC-PodRestartsCountPD | HIGH | Triggered when the restart count is beyond the critical threshold. | 5 for 5m
| |
CXC-PodsNotReadyPD | HIGH | Triggered when there are no pods ready for CX Contact deployment. | 0 for 1m |