Metrics
From Genesys Documentation
Revision as of 19:31, June 2, 2021 by Tony.gilchrist@genesys.com (talk | contribs)
This is a draft page; it has not yet been published.
Learn which metrics you should monitor for <service_name> and when to sound the alarm.
Pushgateway and Kubernetes metrics
Pushgateway:
| Metric | Description | Alarm suggestion |
|---|---|---|
| *bds_pod_processing_start* | Start time of BDS processing | If more than 43200 sec (12 hours) ago |
| *bds_pod_processing_end* | End time of BDS processing | If later than bds_pod_processing_start + 43200 sec |
| *bds_processing_exit_code* | Processing exit code | If != 0 |
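
The alarm suggestions in the Pushgateway table can be sketched as a small check. This is a minimal illustration, not the service's own alerting logic: it assumes the start and end metrics hold Unix timestamps in seconds, and the function name `pushgateway_alarms` and the alarm labels are hypothetical.

```python
from typing import List, Optional

# 43200 sec (12 hours) is the threshold suggested in the table above.
MAX_PROCESSING_SEC = 43200


def pushgateway_alarms(
    start: float, end: Optional[float], exit_code: int, now: float
) -> List[str]:
    """Return labels of the suggested alarm conditions that currently hold.

    Assumption: start, end, and now are Unix timestamps in seconds.
    Pass end=None if no end time has been pushed yet.
    """
    alarms = []
    # bds_pod_processing_start: processing began more than 12 hours ago.
    if now - start > MAX_PROCESSING_SEC:
        alarms.append("stale_start")
    # bds_pod_processing_end: processing ended more than 12 hours after it began.
    if end is not None and end > start + MAX_PROCESSING_SEC:
        alarms.append("late_end")
    # bds_processing_exit_code: any non-zero exit code.
    if exit_code != 0:
        alarms.append("nonzero_exit")
    return alarms
```

For example, a run that started at `0`, ended at `50000`, and exited with code `1` would trigger all three conditions when checked at `now=50000`.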
Kubernetes:
| Metric | Description | Alarm suggestion |
|---|---|---|
| *kube_pod_container_status_terminated_reason* | Shows the reason a container was terminated | On Error |
| *kube_job_status_start_time* | Can be used to trigger alerts if processing takes too long | If started > 43200 sec ago |
| *kube_job_status_failed* | Can be used to trigger alerts on failed jobs | > 0 |
| *kube_pod_created* | Used to calculate pod lifetime | Info |
| *kube_job_status_start_time* | Shows started jobs | Info |
| *container_cpu_usage_seconds_total* | Shows CPU usage | Info |
| *container_memory_usage_bytes* | Shows memory usage | Info |
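
The three alerting rows of the Kubernetes table can be sketched the same way. This is an illustration under stated assumptions: `kube_job_status_start_time` is treated as a Unix timestamp in seconds, the terminated-reason metric is modeled as a mapping from its `reason` label to the sample value, and the function name `kubernetes_alarms` and the alarm labels are hypothetical.

```python
from typing import Dict, List

# 43200 sec (12 hours) is the threshold suggested in the table above.
MAX_JOB_SEC = 43200


def kubernetes_alarms(
    job_start_time: float,
    job_failed: int,
    terminated_reasons: Dict[str, int],
    now: float,
) -> List[str]:
    """Return labels of the suggested Kubernetes alarm conditions that hold.

    Assumptions: job_start_time and now are Unix timestamps in seconds;
    terminated_reasons maps a termination reason (for example "Error")
    to the value of kube_pod_container_status_terminated_reason for it.
    """
    alarms = []
    # kube_pod_container_status_terminated_reason: alert on Error.
    if terminated_reasons.get("Error", 0) > 0:
        alarms.append("container_error")
    # kube_job_status_start_time: the job started more than 12 hours ago.
    if now - job_start_time > MAX_JOB_SEC:
        alarms.append("job_too_long")
    # kube_job_status_failed: at least one failed pod for the job.
    if job_failed > 0:
        alarms.append("job_failed")
    return alarms
```

Note that the start-time check alone cannot tell a long-running job from one that finished long ago, so in practice it would likely be combined with a job-completion signal.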