Metrics

From Genesys Documentation
Jump to: navigation, search
This is a draft page; the published version of this page can be found at UCS/Current/UCSPEGuide/Metrics.

Learn which metrics you should monitor for <service_name> and when to sound the alarm.

Include a link to the "suite-level" documentation for monitoring:
#mintydocs_link must be called from a MintyDocs-enabled page (Draft:UCS/Current/UCSPEGuide/Metrics).

Describe metrics (compatible with Prometheus endpoints) that customers can use to create their own monitoring dashboard in a tool like Grafana.

Make to identify any metrics that are important to monitor for alarming purposes, and include sample thresholds the customer should alarm on.
UCS-X provides internal monitoring metrics through a Prometheus endpoint on port 10052.


Common Performance Metrics

Name (type) Key Units Additional
keys
Alarm condition Description Common description
ucsx_performance
(Gauge)
metric="cpuUsage" % 'nodeId',
'pid'
> 95 CPU usage basic metric,
created with starting metrics service,
cannot be disabled
metric="loopDelay" ms event loop lag
ucsx_internal_queue
(Gauge)
metric="queueSize" 'nodeId',
'pid',
'endpoint'
number requests waiting for processing
metric="queueDelay" ms > 10000 request's waiting for processing time
ucsx_timings

(Gauge)

metric="cpuTime" s

'nodeId',
'pid'


CPU time used by process basic metric,
created with starting metrics service,
cannot be disabled
metric="sysTime" Syetm mode time
metric="userTime" User mode time
metric="upTime" Process running time
ucsx_memory

(Gauge)

metric="rss" Mb

'nodeId',
'pid',


> 1024 Resident Set Size basic metric,
created with starting metrics service,
cannot be disabled
metric="heapTotal"
metric="heapUsed"