Difference between revisions of "Draft: UCS/Current/UCSPEGuide/Metrics"
From Genesys Documentation
m |
m |
||
Line 172: | Line 172: | ||
|sectionHeading=SQL Monitoring | |sectionHeading=SQL Monitoring | ||
|alignment=Vertical | |alignment=Vertical | ||
+ | |structuredtext={{{!}} class="wikitable" | ||
+ | {{!}}- | ||
+ | !Name (type) | ||
+ | !Key | ||
+ | !Units | ||
+ | !Additional<br />keys | ||
+ | !Description | ||
+ | !Common description | ||
+ | {{!}}- | ||
+ | {{!}}ucsx_sql_bucket<br />(Histogram) | ||
+ | {{!}}le="10" | ||
+ | {{!}}ms | ||
+ | {{!}}'nodeId',<br />'pid',<br />'operation',<br />'ccId',<br />'ccName' | ||
+ | {{!}}Count SQL request's of duration less 10ms | ||
+ | {{!}}Metrics of raw SQL request duration.<br />Key 'operation' might have values: 'update','insert','delete','query','rawSql' | ||
+ | {{!}}- | ||
+ | {{!}} | ||
+ | {{!}}le="50" | ||
+ | {{!}} | ||
+ | {{!}} | ||
+ | {{!}}Count SQL request's of duration less 50ms | ||
+ | {{!}} | ||
+ | {{!}}- | ||
+ | {{!}} | ||
+ | {{!}}le="200" | ||
+ | {{!}} | ||
+ | {{!}} | ||
+ | {{!}}Count SQL request's of duration less 200ms | ||
+ | {{!}} | ||
+ | {{!}}- | ||
+ | {{!}} | ||
+ | {{!}}le="1000" | ||
+ | {{!}} | ||
+ | {{!}} | ||
+ | {{!}}Count SQL request's of duration less 1000ms | ||
+ | {{!}} | ||
+ | {{!}}- | ||
+ | {{!}} | ||
+ | {{!}}le="+Inf" | ||
+ | {{!}} | ||
+ | {{!}} | ||
+ | {{!}}Count all SQL request's | ||
+ | {{!}} | ||
+ | {{!}}- | ||
+ | {{!}}ucsx_sql_sum<br />(Histogram) | ||
+ | {{!}} | ||
+ | {{!}} | ||
+ | {{!}} | ||
+ | {{!}}Total duration of all raw SQL requests | ||
+ | {{!}} | ||
+ | {{!}}- | ||
+ | {{!}}ucsx_sql_count<br />(Histogram) | ||
+ | {{!}} | ||
+ | {{!}} | ||
+ | {{!}} | ||
+ | {{!}}Total number of all raw SQL requests | ||
+ | {{!}} | ||
+ | {{!}}} | ||
|Status=No | |Status=No | ||
}}{{Section | }}{{Section | ||
Line 185: | Line 243: | ||
!Common description | !Common description | ||
{{!}}- | {{!}}- | ||
− | {{!}} rowspan="6" {{!}}ucsx_cache<br />(Counter) | + | {{!}} rowspan="6"{{!}}ucsx_cache<br />(Counter) |
{{!}}method="set" | {{!}}method="set" | ||
− | {{!}} rowspan="6" {{!}} | + | {{!}} rowspan="6"{{!}} |
− | {{!}} rowspan="6" {{!}}'nodeId',<br />'pid',<br />'key' | + | {{!}} rowspan="6"{{!}}'nodeId',<br />'pid',<br />'key' |
{{!}}Count writes to cache | {{!}}Count writes to cache | ||
− | {{!}} rowspan="6" {{!}}Metrics of internal application cache | + | {{!}} rowspan="6"{{!}}Metrics of internal application cache |
{{!}}- | {{!}}- | ||
{{!}}method="get" | {{!}}method="get" |
Revision as of 07:24, June 17, 2021
This is a draft page; the published version of this page can be found at UCS/Current/UCSPEGuide/Metrics.
Contents
Learn which metrics you should monitor for <service_name> and when to sound the alarm.
Include a link to the "suite-level" documentation for monitoring:
UCS-X provides internal monitoring metrics through a Prometheus endpoint on port 10052.
#mintydocs_link must be called from a MintyDocs-enabled page (Draft:UCS/Current/UCSPEGuide/Metrics).
Describe metrics (compatible with Prometheus endpoints) that customers can use to create their own monitoring dashboard in a tool like Grafana.
Make to identify any metrics that are important to monitor for alarming purposes, and include sample thresholds the customer should alarm on.
Common Performance Metrics
Name (type) | Key | Units | Additional keys |
Alarm condition | Description | Common description |
---|---|---|---|---|---|---|
ucsx_performance (Gauge) |
metric="cpuUsage" | % | 'nodeId', 'pid' |
> 95 | CPU usage | basic metric, created with starting metrics service, cannot be disabled |
metric="loopDelay" | ms | event loop lag | ||||
ucsx_internal_queue (Gauge) |
metric="queueSize" | 'nodeId', 'pid', 'endpoint' |
number requests waiting for processing | |||
metric="queueDelay" | ms | > 10000 | request's waiting for processing time | |||
ucsx_timings
(Gauge) |
metric="cpuTime" | s |
'nodeId', 'pid' |
CPU time used by process | basic metric, created with starting metrics service, cannot be disabled | |
metric="sysTime" | Syetm mode time | |||||
metric="userTime" | User mode time | |||||
metric="upTime" | Process running time | |||||
ucsx_memory
(Gauge) |
metric="rss" | Mb |
'nodeId', 'pid', |
> 1024 | Resident Set Size | basic metric, created with starting metrics service, cannot be disabled |
metric="heapTotal" | ||||||
metric="heapUsed" |
Database Connection Metrics
Name (type) | Key | Units | Additional keys |
Alarm condition | Description | Common description |
---|---|---|---|---|---|---|
ucsx_dbPool (Gauge) |
metric="total" | 'nodeId', 'pid', 'ccId', 'address' |
Total count of connection to particular database (defined by key 'ccId') | basic metric, created with starting call-center-storage service, cannot be disabled, shows usage of connection pool | ||
metric="idle" | Total count of idle connection to particular database (defined by key 'ccId') | |||||
metric="wait" | Total count of connection in 'waiting' state to particular database (defined by key 'ccId') | |||||
metric="max" | Max available connection for this instance of database |
HTTP Request Metrics
CometD Metrics
Internal Functions Calls Monitoring
Name (type) | Key | Units | Additional keys |
Description | Common description |
---|---|---|---|---|---|
ucsx_cache (Counter) |
method="set" | 'nodeId', 'pid', 'key' |
Count writes to cache | Metrics of internal application cache | |
method="get" | Count successful reads from cache | ||||
method="delete" | Count delete operations from cache | ||||
method="expired" | Count unsuccessful reads from cache due to data expiration | ||||
method="remove" | Count remove operation by timer | ||||
method="miss" | Count unsuccessful reads from cache due to data absence |
SQL Monitoring
Name (type) | Key | Units | Additional keys |
Description | Common description |
---|---|---|---|---|---|
ucsx_sql_bucket (Histogram) |
le="10" | ms | 'nodeId', 'pid', 'operation', 'ccId', 'ccName' |
Count SQL request's of duration less 10ms | Metrics of raw SQL request duration. Key 'operation' might have values: 'update','insert','delete','query','rawSql' |
le="50" | Count SQL request's of duration less 50ms | ||||
le="200" | Count SQL request's of duration less 200ms | ||||
le="1000" | Count SQL request's of duration less 1000ms | ||||
le="+Inf" | Count all SQL request's | ||||
ucsx_sql_sum (Histogram) |
Total duration of all raw SQL requests | ||||
ucsx_sql_count (Histogram) |
Total number of all raw SQL requests |
Internal Cache Monitoring
Name (type) | Key | Units | Additional keys |
Description | Common description |
---|---|---|---|---|---|
ucsx_cache (Counter) |
method="set" | 'nodeId', 'pid', 'key' |
Count writes to cache | Metrics of internal application cache | |
method="get" | Count successful reads from cache | ||||
method="delete" | Count delete operations from cache | ||||
method="expired" | Count unsuccessful reads from cache due to data expiration | ||||
method="remove" | Count remove operation by timer | ||||
method="miss" | Count unsuccessful reads from cache due to data absence |
Elasticsearch Monitoring
Name (type) | Key | Units | Additional keys |
Alarm condition | Description | Common description |
---|---|---|---|---|---|---|
ucsx_elastic_search_bucket (Histogram) |
le="10" | ms | 'nodeId', 'pid', 'operation', 'ccId', 'ccName' |
Count ES request's of duration less 10ms | Metrics of ElasticSearch request duration. Key 'operation' can have values: 'read','write'. | |
le="50" | Count ES request's of duration less 50ms | |||||
le="200" | Count ES request's of duration less 200ms | |||||
le="1000" | Count ES request's of duration less 1000ms | |||||
le="+Inf" | Count all ES request's | |||||
ucsx_elastic_search_sum (Histogram) |
Total duration of all ElasticSearch requests | |||||
ucsx_elastic_search_count (Histogram) |
Total number of all ElasticSearch requests |
Session Metrics
Name (type) | Key | Units | Additional keys |
Description | Common description |
---|---|---|---|---|---|
ucsx_sessions (Gauge) |
state="active" | counter | 'nodeId', 'pid', 'state', 'session' |
Count of sessions that have had some activity within configured interval of time. | Available session types are 'HTTP' and 'cometD' |
state="idle" | Count of sessions that have not had some activity within configured interval of time but still alive. | ||||
state="newPerMinute" | Count of new opened sessions per last minute |