<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://all.docs.genesys.com/index.php?action=history&amp;feed=atom&amp;title=VM%2FCurrent%2FVMPEGuide%2FVoiceOrchestrationServiceMetrics</id>
	<title>VM/Current/VMPEGuide/VoiceOrchestrationServiceMetrics - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://all.docs.genesys.com/index.php?action=history&amp;feed=atom&amp;title=VM%2FCurrent%2FVMPEGuide%2FVoiceOrchestrationServiceMetrics"/>
	<link rel="alternate" type="text/html" href="https://all.docs.genesys.com/index.php?title=VM/Current/VMPEGuide/VoiceOrchestrationServiceMetrics&amp;action=history"/>
	<updated>2026-04-15T00:57:20Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.31.1</generator>
	<entry>
		<id>https://all.docs.genesys.com/index.php?title=VM/Current/VMPEGuide/VoiceOrchestrationServiceMetrics&amp;diff=116229&amp;oldid=prev</id>
		<title>Corinneh: Published</title>
		<link rel="alternate" type="text/html" href="https://all.docs.genesys.com/index.php?title=VM/Current/VMPEGuide/VoiceOrchestrationServiceMetrics&amp;diff=116229&amp;oldid=prev"/>
		<updated>2022-02-23T20:56:32Z</updated>

		<summary type="html">&lt;p&gt;Published&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;{{ArticlePEServiceMetrics&lt;br /&gt;
|IncludedServiceId=79f6035b-590e-4c34-9432-fcd4b5ff519d&lt;br /&gt;
|CRD=Supports both CRD and annotations&lt;br /&gt;
|Port=11200&lt;br /&gt;
|Endpoint=http://&amp;lt;pod-ipaddress&amp;gt;:11200/metrics&lt;br /&gt;
|MetricsUpdateInterval=30 seconds&lt;br /&gt;
|MetricsDefined=Yes&lt;br /&gt;
|MetricsIntro=You can query Prometheus directly to see all the metrics that the Voice Orchestration Service exposes. The following metrics are likely to be particularly useful. Genesys does not commit to maintain other currently available Orchestration Service metrics not documented on this page.&lt;br /&gt;
|PEMetric={{PEMetric&lt;br /&gt;
|Metric=orsnode_callevents&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Total number of received call events.&lt;br /&gt;
|UsedFor=Traffic&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_ha_writes&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The number of HA writes to Redis.&lt;br /&gt;
|UsedFor=Traffic&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_ha_reads&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The number of HA reads from Redis.&lt;br /&gt;
|UsedFor=Traffic&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_interactions&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The number of active interactions.&lt;br /&gt;
|UsedFor=Traffic&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_total_interactions&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The total number of interactions that have been created.&lt;br /&gt;
|UsedFor=Traffic&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_cleared_interactions&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The total number of call interactions that have been cleared.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_strategies&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The number of strategies that are running.&lt;br /&gt;
|UsedFor=Traffic&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_total_strategies&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The total number of strategies that have been created.&lt;br /&gt;
|UsedFor=Traffic&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_load_errors&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The total number of strategy load errors.&lt;br /&gt;
|UsedFor=Errors&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_fetch_errors&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The total number of errors encountered when a strategy tried to fetch data from a Designer Application Server (DAS).&lt;br /&gt;
|UsedFor=Errors&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_config_errors&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The total number of strategy configuration errors.&lt;br /&gt;
|UsedFor=Errors&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_invoke_errors&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The total number of strategy invoke errors.&lt;br /&gt;
|UsedFor=Errors&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_treatments&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The total number of strategy treatments.&lt;br /&gt;
|UsedFor=Traffic&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_failed_treatments&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The total number of failed strategy treatments.&lt;br /&gt;
|UsedFor=Errors&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_userdata_updates&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The total number of times that a strategy updated user data.&lt;br /&gt;
|UsedFor=Traffic&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_scxml_transitions&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The total number of SCXML transitions.&lt;br /&gt;
|UsedFor=Traffic&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_scxml_events&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The total number of SCXML events.&lt;br /&gt;
|UsedFor=Traffic&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_scxml_error_events&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The total number of SCXML &amp;lt;tt&amp;gt;error.*&amp;lt;/tt&amp;gt; events.&lt;br /&gt;
|UsedFor=Errors&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_http_fetch_requests&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The total number of HTTP &amp;lt;tt&amp;gt;fetch&amp;lt;/tt&amp;gt; requests.&lt;br /&gt;
|UsedFor=Errors&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_http_fetch_duration&lt;br /&gt;
|Type=histogram&lt;br /&gt;
|Unit=milliseconds&lt;br /&gt;
|MetricDescription=The HTTP fetch time, measured in milliseconds (ms).&lt;br /&gt;
|UsedFor=Latency&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_http_fetch_errors&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The total number of HTTP &amp;lt;tt&amp;gt;fetch&amp;lt;/tt&amp;gt; errors.&lt;br /&gt;
|UsedFor=Errors&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_http_fetch_error_status&lt;br /&gt;
|Type=histogram&lt;br /&gt;
|MetricDescription=Status of the HTTP &amp;lt;tt&amp;gt;fetch&amp;lt;/tt&amp;gt; error.&lt;br /&gt;
|UsedFor=Errors&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_urs_rlib_latency_msec&lt;br /&gt;
|Type=histogram&lt;br /&gt;
|Unit=milliseconds&lt;br /&gt;
|MetricDescription=The Universal Routing Server (URS) &amp;lt;tt&amp;gt;rlib&amp;lt;/tt&amp;gt; latency, measured in milliseconds (ms).&lt;br /&gt;
|UsedFor=Latency&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_urs_rlib_errors&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The total number of URS &amp;lt;tt&amp;gt;rlib&amp;lt;/tt&amp;gt; errors.&lt;br /&gt;
|UsedFor=Errors&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_urs_rlib_requests&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The total number of URS &amp;lt;tt&amp;gt;rlib&amp;lt;/tt&amp;gt; requests.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_urs_rlib_events&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The total number of URS &amp;lt;tt&amp;gt;rlib&amp;lt;/tt&amp;gt; events.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_urs_rlib_timeouts&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The total number of URS &amp;lt;tt&amp;gt;rlib&amp;lt;/tt&amp;gt; timeouts.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_redis_state&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|Label=redis_cluster_name&lt;br /&gt;
|MetricDescription=Current Redis connection state.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_redis_disconnect&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The number of times that the ORS node disconnected from Redis.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_sdr_messages_sent&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The number of SDR messages that have been sent.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_rq_latency_msec&lt;br /&gt;
|Type=histogram&lt;br /&gt;
|Unit=milliseconds&lt;br /&gt;
|Label=le, service&lt;br /&gt;
|MetricDescription=Redis queue latency, measured in milliseconds (ms).&lt;br /&gt;
|UsedFor=Latency&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_routing_latency_msec&lt;br /&gt;
|Type=histogram&lt;br /&gt;
|Unit=milliseconds&lt;br /&gt;
|MetricDescription=Routing latency, measured in milliseconds (ms).&lt;br /&gt;
|UsedFor=Latency&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_rstream_latency_msec&lt;br /&gt;
|Type=histogram&lt;br /&gt;
|Unit=milliseconds&lt;br /&gt;
|Label=le, node&lt;br /&gt;
|MetricDescription=Redis stream latency, measured in (ms).&lt;br /&gt;
|UsedFor=Latency&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_digital_latency_msec&lt;br /&gt;
|Type=histogram&lt;br /&gt;
|Unit=milliseconds&lt;br /&gt;
|MetricDescription=Digital stream latency, measured in milliseconds (ms).&lt;br /&gt;
|UsedFor=Latency&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_sip_health_check&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|Label=node&lt;br /&gt;
|MetricDescription=ORS health check.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_ixn_health_check&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Interaction health check.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_rq_state&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Current Redis queue connection state.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_ixn_events&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Total number of interaction stream events received.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_rq_disconnect&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Number of times the ORS node disconnected from the RQ Service.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=service_version_info&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|Label=version&lt;br /&gt;
|MetricDescription=Displays the version of Voice Orchestration Service that is currently running. In the case of this metric, the labels provide the important information. The metric value is always 1 and does not provide any information.&lt;br /&gt;
|SampleValue=service_version_info{version=&amp;quot;100.0.1000006&amp;quot;} 1&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_route_redirected&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Total number of EventRouteUsed events without a ReferenceID.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_balancer_stream_state&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|Label=balancer_stream_type&lt;br /&gt;
|MetricDescription=The state of the voice balancer stream.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_high_memory&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Indicates when the ORS node is using a lot of memory.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_urs_rlib_state&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Indicates a Tenant &amp;lt;tt&amp;gt;rlib&amp;lt;/tt&amp;gt; request timeout.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_stuck_interactions&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The number of stuck interactions.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_urs_scxml_submit_requests&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The total number of URS &amp;lt;tt&amp;gt;SCXMLSubmit&amp;lt;/tt&amp;gt; requests.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_urs_scxml_cancel_requests&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The total number of URS &amp;lt;tt&amp;gt;SCXMLQueueCancel&amp;lt;/tt&amp;gt; requests.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_urs_queue_submit_done_events&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Total number of URS &amp;lt;tt&amp;gt;queue.submit.done&amp;lt;/tt&amp;gt; events.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_health_level&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Summarized health level of the ORS node: &lt;br /&gt;
&lt;br /&gt;
-1 – fail&amp;lt;br /&amp;gt;&lt;br /&gt;
0 – starting&amp;lt;br /&amp;gt;&lt;br /&gt;
1 – degraded&amp;lt;br /&amp;gt;&lt;br /&gt;
2 – pass&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_health_check_error&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|Label=reason&lt;br /&gt;
|MetricDescription=Health check errors for the ORS node:&lt;br /&gt;
&lt;br /&gt;
1 – has error&amp;lt;br /&amp;gt;&lt;br /&gt;
0 – no error&lt;br /&gt;
|UsedFor=Errors&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_running_applications&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The number of active sessions for each Designer application.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_failed_applications&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The number of failed sessions for each Designer application.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_total_applications&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The total number of sessions created for each Designer application.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_failed_scripts&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The number of scripts that failed to load in the Tenant Service configuration management environment.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_session_load_time_msec&lt;br /&gt;
|Type=histogram&lt;br /&gt;
|Unit=milliseconds&lt;br /&gt;
|MetricDescription=The time it takes for the strategy to be compiled and go through its initial states.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_service_started&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|Label=started&lt;br /&gt;
|MetricDescription=Timestamp when the ORS node started.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_total_terminal_requests&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Total number of terminal requests (like Deliver, PlaceInQueue, StopProcessing for Digital and RequestClearCall, RequestRouteCall for Voice).&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_total_non_terminal_requests&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Total number of non-terminal requests to the Interaction Server (for Digital) or SIP Server (for Voice).&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_sip_post_errors&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Total number of errors encountered in POST requests to the SIP node.&lt;br /&gt;
|UsedFor=Errors&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_pending_tlib_requests&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Total number of pending TLib requests.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_sips_rest_connections&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The number of active REST connections with SIP Cluster Service.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_number_compiled_applications&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The number of compiled applications in the cache.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_cached_applications_size&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|MetricDescription=The sum of the sizes of the cached applications.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_tlib_latency_msec&lt;br /&gt;
|Type=histogram&lt;br /&gt;
|Unit=milliseconds&lt;br /&gt;
|Label=le&lt;br /&gt;
|MetricDescription=The TLib Rest API request latency, measured in (ms).&lt;br /&gt;
|UsedFor=Latency&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_application_size&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|MetricDescription=The compiled size of the Designer application.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_application_microstep_count&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The number of microsteps while executing the Designer application.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_application_run_time_msec&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=milliseconds&lt;br /&gt;
|MetricDescription=The length of time the Designer application was running, measured in milliseconds (ms).&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_application_compiled_date&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The date on which the Designer application was compiled.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=orsnode_application_last_invoked_date&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The date when the Designer application was last invoked.&lt;br /&gt;
}}&lt;br /&gt;
|AlertsDefined=Yes&lt;br /&gt;
|PEAlert={{PEAlert&lt;br /&gt;
|Alert=Number of running strategies is too high&lt;br /&gt;
|Severity=Warning&lt;br /&gt;
|AlertDescription=Too many active sessions.&lt;br /&gt;
&lt;br /&gt;
Actions:&lt;br /&gt;
&lt;br /&gt;
*Check whether the horizontal pod autoscaler has triggered and if the maximum number of pods has been reached.&lt;br /&gt;
*Check the number of voice, digital, and callback calls in the system.&lt;br /&gt;
|BasedOn=orsnode_strategies&lt;br /&gt;
|Threshold=More than 400 strategies running in 5 consecutive seconds.&lt;br /&gt;
}}{{PEAlert&lt;br /&gt;
|Alert=Number of running strategies is critical&lt;br /&gt;
|Severity=Critical&lt;br /&gt;
|AlertDescription=Too many active sessions.&lt;br /&gt;
&lt;br /&gt;
Actions:&lt;br /&gt;
&lt;br /&gt;
*Check whether the horizontal pod autoscaler has triggered and if the maximum number of pods has been reached.&lt;br /&gt;
*Check the number of voice, digital, and callback calls in the system.&lt;br /&gt;
|BasedOn=orsnode_strategies&lt;br /&gt;
|Threshold=More than 600 strategies running in 5 consecutive seconds.&lt;br /&gt;
}}{{PEAlert&lt;br /&gt;
|Alert=Redis disconnected for 5 minutes&lt;br /&gt;
|Severity=Warning&lt;br /&gt;
|AlertDescription=Actions:&lt;br /&gt;
&lt;br /&gt;
*If the alarm is triggered for multiple services, make sure there are no issues with Redis, and then restart Redis.&lt;br /&gt;
*If alarm is triggered only for pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt;, check if there is an issue with the pod.&lt;br /&gt;
|BasedOn=redis_state&lt;br /&gt;
|Threshold=Redis is not available for the pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt; for 5 minutes.&lt;br /&gt;
}}{{PEAlert&lt;br /&gt;
|Alert=Redis disconnected for 10 minutes&lt;br /&gt;
|Severity=Critical&lt;br /&gt;
|AlertDescription=Actions:&lt;br /&gt;
&lt;br /&gt;
*If the alarm is triggered for multiple services, make sure there are no issues with Redis, and then restart Redis.&lt;br /&gt;
*If the alarm is triggered only for pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt;, check if there is an issue with the pod.&lt;br /&gt;
|BasedOn=redis_state&lt;br /&gt;
|Threshold=Redis is not available for the pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt; for 10 minutes.&lt;br /&gt;
}}{{PEAlert&lt;br /&gt;
|Alert=Pod status Failed&lt;br /&gt;
|Severity=Warning&lt;br /&gt;
|AlertDescription=Pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt; failed.&lt;br /&gt;
&lt;br /&gt;
Actions:&lt;br /&gt;
&lt;br /&gt;
*One of the containers in the pod has entered a Failed state. Check the Kibana logs for the reason.&lt;br /&gt;
|BasedOn=kube_pod_status_phase&lt;br /&gt;
|Threshold=Pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt; is in Failed state.&lt;br /&gt;
}}{{PEAlert&lt;br /&gt;
|Alert=Pod in Unknown state&lt;br /&gt;
|Severity=Warning&lt;br /&gt;
|AlertDescription=Pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt; is in Unknown state.&lt;br /&gt;
&lt;br /&gt;
Actions:&lt;br /&gt;
&lt;br /&gt;
*If the alarm is triggered for multiple services, make sure there are no issues with the Kubernetes cluster.&lt;br /&gt;
*If the alarm is triggered only for pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt;, check whether the image is correct and if the container is starting up.&lt;br /&gt;
|BasedOn=kube_pod_status_phase&lt;br /&gt;
|Threshold=Pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt; is in Unknown state for 5 minutes.&lt;br /&gt;
}}{{PEAlert&lt;br /&gt;
|Alert=Pod in Pending state&lt;br /&gt;
|Severity=Warning&lt;br /&gt;
|AlertDescription=Pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt; is in Pending state.&lt;br /&gt;
&lt;br /&gt;
Actions:&lt;br /&gt;
&lt;br /&gt;
*If the alarm is triggered for multiple services, make sure the Kubernetes nodes where the pod is running are alive in the cluster.&lt;br /&gt;
*If the alarm is triggered only for the pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt;, check the health of the pod.&lt;br /&gt;
|BasedOn=kube_pod_status_phase&lt;br /&gt;
|Threshold=Pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt; is in Pending state for 5 minutes.&lt;br /&gt;
}}{{PEAlert&lt;br /&gt;
|Alert=Pod Not ready for 10 minutes&lt;br /&gt;
|Severity=Critical&lt;br /&gt;
|AlertDescription=Pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt; in NotReady state.&lt;br /&gt;
&lt;br /&gt;
Actions:&lt;br /&gt;
&lt;br /&gt;
*If this alarm is triggered, check whether the CPU is available for the pods.&lt;br /&gt;
*Check whether the port of the pod is running and serving the request.&lt;br /&gt;
|BasedOn=kube_pod_status_ready&lt;br /&gt;
|Threshold=Pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt; in NotReady state for 10 minutes.&lt;br /&gt;
}}{{PEAlert&lt;br /&gt;
|Alert=Container restored repeatedly&lt;br /&gt;
|Severity=Critical&lt;br /&gt;
|AlertDescription=Actions:&lt;br /&gt;
&lt;br /&gt;
*One of the containers in the pod has entered a failed state. Check the Kibana logs for the reason.&lt;br /&gt;
|BasedOn=kube_pod_container_status_restarts_total&lt;br /&gt;
|Threshold=Container &amp;lt;nowiki&amp;gt;{{ $labels.container }}&amp;lt;/nowiki&amp;gt; was restarted 5 or more times within 15 minutes.&lt;br /&gt;
}}{{PEAlert&lt;br /&gt;
|Alert=Pod memory greater than 65%&lt;br /&gt;
|Severity=Warning&lt;br /&gt;
|AlertDescription=High memory usage for pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
Actions:&lt;br /&gt;
&lt;br /&gt;
*Check whether the horizontal pod autoscaler has triggered and if the maximum number of pods has been reached.&lt;br /&gt;
*Check Grafana for abnormal load.&lt;br /&gt;
*Collect the service logs; raise an investigation ticket.&lt;br /&gt;
|BasedOn=container_memory_working_set_bytes, kube_pod_container_resource_requests_memory_bytes&lt;br /&gt;
|Threshold=Container &amp;lt;nowiki&amp;gt;{{ $labels.container }}&amp;lt;/nowiki&amp;gt; memory usage exceeded 65% for 5 minutes.&lt;br /&gt;
}}{{PEAlert&lt;br /&gt;
|Alert=Pod memory greater than 80%&lt;br /&gt;
|Severity=Critical&lt;br /&gt;
|AlertDescription=Critical memory usage for pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
Actions:&lt;br /&gt;
&lt;br /&gt;
*Check whether the horizontal pod autoscaler has triggered and if the maximum number of pods has been reached.&lt;br /&gt;
*Check Grafana for abnormal load.&lt;br /&gt;
*Restart the service.&lt;br /&gt;
*Collect the service logs; raise an investigation ticket.&lt;br /&gt;
|BasedOn=container_memory_working_set_bytes, kube_pod_container_resource_requests_memory_bytes&lt;br /&gt;
|Threshold=Container &amp;lt;nowiki&amp;gt;{{ $labels.container }}&amp;lt;/nowiki&amp;gt; memory usage exceeded 80% for 5 minutes.&lt;br /&gt;
}}{{PEAlert&lt;br /&gt;
|Alert=Pod CPU greater than 65%&lt;br /&gt;
|Severity=Warning&lt;br /&gt;
|AlertDescription=High CPU load for pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
Actions:&lt;br /&gt;
&lt;br /&gt;
*Check whether the horizontal pod autoscaler has triggered and if the maximum number of pods has been reached.&lt;br /&gt;
*Check Grafana for abnormal load.&lt;br /&gt;
*Collect the service logs; raise an investigation ticket.&lt;br /&gt;
|BasedOn=container_cpu_usage_seconds_total, container_spec_cpu_period&lt;br /&gt;
|Threshold=Container &amp;lt;nowiki&amp;gt;{{ $labels.container }}&amp;lt;/nowiki&amp;gt; CPU usage exceeded 65% for 5 minutes.&lt;br /&gt;
}}{{PEAlert&lt;br /&gt;
|Alert=Pod CPU greater than 80%&lt;br /&gt;
|Severity=Critical&lt;br /&gt;
|AlertDescription=Critical CPU load for pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
Actions:&lt;br /&gt;
&lt;br /&gt;
*Check whether the horizontal pod autoscaler has triggered and if the maximum number of pods has been reached.&lt;br /&gt;
*Check Grafana for abnormal load.&lt;br /&gt;
*Restart the service.&lt;br /&gt;
*Collect the service logs; raise an investigation ticket.&lt;br /&gt;
|BasedOn=container_cpu_usage_seconds_total, container_spec_cpu_period&lt;br /&gt;
|Threshold=Container &amp;lt;nowiki&amp;gt;{{ $labels.container }}&amp;lt;/nowiki&amp;gt; CPU usage exceeded 80% for 5 minutes.&lt;br /&gt;
}}&lt;br /&gt;
}}&lt;/div&gt;</summary>
		<author><name>Corinneh</name></author>
		
	</entry>
</feed>