<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://all.docs.genesys.com/index.php?action=history&amp;feed=atom&amp;title=VM%2FCurrent%2FVMPEGuide%2FVoiceSIPProxyServiceMetrics</id>
	<title>VM/Current/VMPEGuide/VoiceSIPProxyServiceMetrics - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://all.docs.genesys.com/index.php?action=history&amp;feed=atom&amp;title=VM%2FCurrent%2FVMPEGuide%2FVoiceSIPProxyServiceMetrics"/>
	<link rel="alternate" type="text/html" href="https://all.docs.genesys.com/index.php?title=VM/Current/VMPEGuide/VoiceSIPProxyServiceMetrics&amp;action=history"/>
	<updated>2026-04-14T23:39:18Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.31.1</generator>
	<entry>
		<id>https://all.docs.genesys.com/index.php?title=VM/Current/VMPEGuide/VoiceSIPProxyServiceMetrics&amp;diff=116227&amp;oldid=prev</id>
		<title>Corinneh: Published</title>
		<link rel="alternate" type="text/html" href="https://all.docs.genesys.com/index.php?title=VM/Current/VMPEGuide/VoiceSIPProxyServiceMetrics&amp;diff=116227&amp;oldid=prev"/>
		<updated>2022-02-23T20:56:29Z</updated>

		<summary type="html">&lt;p&gt;Published&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;{{ArticlePEServiceMetrics&lt;br /&gt;
|IncludedServiceId=17df7b83-4bd2-492a-a705-d03c8cbab63a&lt;br /&gt;
|CRD=Supports both CRD and annotations&lt;br /&gt;
|Port=11400&lt;br /&gt;
|Endpoint=http://&amp;lt;pod-ipaddress&amp;gt;:11400/metrics&lt;br /&gt;
|MetricsUpdateInterval=30 seconds&lt;br /&gt;
|MetricsDefined=Yes&lt;br /&gt;
|MetricsIntro=Voice SIP Proxy Service exposes Genesys-defined, SIP Proxy Service–specific metrics as well as some standard Kafka metrics. You can query Prometheus directly to see all the metrics that the SIP Proxy Service exposes. The following metrics are likely to be particularly useful. Genesys does not commit to maintain other currently available SIP Proxy Service metrics not documented on this page.&lt;br /&gt;
|PEMetric={{PEMetric&lt;br /&gt;
|Metric=sipproxy_requests_total&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|Label=method&lt;br /&gt;
|MetricDescription=Total number of received requests.&lt;br /&gt;
|UsedFor=Traffic&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sipproxy_rejected_requests_total&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=The total number of rejected requests.&lt;br /&gt;
|UsedFor=Errors&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sipproxy_requests_processed_self_total&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|Label=method&lt;br /&gt;
|MetricDescription=The total number of received requests that were processed by SIP Proxy itself.&lt;br /&gt;
|UsedFor=Traffic&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sipproxy_requests_forwarded_total&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|Label=method, request_direction, sip_node_id&lt;br /&gt;
|MetricDescription=The total number of forwarded requests.&lt;br /&gt;
|UsedFor=Traffic&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sipproxy_requests_sip_node_reselected_total&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Total count of sip-node reselection.&lt;br /&gt;
|UsedFor=Errors&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sipproxy_responses_forwarded_total&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|Label=method, sip_node_id, request_direction&lt;br /&gt;
|MetricDescription=Total count of forwarded responses.&lt;br /&gt;
|UsedFor=Traffic&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sipproxy_response_latency&lt;br /&gt;
|Type=histogram&lt;br /&gt;
|Label=le, sip_node_id, request_direction, target, node_in_cache&lt;br /&gt;
|MetricDescription=SIP response latency.&lt;br /&gt;
|UsedFor=Latency&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sipproxy_register_processed_total&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Total number of REGISTER requests that SIP Proxy received for processing.&lt;br /&gt;
|UsedFor=Traffic&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sipproxy_register_rejected_total&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Total number of REGISTER requests for processing that were rejected.&lt;br /&gt;
|UsedFor=Errors&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sipproxy_calls_per_second_count&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Current calculated calls per second.&lt;br /&gt;
|UsedFor=Saturation&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sipproxy_active_sip_nodes_count&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Current number of active SIP nodes.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sipproxy_sip_nodes_count&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Current number of discovered SIP nodes.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sipproxy_tenants_count&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Current count of discovered tenants.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sipproxy_consul_record_processing_errors_count&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Current number of errors while processing records got from Consul.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sipproxy_consul_errors_count&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Current number of Consul errors.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sipproxy_sip_node_is_capacity_available&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Label=sip_node_id&lt;br /&gt;
|MetricDescription=Indicates whether SIP node has available capacity or not.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=service_version_info&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|Label=version&lt;br /&gt;
|MetricDescription=Displays the version of Voice SIP Proxy Service that is currently running. In the case of this metric, the labels provide the important information. The metric value is always 1 and does not provide any information.&lt;br /&gt;
|SampleValue=service_version_info{version=&amp;quot;100.0.1000006&amp;quot;} 1&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sipproxy_health_level&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Health level of the SIP Proxy node:&lt;br /&gt;
&lt;br /&gt;
-1 – fail&amp;lt;br /&amp;gt;&lt;br /&gt;
0 – starting&amp;lt;br /&amp;gt;&lt;br /&gt;
1 – degraded&amp;lt;br /&amp;gt;&lt;br /&gt;
2 – pass&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sipproxy_envoy_proxy_status&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Status of the Envoy proxy:&lt;br /&gt;
&lt;br /&gt;
-1 – error&amp;lt;br /&amp;gt;&lt;br /&gt;
0 – disconnected&amp;lt;br /&amp;gt;&lt;br /&gt;
1 – connected&lt;br /&gt;
|SampleValue=1&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sipproxy_config_node_status&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Status of the Config node connection: &lt;br /&gt;
&lt;br /&gt;
0 – disconnected&amp;lt;br /&amp;gt;&lt;br /&gt;
1 – connected&lt;br /&gt;
|SampleValue=1&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sip_server_transactions_created_total&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Total number of created server transactions.&lt;br /&gt;
|UsedFor=Traffic&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sip_client_transactions_created_total&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Total number of created client transactions.&lt;br /&gt;
|UsedFor=Traffic&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sip_server_transactions_deleted_total&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Total number of deleted server transactions.&lt;br /&gt;
|UsedFor=Traffic&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sip_client_transactions_deleted_total&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Total number of deleted client transactions.&lt;br /&gt;
|UsedFor=Traffic&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sip_client_transactions_count&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Current number of client transactions.&lt;br /&gt;
|UsedFor=Saturation&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sip_server_transactions_count&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Current number of server transactions.&lt;br /&gt;
|UsedFor=Saturation&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sip_server_transactions_rejected_total&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Total number of server transactions rejected for internal reasons.&lt;br /&gt;
|UsedFor=Errors&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sip_proxy_contexts_count&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Current number of active SIP Proxy forwarding contexts.&lt;br /&gt;
|UsedFor=Saturation&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sip_received_bytes_total&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=bytes&lt;br /&gt;
|Label=transport&lt;br /&gt;
|MetricDescription=Total traffic received, measured in bytes.&lt;br /&gt;
|UsedFor=Traffic&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sip_sent_bytes_total&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=bytes&lt;br /&gt;
|Label=transport&lt;br /&gt;
|MetricDescription=Total traffic sent, measured in bytes.&lt;br /&gt;
|UsedFor=Traffic&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sip_transport_errors_total&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|Label=transport, address&lt;br /&gt;
|MetricDescription=Total number of transport errors.&lt;br /&gt;
|UsedFor=Errors&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sip_stream_transport_wait_drain_total&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Total number of requests to wait for drain events on stream transports.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=sip_stream_transport_flood_total&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|MetricDescription=Total number of flood events on the stream transports.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=http_client_request_duration_seconds&lt;br /&gt;
|Type=histogram&lt;br /&gt;
|Unit=seconds&lt;br /&gt;
|Label=le, target_service_name&lt;br /&gt;
|MetricDescription=The time duration between the HTTP client request and the response, measured in seconds.&lt;br /&gt;
|UsedFor=Latency&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=http_client_response_count&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=N/A&lt;br /&gt;
|Label=target_service_name, status&lt;br /&gt;
|MetricDescription=The number of HTTP client responses received.&lt;br /&gt;
|UsedFor=Traffic&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=log_output_bytes_total&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Unit=bytes&lt;br /&gt;
|Label=level, format, module&lt;br /&gt;
|MetricDescription=The total amount of log output, measured in bytes.&lt;br /&gt;
|SampleValue=&amp;lt;font size=&amp;quot;-1&amp;quot;&amp;gt;&amp;lt;tt&amp;gt;log_output_bytes_total{level=&amp;quot;info&amp;quot;,format=&amp;quot;txt&amp;quot;,module=&amp;quot;sipproxy_node@config-manager&amp;quot;}&amp;amp;nbsp;3175&amp;lt;/tt&amp;gt;&amp;lt;/font&amp;gt;&amp;lt;br/&amp;gt;&amp;lt;font size=&amp;quot;-1&amp;quot;&amp;gt;&amp;lt;tt&amp;gt;log_output_bytes_total{level=&amp;quot;info&amp;quot;,format=&amp;quot;txt&amp;quot;,module=&amp;quot;sipproxy_node@sipproxy-node&amp;quot;}&amp;amp;nbsp;96&amp;lt;/tt&amp;gt;&amp;lt;/font&amp;gt;&amp;lt;br/&amp;gt;&amp;lt;font size=&amp;quot;-1&amp;quot;&amp;gt;&amp;lt;tt&amp;gt;log_output_bytes_total{level=&amp;quot;info&amp;quot;,format=&amp;quot;txt&amp;quot;,module=&amp;quot;sipproxy_node@sipproxy@sip&amp;quot;}&amp;amp;nbsp;181&amp;lt;/tt&amp;gt;&amp;lt;/font&amp;gt;&amp;lt;br/&amp;gt;&amp;lt;font size=&amp;quot;-1&amp;quot;&amp;gt;&amp;lt;tt&amp;gt;log_output_bytes_total{level=&amp;quot;info&amp;quot;,format=&amp;quot;json&amp;quot;,module=&amp;quot;sipproxy_node@config-manager&amp;quot;}&amp;amp;nbsp;4184&amp;lt;/tt&amp;gt;&amp;lt;/font&amp;gt;&amp;lt;br/&amp;gt;&amp;lt;font size=&amp;quot;-1&amp;quot;&amp;gt;&amp;lt;tt&amp;gt;log_output_bytes_total{level=&amp;quot;info&amp;quot;,format=&amp;quot;json&amp;quot;,module=&amp;quot;sipproxy_node@sipproxy-node&amp;quot;}&amp;amp;nbsp;135&amp;lt;/tt&amp;gt;&amp;lt;/font&amp;gt;&amp;lt;br/&amp;gt;&amp;lt;font size=&amp;quot;-1&amp;quot;&amp;gt;&amp;lt;tt&amp;gt;log_output_bytes_total{level=&amp;quot;info&amp;quot;,format=&amp;quot;json&amp;quot;,module=&amp;quot;sipproxy_node@sipproxy@sip&amp;quot;}&amp;amp;nbsp;259&amp;lt;/tt&amp;gt;&amp;lt;/font&amp;gt;&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=kafka_consumer_recv_messages_total&lt;br /&gt;
|Type=counter&lt;br /&gt;
|MetricDescription=Number of messages received from Kafka.&lt;br /&gt;
|UsedFor=Traffic&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=kafka_consumer_error_total&lt;br /&gt;
|Type=counter&lt;br /&gt;
|MetricDescription=Number of Kafka consumer errors.&lt;br /&gt;
|UsedFor=Errors&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=kafka_consumer_latency&lt;br /&gt;
|Type=histogram&lt;br /&gt;
|MetricDescription=Consumer latency is the time difference between when the message is produced and when the message is consumed. That is, the time when the consumer received the message minus the time when the producer produced the message.&lt;br /&gt;
|UsedFor=Latency&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=kafka_consumer_rebalance_total&lt;br /&gt;
|Type=counter&lt;br /&gt;
|MetricDescription=Number of Kafka consumer rebalance events.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=kafka_consumer_state&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|MetricDescription=Current state of the Kafka consumer.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=kafka_producer__messages_total&lt;br /&gt;
|Type=counter&lt;br /&gt;
|MetricDescription=Number of messages received from Kafka.&lt;br /&gt;
|UsedFor=Traffic&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=kafka_producer_queue_depth&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Label=kafka_location&lt;br /&gt;
|MetricDescription=Number of Kafka producer pending events.&lt;br /&gt;
|UsedFor=Saturation&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=kafka_producer_queue_age_seconds&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Unit=seconds&lt;br /&gt;
|Label=kafka_location&lt;br /&gt;
|MetricDescription=Age of the oldest producer pending event in seconds.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=kafka_producer_error_total&lt;br /&gt;
|Type=counter&lt;br /&gt;
|Label=kafka_location&lt;br /&gt;
|MetricDescription=Number of Kafka producer errors.&lt;br /&gt;
|UsedFor=Errors&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=kafka_producer_state&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Label=kafka_location&lt;br /&gt;
|MetricDescription=Current state of the Kafka producer.&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=kafka_producer_biggest_event_size&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Label=kafka_location, topic&lt;br /&gt;
|MetricDescription=Biggest event size so far.&lt;br /&gt;
|SampleValue=231&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=kafka_max_request_size&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|Label=kafka_location&lt;br /&gt;
|MetricDescription=Exposed config to compare with biggest event size.&lt;br /&gt;
|SampleValue=1000000&lt;br /&gt;
}}{{PEMetric&lt;br /&gt;
|Metric=kafka_producer_dropped_event_number&lt;br /&gt;
|Type=gauge&lt;br /&gt;
|MetricDescription=Number of dropped events.&lt;br /&gt;
}}&lt;br /&gt;
|AlertsDefined=Yes&lt;br /&gt;
|PEAlert={{PEAlert&lt;br /&gt;
|Alert=Too many Kafka pending events&lt;br /&gt;
|Severity=Critical&lt;br /&gt;
|AlertDescription=Too many Kafka producer pending events for pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt;. This alert means there are issues with SIP REGISTER processing on this voice-sipproxy.&lt;br /&gt;
&lt;br /&gt;
Actions:&lt;br /&gt;
&lt;br /&gt;
*Make sure there are no issues with Kafka or with the &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt; pod's CPU and network.&lt;br /&gt;
|BasedOn=kafka_producer_queue_depth&lt;br /&gt;
|Threshold=Too many Kafka producer pending events for service &amp;lt;nowiki&amp;gt;{{ $labels.container }}&amp;lt;/nowiki&amp;gt; (more than 100 in 5 minutes).&lt;br /&gt;
}}{{PEAlert&lt;br /&gt;
|Alert=SIP server response time too high&lt;br /&gt;
|Severity=Warning&lt;br /&gt;
|AlertDescription=Actions:&lt;br /&gt;
&lt;br /&gt;
*If the alarm is triggered for multiple sipproxy-nodes, make sure there are no issues on &amp;lt;nowiki&amp;gt;{{ $labels.sip_node_id }}&amp;lt;/nowiki&amp;gt;.&lt;br /&gt;
*If the alarm is triggered only for sipproxy-node &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt;, check to see if there is an issue with the service related to the topic (CPU, memory, or network overload).&lt;br /&gt;
|BasedOn=sipproxy_response_latency_bucket&lt;br /&gt;
|Threshold=SIP response latency for more than 95% of messages forwarded to &amp;lt;nowiki&amp;gt;{{ $labels.sip_node_id }}&amp;lt;/nowiki&amp;gt; is more than 1 second for sipproxy-node &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt;.&lt;br /&gt;
}}{{PEAlert&lt;br /&gt;
|Alert=Pod status failed&lt;br /&gt;
|Severity=Warning&lt;br /&gt;
|AlertDescription=Actions:&lt;br /&gt;
&lt;br /&gt;
*Restart the pod and check to see if there are any issues with the pod after restart.&lt;br /&gt;
|BasedOn=kube_pod_status_phase&lt;br /&gt;
|Threshold=Pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt; is in Failed state.&lt;br /&gt;
}}{{PEAlert&lt;br /&gt;
|Alert=Pod status Unknown&lt;br /&gt;
|Severity=Warning&lt;br /&gt;
|AlertDescription=Pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt; is in Unknown state.&lt;br /&gt;
&lt;br /&gt;
Actions:&lt;br /&gt;
&lt;br /&gt;
*Restart the pod and check to see if there are any issues with the pod after restart.&lt;br /&gt;
|BasedOn=kube_pod_status_phase&lt;br /&gt;
|Threshold=Pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt; is in Unknown state for 5 minutes.&lt;br /&gt;
}}{{PEAlert&lt;br /&gt;
|Alert=Pod status Pending&lt;br /&gt;
|Severity=Warning&lt;br /&gt;
|AlertDescription=Pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt; is in Pending state.&lt;br /&gt;
&lt;br /&gt;
Actions:&lt;br /&gt;
&lt;br /&gt;
*Restart the pod and check to see if there are any issues with the pod after restart.&lt;br /&gt;
|BasedOn=kube_pod_status_phase&lt;br /&gt;
|Threshold=Pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt; is in Pending state for 5 minutes.&lt;br /&gt;
}}{{PEAlert&lt;br /&gt;
|Alert=Pod status NotReady&lt;br /&gt;
|Severity=Critical&lt;br /&gt;
|AlertDescription=Pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt; is in NotReady state.&lt;br /&gt;
&lt;br /&gt;
Actions:&lt;br /&gt;
&lt;br /&gt;
*Restart the pod and check to see if there are any issues with the pod after restart.&lt;br /&gt;
|BasedOn=kube_pod_status_ready&lt;br /&gt;
|Threshold=Pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt; is in NotReady state for 5 minutes.&lt;br /&gt;
}}{{PEAlert&lt;br /&gt;
|Alert=Container restarted repeatedly&lt;br /&gt;
|Severity=Critical&lt;br /&gt;
|AlertDescription=Container &amp;lt;nowiki&amp;gt;{{ $labels.container }}&amp;lt;/nowiki&amp;gt; was repeatedly restarted.&lt;br /&gt;
&lt;br /&gt;
Actions:&lt;br /&gt;
&lt;br /&gt;
*Check to see if a new version of the image was deployed. Also check for issues with the Kubernetes cluster.&lt;br /&gt;
|BasedOn=kube_pod_container_status_restarts_total&lt;br /&gt;
|Threshold=Container &amp;lt;nowiki&amp;gt;{{ $labels.container }}&amp;lt;/nowiki&amp;gt; was restarted 5 or more times within 15 minutes.&lt;br /&gt;
}}{{PEAlert&lt;br /&gt;
|Alert=No sip-nodes available for 2 minutes&lt;br /&gt;
|Severity=Critical&lt;br /&gt;
|AlertDescription=No sip-nodes are available for the pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
Actions:&lt;br /&gt;
&lt;br /&gt;
*If the alarm is triggered for multiple services, make sure there are no issues with sip-nodes.&lt;br /&gt;
*If the alarm is triggered only for pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt;, check to see if there is any issues with the pod.&lt;br /&gt;
|BasedOn=sipproxy_active_sip_nodes_count&lt;br /&gt;
|Threshold=No sip-nodes are available for the pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt; for 2 minutes.&lt;br /&gt;
}}{{PEAlert&lt;br /&gt;
|Alert=sip-node capacity limit reached&lt;br /&gt;
|Severity=Warning&lt;br /&gt;
|AlertDescription=The sip-node &amp;lt;nowiki&amp;gt;{{ $labels.sip_node_id }}&amp;lt;/nowiki&amp;gt; hit capacity limit on &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
Actions:&lt;br /&gt;
&lt;br /&gt;
*If alarm is triggered for multiple services make sure there is no issues with sip-node &amp;lt;nowiki&amp;gt;{{ $labels.sip_node_id }}&amp;lt;/nowiki&amp;gt;.&lt;br /&gt;
*If alarm is triggered only for pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt; check if there is any issue with the pod&lt;br /&gt;
|BasedOn=sipproxy_sip_node_is_capacity_available&lt;br /&gt;
|Threshold=The sip-node &amp;lt;nowiki&amp;gt;{{ $labels.sip_node_id }}&amp;lt;/nowiki&amp;gt; hit capacity limit on &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt; for 3 consecutive minutes.&lt;br /&gt;
}}{{PEAlert&lt;br /&gt;
|Alert=Pod CPU greater than 80%&lt;br /&gt;
|Severity=Critical&lt;br /&gt;
|AlertDescription=Critical CPU load for pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
Actions:&lt;br /&gt;
&lt;br /&gt;
*Check whether the horizontal pod autoscaler has triggered and the maximum number of pods has been reached.&lt;br /&gt;
*Check Grafana for abnormal load.&lt;br /&gt;
*Collect the service logs for pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt; and raise an investigation ticket.&lt;br /&gt;
|BasedOn=container_cpu_usage_seconds_total, container_spec_cpu_period&lt;br /&gt;
|Threshold=Container &amp;lt;nowiki&amp;gt;{{ $labels.container }}&amp;lt;/nowiki&amp;gt; CPU usage exceeded 80% for 5 minutes.&lt;br /&gt;
}}{{PEAlert&lt;br /&gt;
|Alert=Pod CPU greater than 65%&lt;br /&gt;
|Severity=Warning&lt;br /&gt;
|AlertDescription=High CPU load for pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
Actions:&lt;br /&gt;
&lt;br /&gt;
*Check whether the horizontal pod autoscaler has triggered and the maximum number of pods has been reached.&lt;br /&gt;
*Check Grafana for abnormal load.&lt;br /&gt;
*Collect the service logs for pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt; and raise an investigation ticket.&lt;br /&gt;
|BasedOn=container_cpu_usage_seconds_total, container_spec_cpu_period&lt;br /&gt;
|Threshold=Container &amp;lt;nowiki&amp;gt;{{ $labels.container }}&amp;lt;/nowiki&amp;gt; CPU usage exceeded 65% for 5 minutes.&lt;br /&gt;
}}{{PEAlert&lt;br /&gt;
|Alert=Pod memory greater than 80%&lt;br /&gt;
|Severity=Critical&lt;br /&gt;
|AlertDescription=Critical memory usage for pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
Actions:&lt;br /&gt;
&lt;br /&gt;
*Check whether the horizontal pod autoscaler has triggered and the maximum number of pods has been reached.&lt;br /&gt;
*Check Grafana for abnormal load.&lt;br /&gt;
*Restart the service for pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt;.&lt;br /&gt;
|BasedOn=container_memory_working_set_bytes, kube_pod_container_resource_requests_memory_bytes&lt;br /&gt;
|Threshold=Container &amp;lt;nowiki&amp;gt;{{ $labels.container }}&amp;lt;/nowiki&amp;gt; memory usage exceeded 80% for 5 minutes&lt;br /&gt;
}}{{PEAlert&lt;br /&gt;
|Alert=Pod memory greater than 65%&lt;br /&gt;
|Severity=Warning&lt;br /&gt;
|AlertDescription=Pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt; has high memory usage.&lt;br /&gt;
&lt;br /&gt;
Actions:&lt;br /&gt;
&lt;br /&gt;
*Check whether the horizontal pod autoscaler has triggered and the maximum number of pods has been reached.&lt;br /&gt;
*Check Grafana for abnormal load.&lt;br /&gt;
*Collect the service logs for pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt; and raise an investigation ticket&lt;br /&gt;
|BasedOn=container_memory_working_set_bytes, kube_pod_container_resource_requests_memory_bytes&lt;br /&gt;
|Threshold=Container &amp;lt;nowiki&amp;gt;{{ $labels.container }}&amp;lt;/nowiki&amp;gt; memory usage exceeded 65% for 5 minutes.&lt;br /&gt;
}}{{PEAlert&lt;br /&gt;
|Alert=Config node fail&lt;br /&gt;
|Severity=Warning&lt;br /&gt;
|AlertDescription=The request to the config node failed.&lt;br /&gt;
&lt;br /&gt;
Action:&lt;br /&gt;
&lt;br /&gt;
*Check if there is any problem with pod &amp;lt;nowiki&amp;gt;{{ $labels.pod }}&amp;lt;/nowiki&amp;gt; and config node.&lt;br /&gt;
|BasedOn=http_client_response_count&lt;br /&gt;
|Threshold=Requests to the config node fail for 5 consecutive minutes.&lt;br /&gt;
}}&lt;br /&gt;
}}&lt;/div&gt;</summary>
		<author><name>Corinneh</name></author>
		
	</entry>
</feed>