Interaction Server (IXN) metrics and alerts
Find the metrics Interaction Server (IXN) exposes and the alerts defined for Interaction Server (IXN).
Service | CRD or annotations? | Port | Endpoint/Selector | Metrics update interval |
---|---|---|---|---|
Interaction Server (IXN) | PodMonitor | 13131, 13133, 13139 | option ixnServer.ports.health - default port 13131 - Endpoint: "/health/prometheus/all"; option ixnNode.ports.default - default port 13133 - Endpoint: "/metrics"; option ixnVQNode.ports.health - default port 13139 - Endpoint: "/metrics". Note: Each option above refers to the port that serves the listed endpoint. Use these options to perform the associated query. | Default |
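As an illustration of how the options, default ports, and endpoints in the table fit together, here is a minimal Python sketch that builds the scrape URL for each endpoint. The option names, ports, and paths come from the table. The `http` scheme, the `scrape_urls` helper, the `ScrapeTarget` class, and the example host are assumptions made for illustration, not part of the product.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class ScrapeTarget:
    option: str  # configuration option that sets the port (from the table)
    port: int    # default port (from the table)
    path: str    # metrics endpoint path (from the table)


TARGETS = [
    ScrapeTarget("ixnServer.ports.health", 13131, "/health/prometheus/all"),
    ScrapeTarget("ixnNode.ports.default", 13133, "/metrics"),
    ScrapeTarget("ixnVQNode.ports.health", 13139, "/metrics"),
]


def scrape_urls(host: str, overrides: Optional[Dict[str, int]] = None) -> List[str]:
    """Return one URL per endpoint, using an overridden port where given.

    The plain-HTTP scheme is an assumption; adjust it for your deployment.
    """
    overrides = overrides or {}
    return [
        f"http://{host}:{overrides.get(t.option, t.port)}{t.path}"
        for t in TARGETS
    ]


if __name__ == "__main__":
    # "ixn-pod.example" is a hypothetical host name.
    for url in scrape_urls("ixn-pod.example"):
        print(url)
```

If a deployment changes one of the port options, pass the new value in `overrides` keyed by the option name, for example `scrape_urls(host, {"ixnNode.ports.default": 9000})`.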
Metrics
This table includes IXN Server metrics and IXN Node metrics. IXN Node metrics begin with the prefix ixnnode.
Note: There are more metrics than the ones listed in the table. However, only the metrics listed in the table are supported.
Metric and description | Metric details | Indicator of |
---|---|---|
ixn_ Indicates the number of clients currently connected to IXN. |
Unit: Amount Type: Gauge |
Workload |
ixn_ Indicates the number of routers currently connected to IXN. |
Unit: Amount Type: Gauge |
Workload, Operability |
ixn_ Indicates the number of clients with specified type, connected to IXN. Label descriptions:
|
Unit: Amount Type: Gauge |
Workload |
ixn_ Indicates the total number of interactions that have been submitted to the router. Label descriptions: router_name - the name of the router into which the interactions have been submitted. |
Unit: Amount Type: Counter |
Workload, Operability |
ixn_ Indicates the number of strategies with specified name loaded into a specified router. Label descriptions:
|
Unit: Amount Type: Gauge |
Workload |
ixn_ Indicates the current capacity of a specified router - the number of interactions, not including those already submitted, that can be submitted into the router. Label descriptions: router_name - name of router. |
Unit: Amount Type: Gauge |
Workload, Operability |
ixn_ Indicates the number of interactions that are in a specified router. Label descriptions: router_name - name of router. |
Unit: Amount Type: Gauge |
Workload, Operability |
ixn_ Indicates number of strategies which are associated with active submitters. |
Unit: Amount Type: Gauge |
Workload |
ixn_ Indicates the maximum capacity of the specified router - the number of interactions that can be submitted into the router. Label descriptions:
ixn_health_info_router_max_submitted = ixn_health_info_router_currently_submitted + ixn_health_info_router_current_capacity
|
Unit: Amount Type: Gauge |
Workload, Operability |
ixn_ Indicates the current database requests queue length. |
Unit: Amount Type: Gauge |
Workload, Operability |
ixn_ Indicates the number of database requests processed since the IXN application started. |
Unit: Amount Type: Counter |
Workload, Operability |
ixn_ Indicates the current number of DB connections. |
Unit: Amount Type: Gauge |
Workload, Operability |
ixn_ Indicates the total number of database queries that have ended in a deadlock since IXN started. |
Unit: Amount Type: Counter |
Workload, Operability |
ixn_ Indicates the Unix timestamp at which the last interaction was submitted to a router for the specified strategy. Label descriptions:
|
Unit: Unix timestamp Type: Gauge |
Workload, Operability |
ixn_ Indicates the current number of interactions with specified media type that are waiting for processing in a specified queue. Label descriptions:
Note: This value is provided in Pulse as well. |
Unit: Amount Type: Gauge |
Workload |
ixn_ Indicates the current number of logged in agents. Label descriptions: agent_tenant - tenant number. |
Unit: Amount Type: Gauge |
Workload |
ixn_ Indicates the number of the interactions with specified media type from a specified queue being routed. Label descriptions:
|
Unit: Amount Type: Gauge |
Workload, Operability |
ixn_ Indicates the number of the interactions with specified media type from specified queue being handled by agents. Label descriptions:
|
Unit: Amount Type: Gauge |
Workload, Operability |
ixn_ Indicates the number of interactions with the specified media type that are waiting for processing in the specified queue and were never delivered to an agent. Label descriptions:
Note: This value is provided in Pulse as well. |
Unit: Amount Type: Gauge |
Workload, Operability |
ixn_ Indicates the sum of the interactions with specified media type from specified queue being routed by routers and being handled by agents. Label descriptions:
ixn_health_info_queue_media_in_processing = ixn_health_info_queue_media_in_router + ixn_health_info_queue_media_on_agent Note: This value is provided in Pulse as well. |
Unit: Amount Type: Gauge |
Workload, Operability |
ixn_ Indicates the number of interactions currently submitted to the specified router by the specified strategy. Label descriptions:
|
Unit: Amount Type: Gauge |
Workload, Operability |
ixn_ Indicates the number of additional interactions that can be submitted to the specified router by the specified strategy. Label descriptions:
|
Unit: Amount Type: Gauge |
Workload, Operability |
ixn_ Indicates the number of interactions submitted to the specified router by the specified strategy since the IXN application started. Label descriptions:
|
Unit: Amount Type: Counter |
Workload, Operability |
ixnnode_ Indicates the total number of the interactions pulled for the specific strategy. Label descriptions: strategy - The name of the strategy for which interactions are pulled. |
Unit: Amount Type: Counter |
Workload, Operability |
ixnnode_ Indicates the current number of the routing sessions in routing. |
Unit: Amount Type: Gauge |
Workload |
ixnnode_ Indicates the total number of instructions (of any type) received from ORS service. |
Unit: Amount Type: Counter |
Workload, Operability |
ixnnode_ Indicates the total number of received routing instructions. Label descriptions:
|
Unit: Amount Type: Counter |
Workload, Operability |
ixnnode_ Indicates the status of Redis client. Label descriptions: redis_client - The Redis client instance for which the metric is present. It takes values "reader" and "writer". Value: 0 - Not Ready 1 - Ready |
Unit: Status Type: Gauge |
Operability |
ixnnode_ Indicates the total number of errors that occurred on the Redis client. Label descriptions: redis_client - The Redis client instance for which the metric is present. It takes values "reader" and "writer". |
Unit: Amount Type: Counter |
Error |
ixnnode_ Indicates the status of the connection to individual nodes of the Redis server (in singleton mode, this matches ixnnode_redis_client_status). Label descriptions:
Value: 0 - Ready 1 - Not Ready 2 - Wait (so far there have been no connection attempts) |
Unit: Status Type: Gauge |
Operability |
ixnnode_ Indicates the total number of errors that occurred on individual nodes of the Redis client (in singleton mode, this matches ixnnode_redis_client_errors_total). Label descriptions:
|
Unit: Amount Type: Counter |
Error |
ixnnode_ Indicates the total number of successfully completed redis commands. Label descriptions:
|
Unit: Amount Type: Counter |
Workload, Operability |
ixnnode_ Indicates the total number of failed Redis commands. Label descriptions:
|
Unit: Amount Type: Counter |
Error |
ixnnode_ Indicates the status of connection to RQ Service nodes. Label descriptions: rq_node - RQ Service node for which the metric is present. |
Unit: Status Type: Gauge |
Operability |
ixnnode_ Indicates the total number of failed requests to RQ Service. Label descriptions: type - The type of the failed requests. It takes values "isp_event" - interaction protocol events and "ixn_ping" - health check messages. |
Unit: Amount Type: Counter |
Error |
ixnnode_ Indicates the maximum number of routing instructions that can be processed in parallel. |
Unit: Amount Type: Gauge |
n/a |
ixnnode_ Indicates the number of instructions received from ORS currently being processed. Label descriptions: type - The type of the instruction. It takes values "isp_request" - routing instruction and "ixn_ping" - reply to health check message. |
Unit: Amount Type: Gauge |
Workload, Operability |
ixnnode_ Indicates the total number of RequestPull requests successfully completed by InteractionServer. Label descriptions: strategy - The strategy for which interactions are pulled. |
Unit: Amount Type: Counter |
Workload, Operability |
ixnnode_ Indicates the total number of route requests successfully sent to ORS. Label descriptions: strategy - The strategy to which requests are sent. |
Unit: Amount Type: Counter |
Workload, Operability |
ixnnode_ Indicates the total number of route requests that failed to be sent to ORS. Label descriptions: strategy - The strategy to which requests are sent. |
Unit: Amount Type: Counter |
Error |
ixnnode_ Indicates the number of routing instructions currently being processed by IXN Server. |
Unit: Amount Type: Gauge |
Workload |
ixnnode_ Indicates the total number of times an interaction was placed back in queue. Label descriptions:
|
Unit: Amount Type: Counter |
Workload, Error, ORS Error |
ixnnode_ Indicates the number of the strategies for which interactions currently are being pulled. |
Unit: Amount Type: Gauge |
Operability |
ixnnode_ Indicates the number of the strategies read from configuration for which interactions should be pulled. |
Unit: Amount Type: Gauge |
Operability |
ixnnode_ Indicates the total number of errors that occurred while fetching configuration from Configuration Service. |
Unit: Amount Type: Counter |
Error |
ixnnode_ Indicates the last time the configuration was successfully fetched from Configuration Service, as the number of seconds since January 1, 1970 UTC. |
Unit: Timestamp Type: Gauge |
Operability |
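The table states two sum relationships: ixn_health_info_router_max_submitted equals ixn_health_info_router_currently_submitted plus ixn_health_info_router_current_capacity, and ixn_health_info_queue_media_in_processing equals ixn_health_info_queue_media_in_router plus ixn_health_info_queue_media_on_agent. The following Python sketch shows one way to check such a relationship against scraped text, assuming the endpoint returns the standard Prometheus text exposition format. The helpers `parse_samples` and `sum_mismatches` are hypothetical, the parser handles only simple `name{labels} value` lines, and the sample values are invented for illustration.

```python
import re

# Matches `metric_name{label="value",...} 12.5`; the label block is optional.
SAMPLE_RE = re.compile(r'^([A-Za-z_:][A-Za-z0-9_:]*)(?:\{(.*)\})?\s+(\S+)')
LABEL_RE = re.compile(r'(\w+)="((?:[^"\\]|\\.)*)"')


def parse_samples(text):
    """Map (metric name, sorted label pairs) to the sample value."""
    samples = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):  # skip blanks, HELP and TYPE lines
            continue
        match = SAMPLE_RE.match(line)
        if not match:
            continue
        name, raw_labels, value = match.groups()
        labels = tuple(sorted(LABEL_RE.findall(raw_labels or "")))
        samples[(name, labels)] = float(value)
    return samples


def sum_mismatches(samples, total_name, part_names):
    """Return (labels, total, sum of parts) wherever total != sum of parts."""
    mismatches = []
    for (name, labels), total in samples.items():
        if name != total_name:
            continue
        parts = [samples.get((part, labels)) for part in part_names]
        if any(value is None for value in parts):
            continue  # a part is missing for this label set; nothing to compare
        if total != sum(parts):
            mismatches.append((dict(labels), total, sum(parts)))
    return mismatches


if __name__ == "__main__":
    # Invented sample values for a router named "r1".
    scraped = '''
ixn_health_info_router_max_submitted{router_name="r1"} 100
ixn_health_info_router_currently_submitted{router_name="r1"} 40
ixn_health_info_router_current_capacity{router_name="r1"} 60
'''
    print(sum_mismatches(
        parse_samples(scraped),
        "ixn_health_info_router_max_submitted",
        ["ixn_health_info_router_currently_submitted",
         "ixn_health_info_router_current_capacity"],
    ))
```

The same function can be applied to the queue relationship by passing ixn_health_info_queue_media_in_processing as the total and the in_router and on_agent metrics as the parts.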
Alerts
No alerts are defined for Interaction Server.