Supported metrics for alerts

APM

Metric name
  • Error
  • Load
  • Response time
  • Instance response time
  • Instance error
  • Instance load
  • Endpoint response time
  • Endpoint error
  • Endpoint load
  • Error code
  • Apdex
  • Instance availability
  • Endpoint error code

.NET metrics

Metric name
  • Instance Clr CPU
  • Instance Clr Gen0 Collect Count
  • Instance Clr Gen1 Collect Count
  • Instance Clr Gen2 Collect Count
  • Instance Clr Heap Memory
  • Instance Clr Total Committed Bytes
  • Instance Clr Total Reserved Bytes
  • Instance Clr Gen0 Heap Size
  • Instance Clr Gen1 Heap Size
  • Instance Clr Gen2 Heap Size
  • Instance Clr Bytes In All Heaps
  • Instance Clr Available Completion Port Threads
  • Instance Clr Available Worker Threads
  • Instance Clr Max Completion Port Threads
  • Instance Clr Max Worker Threads
  • Instance Clr Current Logical Thread
  • Instance Clr Current Physical Threads
  • Instance Clr Contention Rate
  • Instance Clr Current Queue Length
  • Instance Clr Total Contentions
  • Instance Clr Exception Thrown Per Sec
  • Instance Clr Exception Thrown
  • Instance Clr Throw To Catch Depth Per Sec
  • Instance Clr Filters Per Sec
  • Instance Clr Finallys Per Sec
  • Instance Clr Percent Time Loading
  • Instance Clr Bytes In Loader Heap
  • Instance Clr Total Load Failures
  • Instance Clr Total App Domains
  • Instance Clr Total App Domains Unloaded
  • Instance Clr Total Assemblies
  • Instance Clr Total Classes Loaded
  • Instance Clr Link Time Checks
  • Instance Clr Time in RT Checks
  • Instance Clr Time Sig Authenticating
  • Instance Clr Total Runtime Checks
  • Instance Clr CCWS
  • Instance Clr Marshalling
  • Instance Clr Stubs
  • Instance CLR TLB Exports Per Sec
  • Instance CLR TLB Imports Per Sec
  • Instance CLR CIL Bytes Jitted
  • Instance CLR Methods Jitted
  • Instance CLR Time In JIT
  • Instance CLR IL Bytes Jitted Per Sec
  • Instance CLR Bytes Received
  • Instance CLR Bytes Sent
  • Instance CLR Connections Established
  • Instance CLR Datagrams Received
  • Instance CLR Datagrams Sent

Browser Monitoring

Metric name
  • Error
  • Load
  • Redirect time
  • DNS time
  • TTFB time
  • TCP time
  • Trans time
  • DOM analysis time
  • FPT time
  • DOM ready time
  • Page load time
  • Response time
  • SSL time
  • TTL time
  • First pack time
  • FMP time
  • Apdex

Mobile Monitoring

Metric name
  • Total Crashes
  • Number of Requests
  • HTTP Response Time

Network Monitoring

Metric Name | Description
ICMP Device Availability | Indicates whether the device is reachable using ICMP (ping). A successful response means the device is up and accessible.
ICMP Latency (ms) | Measures the round-trip time in milliseconds for an ICMP ping request to reach the device and return.
ICMP Packet Loss | Represents the percentage of ICMP packets that were sent but did not receive a response from the device.
Active Connections | The number of currently active network connections established on the device.
CPU Utilisation | Percentage of CPU resources currently being used by the device.
Bandwidth In (bps) | The rate at which data is received on the interface, measured in bits per second.
Bandwidth Out (bps) | The rate at which data is transmitted from the interface, measured in bits per second.
Total Packets Received | Total number of packets successfully received by the device or interface.
Total Packets Sent | Total number of packets successfully transmitted by the device or interface.
Total Packets | Combined total of packets sent and received on the device or interface.
Incoming Errors | Number of packets received with errors such as CRC errors, frame errors, or alignment errors.
Outgoing Errors | Number of packets that failed to transmit due to errors.
Total Errors | Total count of both incoming and outgoing packet errors.
Bytes Received | Total number of bytes successfully received by the interface.
Bytes Sent | Total number of bytes successfully transmitted from the interface.
Unicast Packets Incoming | Number of incoming packets addressed to a single specific destination.
Unicast Packets Outgoing | Number of outgoing packets sent to a single specific destination.
Multicast Packets Incoming | Number of packets received that are addressed to a multicast group.
Multicast Packets Outgoing | Number of packets transmitted to multicast group addresses.
Broadcast Packets Incoming | Number of broadcast packets received by the device or interface.
Broadcast Packets Outgoing | Number of broadcast packets transmitted by the device or interface.
Traffic Incoming | The amount of incoming network traffic received by the interface over time.
Traffic Outgoing | The amount of outgoing network traffic transmitted from the interface over time.
Octets Received | Total number of octets (bytes) received by the interface as reported by SNMP counters.
Octets Sent | Total number of octets (bytes) transmitted by the interface as reported by SNMP counters.
Disk Usage | Percentage or amount of disk storage currently used on the device.
Disk Usage Available | Amount of disk storage space remaining and available for use.
Bandwidth Utilisation In (%) | Percentage of incoming bandwidth currently being used relative to the interface's total capacity.
Bandwidth Utilisation Out (%) | Percentage of outgoing bandwidth currently being used relative to the interface's total capacity.
Interface Utilisation | Overall usage level of a network interface based on traffic relative to its maximum speed.
Interface Speed | Maximum data transfer capacity of the network interface, typically measured in bits per second (bps).
Memory Utilisation | Percentage of system memory (RAM) currently being used by the device.
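
Several of the Network Monitoring metrics are ratios over raw counters. The following is a minimal sketch of how such values could be derived, assuming packet loss is the share of unanswered pings and bandwidth utilisation is the observed rate divided by the interface speed. The function names are hypothetical, counter wrap-around is ignored, and the product's actual computation may differ.

```python
def icmp_packet_loss_pct(sent: int, received: int) -> float:
    """Percentage of ICMP packets that were sent but got no response."""
    if sent <= 0:
        raise ValueError("sent must be positive")
    return 100.0 * (sent - received) / sent


def rate_bps_from_octets(octets_start: int, octets_end: int, interval_s: float) -> float:
    """Turn two octet counter readings into a rate in bits per second.

    Assumes the counter did not wrap between the two readings.
    """
    if interval_s <= 0:
        raise ValueError("interval must be positive")
    return (octets_end - octets_start) * 8 / interval_s


def bandwidth_utilisation_pct(rate_bps: float, interface_speed_bps: float) -> float:
    """Observed rate relative to the interface's total capacity."""
    if interface_speed_bps <= 0:
        raise ValueError("interface speed must be positive")
    return 100.0 * rate_bps / interface_speed_bps
```

For example, 9 replies to 10 pings gives a packet loss of 10%, and 250 Mbps of traffic on a 1 Gbps interface gives a utilisation of 25%.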

VM Monitoring

Metric name
  • CPU usage (%)
  • Memory used (Bytes)
  • Memory available (%)
  • Memory swap available (Bytes)
  • Memory swap used (%)
  • File system total available (%)
  • Disk reads (Bytes/sec)
  • Disk writes (Bytes/sec)
  • Network received (Bytes/sec)
  • Network transmitted (Bytes/sec)
  • Network total (Bytes/sec)
  • Instance availability
  • File system available (%) - Monitors the common File system available % on the instances selected

Solaris VM

Metric Name | Description
CPU total usage (%) | Total CPU usage across all cores of the system.
CPU user usage (%) | Percentage of CPU time spent running user-level processes.
CPU system usage (%) | Percentage of CPU time spent running system/kernel processes.
CPU steal usage (%) | Percentage of CPU time stolen from the VM by the hypervisor for other tasks.
CPU idle usage (%) | Percentage of CPU time spent idle.
CPU load 1min | Average number of processes in the CPU run queue over the last 1 minute.
CPU load 5min | Average number of processes in the CPU run queue over the last 5 minutes.
CPU load 15min | Average number of processes in the CPU run queue over the last 15 minutes.
CPU context switches/sec | Number of times per second the CPU switches between processes.
CPU interrupts/sec | Number of hardware interrupts handled by the CPU per second.
Page in/out rate | Rate at which memory pages are swapped in and out per second, indicating memory pressure.
Memory total | Total physical memory available on the host.
Memory available | Amount of physical memory currently free/available.
Memory used | Amount of physical memory currently in use.
Memory used (%) | Percentage of physical memory currently used.
Swap used | Amount of swap space currently used.
Swap free | Amount of swap space currently free.
Swap total | Total configured swap space on the system.
Disk total | Total disk storage capacity across all volumes/disks.
Disk available | Amount of disk storage currently free/available.
Disk used | Amount of disk storage currently used.
Disk throughput total | Combined read and write throughput of the disk in bits/bytes per second.
Disk throughput read | Disk read throughput in bits/bytes per second.
Disk throughput write | Disk write throughput in bits/bytes per second.
Disk operations read per second | Number of read operations performed by the disk per second.
Disk operations write per second | Number of write operations performed by the disk per second.
Disk operations total per second | Total number of read and write operations per second.
Disk avg queue length | Average number of I/O requests waiting in the disk queue. High values indicate potential disk bottlenecks.
Disk avg wait time | Average time an I/O request spends waiting in the disk queue before being serviced.
Disk io read time (ms) | Average time taken to complete read operations from the disk.
Disk io write time (ms) | Average time taken to complete write operations to the disk.
Disk io total time (ms) | Average time taken to complete combined read and write operations on the disk.
Network packets sent/sec | Number of network packets sent per second by the VM.
Network packets received/sec | Number of network packets received per second by the VM.
Network packets total per second | Total number of network packets sent and received per second.
Network collisions per second | Number of network collisions per second. High values may indicate network congestion (mainly for legacy Ethernet).
Network errors sent per second | Number of network errors encountered while sending data per second.
Network errors received per second | Number of network errors encountered while receiving data per second.
Network errors total | Total number of network errors (sent + received) per second.
Network throughput sent | Volume of data sent over the network per second.
Network throughput received | Volume of data received over the network per second.
Network throughput total | Total network throughput (sent + received) per second.
Network nic count | Total number of network interfaces attached to the VM.
Network active nic count | Number of network interfaces currently active and transmitting data.
Network full duplex count | Number of network interfaces operating in full duplex mode.
Network half duplex count | Number of network interfaces operating in half duplex mode.
Availability | Indicates whether the VM/system or service is available and reachable.
Storage available (%) | Percentage of total storage that is currently free/available.
Memory available (%) | Percentage of total memory that is currently free/available.
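
Some Solaris VM metrics are described as totals or percentages of other metrics in the same table (for example, "sent + received" or "percentage of total memory"). The sketch below illustrates those relationships on a single sample. The `SolarisSample` class and its field names are hypothetical and are not part of any agent or API.

```python
from dataclasses import dataclass


@dataclass
class SolarisSample:
    """One hypothetical reading of a few raw Solaris VM values."""
    memory_total: float
    memory_available: float
    errors_sent_per_sec: float
    errors_received_per_sec: float

    def memory_available_pct(self) -> float:
        # "Memory available (%)": free memory as a share of total memory.
        return 100.0 * self.memory_available / self.memory_total

    def memory_used_pct(self) -> float:
        # "Memory used (%)": whatever is not available, as a share of total.
        return 100.0 - self.memory_available_pct()

    def network_errors_total(self) -> float:
        # "Network errors total": sent + received errors per second.
        return self.errors_sent_per_sec + self.errors_received_per_sec


sample = SolarisSample(
    memory_total=16.0,
    memory_available=4.0,
    errors_sent_per_sec=2.0,
    errors_received_per_sec=3.0,
)
```

With this sample, memory available is 25%, memory used is 75%, and the network error total is 5 per second.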

Synthetic Monitoring

Group level

  • Response time
  • Availability (%)

Endpoint level

  • Endpoint response time
  • Endpoint availability
Note: You can configure alerts at the group level, where they are evaluated across all endpoints in the group, or at the endpoint level, for monitoring individual endpoints.
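
The difference between the two alert scopes can be sketched as follows. This is an illustration only: it assumes a group-level alert compares the average response time of all endpoints against the threshold, which the documentation does not confirm, and both function names are hypothetical.

```python
from statistics import mean
from typing import List, Mapping


def group_alert_fires(response_times_ms: Mapping[str, float], threshold_ms: float) -> bool:
    """Group level: one decision for the whole group.

    Assumption: the group value is the mean across all endpoints.
    """
    return mean(list(response_times_ms.values())) > threshold_ms


def endpoint_alerts(response_times_ms: Mapping[str, float], threshold_ms: float) -> List[str]:
    """Endpoint level: each endpoint is checked on its own."""
    return sorted(
        name for name, value in response_times_ms.items() if value > threshold_ms
    )


samples = {"login": 120.0, "search": 480.0, "checkout": 300.0}
```

With a 400 ms threshold, the group average of 300 ms does not breach, while an endpoint-level alert would still fire for `search`.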

Application level

Metric name
  • Error
  • Load
  • Redirect time
  • DNS time
  • TTFB time
  • TCP time
  • Trans time
  • DOM analysis time
  • FPT time
  • DOM ready time
  • Page load time
  • Response time
  • SSL time
  • TTL time
  • First pack time
  • FMP time

Database Monitoring

Oracle

Metric name
  • CPU usage per second
  • Amount of file read time per second
  • Amount of file write time per second
  • Number of long running (>60s) queries per second
  • I/O megabytes per second
  • I/O requests per second
  • User transaction per second
  • Number of redo log file switch
  • Number of redo log waits
  • Hit ratio for the SGA
  • SGA cacheable memory per statement in MB
  • SGA cacheable memory per user in MB
  • Memory usage in MB
  • Active Sessions
  • Abandoned Sessions
  • Active Connections
  • Inactive Connections
  • Instance Availability

MsSQL

Metric name
  • Buffer pool size per database
  • IO stall
  • Transaction log growth
  • Page file available
  • Page file total
  • Active connections
  • Memory available
  • Memory utilization
  • Background processes count
  • Blocked processes count
  • Disk size
  • Force parameterizations per second
  • Runnable tasks count
  • Running processes count
  • Sleeping processes count
  • Suspended processes count
  • Transactions per second
  • User connections
  • Deadlocks per second
  • Lock waits per second
  • SQL compilations per second
  • SQL recompilations per second
  • User errors per second
  • Bufferpool batch requests per second
  • Bufferpool page life expectancy
  • Buffer pool size
  • System buffer pool hit percent
  • System wait time in milliseconds per second
  • Buffer checkpoint pages per second
  • Access page splits per second

MySQL

Metric name
  • Inno DB buffer pool pages data
  • Inno DB buffer pool pages free
  • Inno DB row lock current waits
  • Inno DB row lock time average
  • Inno DB row lock waits per second
  • Total open files
  • Total open tables
  • Table locks waited per second
  • Max connection errors per second
  • Max used connections
  • Insert queries per second
  • Select queries per second
  • Prepared statement count per second
  • Queries per second
  • Slow queries per second
  • Total threads connected
  • Total threads running

Postgres

Metric name
  • Number of active connections
  • Number of disk blocks reads
  • Number of buffer hits
  • Cpu utilization (%)
  • Memory usage
  • Network bytes received
  • Number of inactive connections
  • Number of queries cancelled due to deadlocks
  • Number of queries cancelled due to dropped tablespaces
  • Number of queries cancelled due to lock timeouts
  • Number of queries cancelled due to old snapshots
  • Number of queries cancelled due to pinned buffers
  • Number of rows deleted
  • Number of rows fetched
  • Number of rows inserted
  • Number of rows returned
  • Number of rows updated
  • Number of Sessions
  • Number of sessions abandoned
  • Number of transactions committed
  • Number of transactions rolled back
  • Instance availability
  • Number of live tuples
  • Number of dead tuples

IBM DB2

Metric Name | Description
Disk usage | Disk space utilization
CPU usage | CPU utilization of the host
Disk request | Disk I/O request rate in DB2 Database
Memory usage | Memory utilization
Total active connections | Number of currently active connections
Total active sessions | Total number of active DB sessions
Sessions table | Table displaying details of DB sessions
Insert queries per second | Number of insert queries executed per second
Slow queries per second | Number of slow queries executed per second
Total queries | Total number of queries executed
Locks waited per second | Number of lock waits per second
Max connection per second | Maximum number of connections per second
Total buffer pool reads | Total number of reads from buffer pool
Buffer pool cache hit ratio | Percentage of reads served from buffer pool (cache hits)
Transactions per second | Number of transactions executed per second
Buffer pool size per DB | Buffer pool size allocated per database
System wait time (ms) | Average wait time of system resources
Deadlocks per second | Number of deadlocks detected per second
Total open tables | Total number of open tables in the database

Log monitoring

Metric name: Value

Kubernetes monitoring

Metric name
  • Total cluster nodes in Ready status
  • Total cluster nodes in Not Ready status
  • Total cluster deployments in Available status
  • Total cluster deployments in Not Available status
  • Total cluster pods in Running status
  • Total cluster pods in Not Running status
  • Total service pods in Not Running status
  • Node storage used in %
  • Pod waiting reason
  • Pod last terminated reason

AWS monitoring

RDS MySQL, RDS MsSQL, and Postgres

Metric name
  • CPU Utilization
  • Read IOPS
  • Write IOPS
  • I/O operations waiting for disk access
  • Milliseconds per read I/O operation
  • Milliseconds per write I/O operation
  • Freeable Memory
  • Swap Usage
  • Free Storage Space
  • Network Receive Throughput
  • Database Connections
  • Replica Lag
  • Read Throughput
  • Write Throughput

Lambda

Metric name
  • Invocations
  • Errors
  • Duration
  • Concurrent Executions
  • Iterator Age
  • Dead Letter Errors
  • Throttles
  • Estimated Cost
  • Memory Utilization
  • Cold_Starts
  • CPU Total Time

Ingress

Metric name
  • ALB Errors
  • NLB Rejected Request Count
  • NLB Errors
  • NLB Port Allocation Error
  • NLB Client TLS Negotiation Error Count
  • NLB Target TLS Negotiation Error Count
  • NLB Processed Bytes
  • NLB Active Flow Count
  • CLB Total Request Count
  • CLB Estimated Processed Bytes
  • CLB Latency
  • CLB Errors
  • CLB Rejected Request Count
  • CLB 2XX Count
  • CLB 3XX Count
  • CLB 4XX Errors
  • CLB 5XX Errors

MQ Monitoring

Redis MQ

Metric Name | Description
Total connections | Number of currently connected clients
Total commands | Delta of total commands processed
Connection usage (%) | % of connections used (connected_clients / maxclients)
Last chances (bytes) | Maximum of recent input/output buffers per client
Blocked clients | Number of clients blocked on commands
Connected blocked clients | Number of clients blocked
Expiring key | Number of keys with TTL
Non-expiring key | Number of keys without TTL
Total number of keys | Total keys across all Redis databases
Command per second | Commands processed per second
Evicted objects | Delta of keys evicted due to memory limits
Expired objects | Delta of keys expired
CPU Usage (%) | Sum of system + user CPU usage %
Response time | Ping latency to Redis instance (ms)
Network in throughput | Delta of network input bytes
Network out throughput | Delta of network output bytes
Latency | Event loop latency (ms)
Cache hit ratio | Cache hit ratio % (hits / (hits + misses))
Evictions | Delta of evicted keys
Fragmentation ratio | Memory fragmentation ratio
Used Memory (%) | Percentage of used memory vs total system memory
Active connections | Connected clients minus blocked clients
Blocked connections | Number of clients blocked on commands
Rejected connections | Delta of rejected connections
Connected clients | Number of currently connected clients
Connected slaves | Number of connected Redis replicas
Client input buffer | Maximum recent input buffer per client
Client output list | Maximum recent output buffer per client
Keyspace misses | Delta of keyspace misses
Master link down since seconds | Time in seconds since master link went down
Connected replicas | Connected Redis replicas count
Error count | Delta of general Redis errors
Auth error count | Delta of authentication errors
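
The Redis descriptions above give explicit formulas for a few metrics, and several others are described as a "delta" between readings. The sketch below works through those formulas on plain dictionaries. The keys mirror field names from the Redis `INFO` output (`connected_clients`, `maxclients`, `blocked_clients`, `keyspace_hits`, `keyspace_misses`, `total_commands_processed`); how the monitoring agent actually collects and stores them is an assumption, and the function names are hypothetical.

```python
from typing import Mapping

Info = Mapping[str, int]


def connection_usage_pct(info: Info) -> float:
    """Connection usage (%): connected_clients / maxclients."""
    return 100.0 * info["connected_clients"] / info["maxclients"]


def cache_hit_ratio_pct(info: Info) -> float:
    """Cache hit ratio: hits / (hits + misses), as a percentage."""
    hits = info["keyspace_hits"]
    total = hits + info["keyspace_misses"]
    # With no lookups yet there is no meaningful ratio; report 0.0.
    return 100.0 * hits / total if total else 0.0


def active_connections(info: Info) -> int:
    """Active connections: connected clients minus blocked clients."""
    return info["connected_clients"] - info["blocked_clients"]


def delta(previous: Info, current: Info, field: str) -> int:
    """Change in a cumulative counter between two consecutive readings."""
    return current[field] - previous[field]


previous = {
    "connected_clients": 40,
    "maxclients": 200,
    "blocked_clients": 4,
    "keyspace_hits": 900,
    "keyspace_misses": 100,
    "total_commands_processed": 5000,
}
current = {**previous, "total_commands_processed": 5600}
```

For these readings, connection usage is 20%, the cache hit ratio is 90%, there are 36 active connections, and the "Total commands" delta is 600.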