Component Types and Metrics for Alert Policies
You create an alert policy for a component type. The component type determines which metrics you can select when you define a threshold.
The following table lists the component types and metrics for an alert policy:
| Component Type | Metric | Description |
|---|---|---|
| Topic | UNDER REPLICATED PARTITIONS COUNT | Total number of under-replicated partitions for a topic. |
| Topic | BYTES IN PER SEC | Bytes per second coming in to a topic. |
| Topic | BYTES OUT PER SEC | Bytes per second going out from a topic. Internal replication traffic is not counted. |
| Topic | OUT OF SYNC REPLICA COUNT | Total number of replicas that are not in sync with the leader for a topic. |
| Producer | IS PRODUCER ACTIVE | Checks whether a producer is active. |
| Cluster | ACTIVE CONTROLLER COUNT | Number of brokers in the cluster reporting as the active controller in the last interval. |
| Cluster | ONLINE BROKER COUNT | Number of brokers that are currently online. |
| Cluster | UNCLEAN LEADER ELECTION COUNT | Number of unclean partition leader elections in the cluster reported in the last interval. |
| Cluster | UNDER REPLICATED PARTITIONS COUNT | Total number of under-replicated topic partitions in the cluster. |
| Cluster | LEADER ELECTION PER SEC | Rate of partition leader elections. |
| Cluster | OFFLINE PARTITIONS COUNT | Total number of offline topic partitions in the cluster. |
| Consumer | CONSUMER GROUP LAG | How far consumer groups are behind the producers. |
| Consumer | IS CONSUMER ACTIVE | Checks whether a consumer is active. |
| Broker | BYTES IN PER SEC | Number of bytes per second produced to a broker. |
| Broker | ZOOKEEPER SESSION EXPIRATION PER SEC | Rate at which brokers experience ZooKeeper session expirations (number of expirations per second). |
| Broker | TOTAL PRODUCE REQUESTS PER SEC | Total number of produce requests to a broker per second. |
| Broker | PARTITION IMBALANCE PERCENTAGE | The partition imbalance for a broker. It is calculated as: `(abs(average_no_of_partitions_per_broker - actual_no_of_partitions_per_broker) / average_no_of_partitions_per_broker) * 100` |
| Broker | BYTES OUT PER SEC | Number of bytes per second fetched from a broker. Internal replication traffic is not counted. |
| Broker | IS BROKER DOWN | Checks whether a broker is down. |
| Broker | TOTAL PRODUCE REQUEST LATENCY | Latency of produce requests to this broker at the 99th percentile (in ms). |
| Broker | ISR SHRINKS PER SEC | Rate at which brokers experience in-sync replica (ISR) shrinks (number of shrinks per second). |
| Broker | TOTAL FETCH CONSUMER REQUEST LATENCY | Latency of fetch consumer requests to this broker at the 99th percentile (in ms). |
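
The PARTITION IMBALANCE PERCENTAGE formula in the table can be illustrated with a small worked example. The following is a minimal sketch, not the product's implementation: the function name, the dictionary of partition counts, and the broker IDs are hypothetical, and it assumes that the average is the total partition count divided by the number of brokers.

```python
def partition_imbalance_percentage(actual_partitions: int, average_partitions: float) -> float:
    """Return the partition imbalance for one broker, as a percentage.

    Mirrors the formula in the table:
    (abs(average - actual) / average) * 100
    """
    if average_partitions <= 0:
        raise ValueError("average_partitions must be positive")
    return (abs(average_partitions - actual_partitions) / average_partitions) * 100


# Hypothetical partition counts keyed by broker ID (illustrative values only).
partitions_per_broker = {1: 120, 2: 100, 3: 80}

# Assumption: the average is the total partition count divided by the broker count.
average = sum(partitions_per_broker.values()) / len(partitions_per_broker)

imbalance_by_broker = {
    broker_id: partition_imbalance_percentage(count, average)
    for broker_id, count in partitions_per_broker.items()
}

for broker_id, imbalance in imbalance_by_broker.items():
    print(f"broker {broker_id}: {imbalance:.1f}% imbalance")
```

With these sample counts the average is 100 partitions per broker, so brokers 1 and 3 each deviate by 20 partitions (20 percent) and broker 2 is balanced. Because the formula takes the absolute value, the metric does not show whether a broker holds too many or too few partitions.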
