Environment Status
Cluster Monitoring
Rules for the PMK SaaS Management Plane
The Prometheus agent is configured to report numerous cluster metrics. Below is the YAML-based rule set that Catapult uses, including alert names, expression, timeframe, labels with type and severity, and annotations which contain the summary and description of the notification.
x
bash-5.0# cat qbert.ymlgroupsnameqbert rulesalertClusterStatusNotOK exprqbert_cluster_status_okjob="qbert" == 0 for10m labels typepf9 severitywarning annotations summaryCluster status is not ok! descriptionCluster status of $labels.cluster_name of type $labels.cloud_provider_type is not okalertNodeNotReady exprqbert_kube_node_readynode_status!="Ready" cluster_name!="undefined" == 0 for10m labels typepf9 severitywarning annotations summaryNode $labels.node_name is not ready! descriptionNode $labels.node_name cluster $labels.cluster_name role $labels.node_role is in state $labels.node_status alertK8sApiNotResponding exprqbert_node_status_oknode_role="master" == 0 for10m labels typepf9 severitywarning annotations summaryK8s api of cluster $labels.cluster_name is not responding! descriptionK8s api is not responding on cluster $labels.cluster_name Node $labels.node_hostname role $labels.node_role alertWorkerNodeNotResponding exprqbert_node_status_oknode_role="worker" == 0 for15m labels typepf9 severitywarning annotations summaryWorker node not responding! descriptionWorker node not responding on cluster $labels.cluster_name Node $labels.node_hostname role $labels.node_role bash-5.0#Was this page helpful?