Keyboard shortcuts

Press or to navigate between chapters

Press S or / to search in the book

Press ? to show this help

Press Esc to hide this help

Monitoring

Node Readiness Controller exposes Prometheus-compatible metrics. This page describes the Prometheus metrics exposed by Node Readiness Controller for monitoring rule evaluation, taint operations, failures, and bootstrap progress.

Metrics Endpoint

The controller serves metrics on /metrics only when metrics are explicitly enabled. Depending on the installation, the endpoint is served either over HTTP or over HTTPS. See Installation for deployment details.

Supported Metrics

node_readiness_rules_total

Number of NodeReadinessRule objects tracked by the controller.

PropertyValue
Typegauge
Labelsnone
Recorded whenThe controller refreshes or removes a tracked rule

node_readiness_taint_operations_total

Total number of taint operations performed by the controller.

PropertyValue
Typecounter
Labelsrule, operation
Recorded whenThe controller successfully adds or removes a taint

Labels

LabelDescriptionValues
ruleNodeReadinessRule nameAny rule name
operationTaint operation performed by the controlleradd, remove

node_readiness_evaluation_duration_seconds

Duration of rule evaluations.

PropertyValue
Typehistogram
Labelsnone
BucketsPrometheus default histogram buckets
Recorded whenThe controller evaluates a rule against a node

node_readiness_failures_total

Total number of failure events recorded by the controller.

PropertyValue
Typecounter
Labelsrule, reason
Recorded whenThe controller records an evaluation failure or taint add/remove failure

Labels

LabelDescriptionValues
ruleNodeReadinessRule nameAny rule name
reasonFailure label recorded by the controllerEvaluationError, AddTaintError, RemoveTaintError

node_readiness_bootstrap_completed_total

Total number of nodes that have completed bootstrap.

PropertyValue
Typecounter
Labelsrule
Recorded whenThe controller marks bootstrap as completed for a node under a bootstrap-only rule

Labels

LabelDescriptionValues
ruleNodeReadinessRule nameAny rule name