Prometheus Metrics Reference
This page captures the metrics supplied to Prometheus by the Couchbase Autonomous Operator and links reference pages of a number of additional metrics that are exported by third party libraries.
Operator Metrics
Metric |
Type |
Unit |
Labels |
Optional Labels |
Stability |
Added |
Total number of backup jobs that have been created by the operator |
counter |
namespace,backup_type |
cluster_uuid,cluster_name |
committed |
2.8.0 |
|
Total cpu requests for operator managed pods in k8s cpu units |
gauge |
namespace,name |
cluster_uuid,cluster_name |
committed |
2.8.0 |
|
The number of times in place upgrades have failed |
counter |
name |
cluster_uuid,cluster_name |
committed |
2.7.0 |
|
Total number of in place upgrades performed by operator |
counter |
name |
cluster_uuid,cluster_name |
committed |
2.7.0 |
|
Total failed requests to the Kubernetes API by the operator |
counter |
method,host,path |
committed |
2.8.0 |
||
Length of time per request to the Kubernetes API |
histogram |
milliseconds |
method,host,path |
committed |
2.8.0 |
|
Total requests made to the Kubernetes API by the operator |
counter |
method,host,path |
committed |
2.8.0 |
||
Total memory requests for operator managed pods in bytes |
gauge |
bytes |
namespace,name |
cluster_uuid,cluster_name |
committed |
2.8.0 |
The time it takes for a pod to enter a ready state |
gauge |
milliseconds |
name,serverClass |
cluster_uuid,cluster_name |
committed |
2.7.0 |
Total number of times operator has recovered a pod when the pod has been down |
counter |
name,podName |
cluster_uuid,cluster_name |
committed |
2.7.0 |
|
Total number of times operator has failed to recover a pod |
counter |
name,podName |
cluster_uuid,cluster_name |
committed |
2.7.0 |
|
Total number of times pods have failed to be recovered by the operator |
counter |
name |
cluster_uuid,cluster_name |
committed |
2.7.0 |
|
The amount of times operator has replaced a couchbase server pod due to a change in a couchbase cluster resources |
counter |
name |
cluster_uuid,cluster_name |
committed |
2.7.0 |
|
Total failed reconcile operations performed on a specific cluster |
counter |
namespace,name |
cluster_uuid,cluster_name |
committed |
2.3.0 |
|
Length of time per reconcile for a specific cluster |
histogram |
seconds |
namespace,name |
cluster_uuid,cluster_name |
committed |
2.3.0 |
Total reconcile operations performed on a specific cluster |
counter |
namespace,name,result |
cluster_uuid,cluster_name |
committed |
2.3.0 |
|
Total HTTP requests to Couchbase Server for a specific cluster, method and status code returned |
counter |
name,method,code,service,host |
name,namespace |
committed |
2.3.0 |
|
Total failed HTTP requests to Couchbase Server for a specific cluster |
counter |
name,method,service,host |
name,namespace |
committed |
2.3.0 |
|
Length of time per request for a specific cluster |
histogram |
milliseconds |
name,method,service,host |
name,namespace |
committed |
2.3.0 |
Total HTTP requests to Couchbase Server for a specific cluster |
counter |
name,method,service,host |
name,namespace |
committed |
2.3.0 |
|
Total number of times swap rebalances have failed |
counter |
name |
cluster_uuid,cluster_name |
committed |
2.7.0 |
|
Total number of swap rebalances performed by the operator |
counter |
name |
cluster_uuid,cluster_name |
committed |
2.7.0 |
|
The time taken to perform an upgrade |
milliseconds |
name |
cluster_uuid,cluster_name |
committed |
2.7.0 |
|
Total number of times the size of volumes have been increased under management |
counter |
name,volumeName |
cluster_uuid,cluster_name |
committed |
2.7.0 |
|
Total memory claimed by volumes under management by the operator in bytes |
gauge |
bytes |
namespace,name |
cluster_uuid,cluster_name |
committed |
2.8.0 |