Performing Hard Failover
- reference
Hard failover can be used to take a node out of a cluster, when the node has become unresponsive.
Description
When a node has become unresponsive, and active vBuckets have thereby become unavailable for the serving of data, hard failover should be used to take the unresponsive node out of the cluster.
During hard failover, the Cluster Manager promotes replica vBuckets to active status on the remaining cluster-nodes. A revised cluster-map is then communicated to all clients. When hard failover is complete, no further attempts are made to access vBuckets on the failed-over node, but the node has not yet been fully taken out of the cluster. The cluster is at this point considered to be in a degraded state, in terms of availability; as it now contains a reduced number of replica vBuckets, and may lack resources for handling a further node-outage. Rebalance should therefore be performed.
Hard failover should not be used as part of regular, planned maintenance activities that can be conducted with all cluster-nodes available. For information, see Failover.
Curl Syntax
curl -v -X POST -u <username>:<password> http://<ip-address-or-hostname>:<port>/controller/failOver -d [otpNode=node_1@<hostname>] -d allowUnsafe=< true | false >
The value of each specified otpNode
parameter must be the name of a node that will be hard-failed over.
The value of allowUnsafe
must be either true
or false
(which is the default); indicating whether or not hard failover should be attempted in unsafe mode.
For information, see Hard Failover in Default and Unsafe Modes.
Responses
Success returns 200 OK
.
Incorrectly specifying the target cluster returns Failed to connect
.
Incorrectly specifying the URI returns a 404 Object Not Found
error, and a Requested resource not found
notification.
Specifying an unknown server to be failed over returns a 400 Bad Request
notification, and an error message such as the following: Unknown server given: ["ns_1@10.143.201.106"]
.
Failure to authenticate returns a 401 Unauthorized
notification.
Examples
The following curl statement performs a hard failover of node 10.142.180.102
from a cluster identified as 10.142.180.1
.
bournecurl -v -X POST -u Administrator:password \ http://10.142.180.1:8091/controller/failOver \ -d 'otpNode=ns_1@10.142.180.102'
The following performs a hard failover of nodes 10.142.180.102
and 10.142.180.103
from a cluster identified as 10.142.180.1
.
Note that multi-node failover is always hard failover.
bournecurl -v -X POST -u Administrator:password \ http://10.142.180.1:8091/controller/failOver \ -d 'otpNode=ns_1@10.142.180.102' \ -d 'otpNode=ns_1@10.142.180.103'
The following performs an unsafe, hard failover of nodes 10.142.180.102
and 10.142.180.103
from a cluster identified as 10.142.180.1
.
bournecurl -v -X POST -u Administrator:password \ http://10.142.180.1:8091/controller/failOver \ -d otpNode=ns_1%4010.142.180.102\&otpNode=ns_1%4010.142.180.103 \ -d allowUnsafe=true
See Also
For a general introduction to failover, see Failover. For specifics related to hard failover, including safe and unsafe modes, see Hard Failover. For information on managing different kinds of failover with UI, CLI, and the REST API, see Fail a Node Over and Rebalance. For information on unsafe hard failover, see Hard Failover in Default and Unsafe Modes.