SolarWinds - How to Configure Failover Functionality

Follow


Summary

This Knowledge Base article provides information about how to configure failover functionality by modifying heartbeat settings.



More Information

The primary and secondary servers in a Orion Failover Engine pair are continuously connected via the SolarWinds Channel. In absence of replication, the passive server issues heartbeats at regular intervals of time to ensure the active server is online and available. The active server can become unresponsive due to a hardware crash, lost network connection, or a power outage. When the passive server finds that the heartbeat process has failed, it will initiate a failover.

The time interval between two heartbeats, can be changed using the SolarWinds Orion Failover Manager. The default value is configured for a 60 second interval. This value should be carefully changed as it can affect the server pair’s functionality.

Example:

  • Configuring the Failover timeout to a high value will increase the time interval before a failover.  As a result, the active server might stay unavailable too long before the passive server initiates the failover process.
  • Configuring the Failover timeout to a low value for a server pair that has low channel bandwidth, may cause a Split-brain syndrome. For more information about split-brain syndrome, see SWREFID - 1935 Resolve Two Active Servers .

Procedure

To configure the Failover timeout period:

  1. From the SolarWinds Orion Failover Manager, select the Server: Monitoring tab.
  2. Click Configure Failover .
  3. Adjust the Failover timeout (seconds).
  4. Click OK .

To disable automatic failover, perform the following steps.

  1. From the SolarWinds Orion Failover Manager, select the Server: Monitoring tab.
  2. Click Configure Failover .
  3. Clear the check box Failover from Primary server to Secondary server if channel heartbeat is lost for failover timeout .
  4. Clear the check box Failover from Secondary server to Primary server if channel heartbeat is lost for failover timeout .
  5. Click OK .


Applies To

All Versions



Related Information

None

KBID-1968

0 out of 0 found this helpful

Comments

0 comments

Please sign in to leave a comment.