Summary
This Knowledgebase article describes the symptoms, causes, and resolution of two active servers.
Two active servers are never an intended state; when the condition is detected, it should be resolved immediately.
When there are two active servers live on the same network, Neverfail refers to the condition as Split-Brain syndrome.
Symptoms
Split-Brain syndrome can be identified by the following symptoms:
-
Two servers in a cluster are running and in an active state. This should be visible on the Taskbar icon as P / A (Primary and active), S / A (Secondary and active) or T / A (Tertiary and active).
-
An IP address conflict may be detected on the Principal (Public) IP address in a cluster running Neverfail Engine.
-
A name conflict may be detected in a cluster running Neverfail Engine. In a typical WAN environment, the Primary and Secondary (in a Pair) or Primary, Secondary, and Tertiary servers (in a Trio) connect to the network using different IP addresses and no IP conflict occurs. However, if the servers are running with the same name then a name conflict may result. This will only happen if the servers are visible to each other across the WAN.
-
Clients (for example, Outlook) cannot connect to the server running Neverfail Engine.
Cause
Two active servers (split-brain syndrome) can be caused by a number of issues. The most common causes of two active servers are:
-
Loss of the Neverfail Channel connection (most common in a WAN environment).
-
The active server being too busy to respond to Heartbeats.
-
Misconfiguration of the Neverfail Engine software.
It is important to determine the cause of the split-brain syndrome and resolve it to prevent a recurrence.
Resolution
Important Note: Once split-brain syndrome has occurred, the server with the most up-to-date data must be identified and then made Active. If the wrong server is made Active after this point, it can result in data loss. Care should be taken to reinstate the correct Active server.
The following can help identify the server with the most up-to-date data:
-
Check the date and time of files on both servers. The most up-to-date server should be made the active server.
-
From a client PC on the LAN, run nbtstat -A <Public_IP>, where <Public_IP> is the Principal (Public) IP address of your server. This can help identify the MAC address of the server currently visible to client machines.
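The nbtstat check can be scripted so that the reported MAC address is matched against the servers' known network adapters. The following is a minimal Python sketch, not part of Neverfail Engine. It assumes the Windows nbtstat output contains a line of the form "MAC Address = xx-xx-xx-xx-xx-xx". The function names, the sample output, and the KNOWN_MACS table are hypothetical placeholders to be replaced with values from your own servers.

```python
import re
import subprocess

# Matches the "MAC Address = 00-1A-2B-3C-4D-5E" line printed by nbtstat -A.
MAC_LINE = re.compile(r"MAC Address\s*=\s*((?:[0-9A-Fa-f]{2}-){5}[0-9A-Fa-f]{2})")


def extract_mac(nbtstat_output):
    """Return the MAC address found in nbtstat output (upper case), or None."""
    match = MAC_LINE.search(nbtstat_output)
    return match.group(1).upper() if match else None


def identify_server(mac, known_macs):
    """Look up a MAC address in a hand-maintained {MAC: server label} table."""
    return known_macs.get(mac, "unknown")


def query_public_ip(public_ip):
    """Run nbtstat -A against the Principal (Public) IP address (Windows only)."""
    result = subprocess.run(
        ["nbtstat", "-A", public_ip],
        capture_output=True, text=True, check=False,
    )
    return extract_mac(result.stdout)


# Abbreviated, illustrative output; real output also lists the NetBIOS names.
SAMPLE_OUTPUT = """
           NetBIOS Remote Machine Name Table

    MAC Address = 00-1a-2b-3c-4d-5e
"""

# Hypothetical table: record the public adapter MAC of each server in advance.
KNOWN_MACS = {
    "00-1A-2B-3C-4D-5E": "Primary",
    "00-1A-2B-3C-4D-5F": "Secondary",
}

print(identify_server(extract_mac(SAMPLE_OUTPUT), KNOWN_MACS))  # Primary
```

On a Windows client, query_public_ip("<Public_IP>") would return the MAC address that currently answers for the public address. The result only shows which server clients are reaching from that location, not which server holds the most recent data.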
Note: If the two active servers have both been servicing clients, perhaps at different WAN locations, one and only one server can be made active. Both servers will contain recent data, which cannot be merged using Neverfail Engine. One server must be made active and one server made passive in order to restart replication. Once replication is restarted, ALL data on the passive server will be overwritten by the data on the active server. It may be possible to extract the up-to-date data manually from the passive server prior to restarting replication. Please consult your Protected Application vendor regarding tools that may be used for this purpose. For further information, please contact Neverfail Support.
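The file date check described above can be tedious to do by hand on large data sets. The following Python sketch, which is an illustration and not a Neverfail tool, walks a folder and reports its most recently modified file so that the results from each server can be compared. The function names are hypothetical. File timestamps are only a heuristic: clock differences between servers can skew the comparison, and both servers may hold recent data.

```python
import os
from datetime import datetime, timezone


def latest_change(root):
    """Return (mtime, path) of the newest file under root, or None if empty."""
    newest = None
    for folder, _subdirs, files in os.walk(root):
        for name in files:
            path = os.path.join(folder, name)
            try:
                mtime = os.path.getmtime(path)
            except OSError:
                continue  # file vanished or is unreadable; skip it
            if newest is None or mtime > newest[0]:
                newest = (mtime, path)
    return newest


def most_recent_server(roots):
    """Given {server label: data folder}, return the label with the newest file."""
    stamped = {}
    for label, root in roots.items():
        found = latest_change(root)
        if found is not None:
            stamped[label] = found[0]
    if not stamped:
        return None
    return max(stamped, key=stamped.get)


def describe(root):
    """Format the newest file under root for a side-by-side comparison."""
    found = latest_change(root)
    if found is None:
        return f"{root}: no files found"
    when = datetime.fromtimestamp(found[0], tz=timezone.utc).isoformat()
    return f"{root}: {found[1]} last modified {when}"
```

For example, running describe() against the protected data folder on each server prints one line per server in UTC, which can then be compared before deciding which server to make active.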
How to resolve two active servers (split-brain syndrome):
-
Identify the server with the most up-to-date data.
-
Shut down Neverfail Engine on all servers (if it is running).
-
On the server(s) you would like to make passive, right-click the Taskbar icon and select Configure Server wizard.
-
Select the Machine tab and set the server role to passive.
Note: Do not change the Identity of the server (that is, Primary, Secondary, or Tertiary).
-
Click Finish to accept the changes. Exit the wizard and reboot this server.
-
Start Neverfail Engine and check that the Taskbar icon now reflects the changes by showing P / - (Primary and passive), S / - (Secondary and passive), or T / - (Tertiary and passive).
-
On the active server, right-click the Taskbar icon and select Configure Server wizard.
-
Select the Machine tab and set the server role to active.
Note: Do not change the Identity of the server (that is, Primary, Secondary, or Tertiary).
-
Click Finish to accept the changes. Exit the wizard and reboot this server.
Note: As the active server restarts, it will connect to the passive server and start replication. Once this happens, data on the passive server will be overwritten by the data on the active server. Please see above for further information on how to check which server contains the most up-to-date data.
-
Start Neverfail Engine (if required) and check that the Taskbar icon now reflects the changes by showing P / A (Primary and active), S / A (Secondary and active), or T / A (Tertiary and active).
-
Log into the Neverfail Advanced Management Client.
-
Check that the servers have connected and replication has started.
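As a final check, exactly one server should show an active state after the procedure. The following Python sketch illustrates that rule using the Taskbar status notation from this article (for example, P / A or S / -). It is not a Neverfail API: the labels are assumed to be read from each server's Taskbar icon and typed in by hand, and the function names are hypothetical.

```python
# Identity codes as shown on the Taskbar icon.
IDENTITIES = {"P": "Primary", "S": "Secondary", "T": "Tertiary"}


def parse_status(label):
    """Split a label such as 'P / A' or 'S / -' into (identity, is_active)."""
    identity_code, role_code = [part.strip() for part in label.split("/")]
    if identity_code not in IDENTITIES or role_code not in ("A", "-"):
        raise ValueError(f"unrecognized status label: {label!r}")
    return IDENTITIES[identity_code], role_code == "A"


def assess(labels):
    """Classify a group of status labels by how many servers are active."""
    active = [
        identity
        for identity, is_active in map(parse_status, labels)
        if is_active
    ]
    if len(active) == 1:
        return f"ok: {active[0]} is the only active server"
    if not active:
        return "no active server"
    return "split-brain: " + ", ".join(active) + " are all active"


print(assess(["P / A", "S / -"]))  # ok: Primary is the only active server
print(assess(["P / A", "S / A"]))  # split-brain: Primary, Secondary are all active
```

A result of "no active server" indicates the opposite condition, in which all servers are passive, and is not resolved by the steps in this article.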
Applies To
All Versions
Knowledgebase article #984: Resolve Two Passive Servers (Pair) or Three Passive Servers (Trio)
KBID-516