Create a Post
cancel
Showing results for 
Search instead for 
Did you mean: 
KostasGR
Advisor

VS on a VSX cluster (VSLS mode) can be on active active mode in normal or split brain scenario

Hello

Does anyone knows if a VS on a VSX cluster (VSLS mode) can be on active/active mode in normal or in a split brain scenario?

In case of split brain scenario is a way to force a VS that is on down state on that VSX( VSLS  mode) become active?

BR,

Kostas

0 Kudos
8 Replies
Bob_Zimmerman
Authority
Authority

It's theoretically possible, but only if neither one can see cluster heartbeats from the other, and both can ping something on every interface. I could contrive such an environmental disruption, but it would be extraordinarily unlikely for random failures to result in both members claiming to be active.

I don't know of a way to force a down member to become active.

0 Kudos
genisis__
Leader Leader
Leader

vsenv <id>

clusterXL_admin down

 

0 Kudos
Bob_Zimmerman
Authority
Authority

That forces an active context in VSLS to become down. 'clusterXL_admin up' clears the artificially-down status, but does not force the member to become active. If the member thinks it is unhealthy for another reason and the active contention mechanism has failed, it will refuse to take over, and I don't know of a way to force it to.

0 Kudos
genisis__
Leader Leader
Leader

so the scenario is DC to DC link gone down, both members now active and you need to force the traffic down one of the member only, so we are attempting to treat the symptom here and not address the root cause.

Possible things to try:

- cpstop on the member you want down?

- Shutdown all interfaces except Management on the member?

- cphastop?

- shutdown the node via LOM.

- Clearly - resolve the DC to DC issue. 


Above is all I think of unless TAC have another suggestion.

0 Kudos
Bob_Zimmerman
Authority
Authority

Again, that works to force a member down. That wasn't the question asked, though. The original post asks if there is a way to force a VS which is currently down to become active.

0 Kudos
Chris_Atkinson
Employee Employee
Employee

What is the hardware platform & version  involved, also is this for L2 or L3 virtual systems?

CCSM R77/R80/ELITE
0 Kudos
KostasGR
Advisor

Hello Chris

The version is r80.30 and they are L3 VS and not bridge VS L2.

Consider a scenario you have two data centers with Layer 2 Data Center Interconnection and one site is totally destroyed by a meteor or a flood for example or the lines that interconnect them are out of service (most probable scenario).

BR,

Kostas

 

0 Kudos
Bob_Zimmerman
Authority
Authority

One site being destroyed isn't really split-brain. If one whole datacenter is gone, then the other member should take over because it's still there.

If the interconnect between the datacenters fails, that would be split-brain, sure. Depending on the rest of the environment (in particular, whether there are things in both datacenters which respond to pings from the firewalls), that could result in both members becoming active, though they wouldn't conflict with each other, as the link between the datacenters is down.

This has complicated implications if the two cluster members have totally independent connectivity to the same things outside the datacenters. For example, if they each have their own Internet connection, and they talk BGP with the telco, that could result in both of them trying to claim to be the right path for the AS, which could result in anycast-style traffic flow.

If the DC-to-DC link goes down, and one member decides it isn't healthy enough to take over, I don't know of a way to force it to take over anyway.

0 Kudos

Leaderboard

Epsum factorial non deposit quid pro quo hic escorol.

Upcoming Events

    CheckMates Events