Matlu
Advisor

Failover root-cause detection in ClusterXL HA

Hello everyone.

Today our ClusterXL HA cluster failed over suddenly, without any intervention from the administrators.

We would like to determine whether the root cause of this failover lies with the gateway itself (I suspect not).

I have reviewed the following, and I would like to know the opinion of people more experienced with these diagnoses.

I get the impression that, at the time of the failover, there was an error on the VLAN being monitored, but the root cause may lie in equipment other than the Check Point gateway.

FO1.png

FO2.png

Cheers. 🙂

the_rock
Legend

Bro, as I stated in my email, the best approach is to review routed.log and the messages file, but based on what you show, it appears the issue was with interface bond12.62.

Is the customer using OSPF/BGP? MAKE SURE the value below is set to 0 and not 1:

fw ctl get int fwha_monitor_all_vlan
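
For the log review suggested above, here is a minimal sketch, not a definitive procedure. It assumes the logs sit in the usual Gaia locations (/var/log/messages and /var/log/routed.log); the function name iface_events is made up for illustration. It just prints every line that mentions the interface, so the entries around the failover time can be compared.

```shell
# Hypothetical helper: print every log line that mentions an interface.
# Usage: iface_events <interface> <logfile>...
iface_events() {
    local iface="$1"
    shift
    local f
    for f in "$@"; do
        # Skip files that do not exist or are not readable.
        [ -r "$f" ] || continue
        # -F: treat the name as a fixed string, so the dot in bond12.62 is literal.
        # -H: prefix each match with the file name it came from.
        grep -F -H -- "$iface" "$f"
    done
    return 0
}
```

Example call from expert mode (paths are the assumed defaults): `iface_events bond12.62 /var/log/messages /var/log/routed.log`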

Andy

Matlu
Advisor

Hey,

I'm not sure yet whether you use "dynamic routing".

The command you shared shows the following:

[Expert@SG1:0]# fw ctl get int fwha_monitor_all_vlan
fwha_monitor_all_vlan = 0
[Expert@SG1:0]#
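
Since this is checked per member, it is probably worth running the same check on the other cluster member too. A small sketch that turns the output above into a pass/fail check; it assumes the "name = value" output format shown, and check_kernel_int is a hypothetical helper name, not a Check Point command.

```shell
# Hypothetical helper: read "name = value" lines on stdin and succeed only
# if the named kernel parameter is present and has the expected value.
# Usage: <command> | check_kernel_int <name> <expected>
check_kernel_int() {
    awk -v name="$1" -v want="$2" '
        $1 == name && $2 == "=" { found = 1; if ($3 == want) ok = 1 }
        END { exit (found && ok) ? 0 : 1 }
    '
}
```

Example: `fw ctl get int fwha_monitor_all_vlan | check_kernel_int fwha_monitor_all_vlan 0 && echo "value is 0"`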

Cheers. 🙂

the_rock
Legend

I do use it lol, but you should check if the customer does hehe.

Anyway, you can verify via the web UI (OSPF and BGP tabs) or from clish: just run show ospf and show bgp, and when you hit Tab, it lists all the options.

Andy

Matlu
Advisor

I checked, and no, the customer does not use either of the two protocols.

LOL 🤣

FO3.png

I suspect it may also be a problem with "other equipment" on the network, perhaps the switches.

What do you think?

the_rock
Legend

It's possible, but you would need them to check the switches.

Andy

