Create a Post
cancel
Showing results for 
Search instead for 
Did you mean: 
Matlu
Advisor
Jump to solution

ClusterXL Failover.

ClusterXL Failover.

Hello,

Is there any "practical" way to validate the reason "why" our ClusterXL did a failover?

We want to rule out that it was a CPU problem and / or equipment configuration.

The cphaprob state, shows the following.

 

CPHA.png

Is there any other documentation to check in these "situations"?

Greetings.

0 Kudos
1 Solution

Accepted Solutions
Matlu
Advisor

Andy,

I found this, with the command you recommended.

CPHA1.png

I have the impression, and I am almost sure, that it was a manual "shutdown" of the equipment, either it was disconnected from the mains, or something similar.

By the way, in your command grep -i DOWN /var/log/messages* ... how can you filter the date only for today, 06September?

Cheers, 🙂

View solution in original post

0 Kudos
7 Replies
the_rock
Legend
Legend

Yes, there are ways to tell...

Btw, I would NOT be using routable IPs for sync, as 3.3.3.x is Amazon range...I always tell customers to use 169.254.x.x range

Anyway, having said that, I would run below and look for messages in proper date/time span

from expert -> grep -i DOWN /var/log/messages*

Andy

0 Kudos
Matlu
Advisor

Andy,

I found this, with the command you recommended.

CPHA1.png

I have the impression, and I am almost sure, that it was a manual "shutdown" of the equipment, either it was disconnected from the mains, or something similar.

By the way, in your command grep -i DOWN /var/log/messages* ... how can you filter the date only for today, 06September?

Cheers, 🙂

0 Kudos
the_rock
Legend
Legend

There ya go bro, thats your answer : - )

Not sure if you can filter for specific date, will check.

Andy

0 Kudos
Bob_Zimmerman
Authority
Authority

cphaprob show_failover

This shows a log of up to 20 failovers going back to the last time services were restarted. It doesn't always give a lot of detail. It also doesn't contain state changes which did not result in a failover (like standby to down).

the_rock
Legend
Legend

Thanks for that, totally forgot about the command.

Andy

0 Kudos
Matlu
Advisor

Hello,

This is a bit of a silly question, but when you apply the command, like:

cphaprob show_failover, the result says something like.

CPHA2.png

"member2 -> member1"

It's silly the doubt, but by "member2" do you mean, the same computer, where at that moment, I'm running the command?

Greetings.

0 Kudos
the_rock
Legend
Legend

It would show you exact same output on both, but no matter the context, it would look at member 2 as current standby and member 1 as active.

Andy

0 Kudos

Leaderboard

Epsum factorial non deposit quid pro quo hic escorol.

Upcoming Events

    CheckMates Events