Create a Post
cancel
Showing results for 
Search instead for 
Did you mean: 
xiro
Contributor

Issue with DNS lookup from HA standby member

Hi,

we have an issue on our FW-cluster standby member, causing AV/TE-Updates to fail.

 

Internet connection is OK, but I assume that there is an issue with DNS. If I try to resolve e.g. checkpoint.com from our standby member, it times out and is not resolvable. The error messages vary: 

 

[Expert@chp-2:0]# nslookup checkpoint.com
;; connection timed out; trying next origin
;; connection timed out; no servers could be reached

 

[Expert@chp-2:0]# nslookup facebook.com
;; connection timed out; trying next origin
Server: 10.1.1.14
Address: 10.1.1.14#53

** server can't find facebook.com: NXDOMAIN

 

 

If I do the same with the active member, it works:

[Expert@chp-1:0]# nslookup checkpoint.com
Server: 10.1.1.14
Address: 10.1.1.14#53

Non-authoritative answer:
Name: checkpoint.com
Address: 209.87.209.100

 

Afterwards, in most cases (80%), the resolution works then from the standby member too for the same host.

But not always - it's a very strange behavior.

 

If I do a failover, the standby (now main) FW works properly. 

The DNS server are up and running, no issues at all (except this one from the HA standby member).

 

As far as I can see on another FW (ASA), located between CP and DNS servers, all requests are coming with the cluster IP.

I did a fw monitor and at the same time a capture on the ASA. All packets have corresponding packet captures on ASA, and there I can also see, that the servers answer to every request. But in the FW monitor, I don't see most of the answer-packets. Here's an example:

Capture.PNG

On the left side you can see the FW monitor output. Packets leave with the cluster VIP 10.2.1.1 on different ports.

On the right side you can see the ASA-capture where all these packets appear, and where you can also see the answers from the DNS server 10.1.1.14/15. (marked blue/azure)

But on the left side, these answers dont appear for the first 6 requests. Only the last, yellow one has a properly appearing answer. 

The only strange thing that bothers me, is that the packet length of the failing answers seems to be much lower than the last one (26 compared to 96).

 

Any ideas, what could be causing this problem?

 

 

 

0 Kudos
2 Replies
Tommy_Forrest
Advisor

Have a look at this SK:

https://supportcenter.checkpoint.com/supportcenter/portal?eventSubmit_doGoviewsolutiondetails=&solut...

 

We had a similar issue.  Though it wasn't with DNS.  It was with HTTP.  The inactive cluster member was sending traffic out via the cluster IP.  but when it came back, the cluster incorrectly sent that return packet to the active host member.

0 Kudos
JanVandenberghe
Explorer

Was this problem solved ?

We are facing the same problems after our VSX upgrade to R80.20 (Take 91)

Even a ping from the standby node doesn't work to non SmartCenter/Logserver IPs

setting the fwha_forw_packet_to_not_active parameter doesn't seems to solving the problem

 

 

 

 

0 Kudos

Leaderboard

Epsum factorial non deposit quid pro quo hic escorol.

Upcoming Events

    Tue 23 Apr 2024 @ 11:00 AM (EDT)

    East US: What's New in R82

    Thu 25 Apr 2024 @ 11:00 AM (SGT)

    APAC: CPX 2024 Recap

    Tue 30 Apr 2024 @ 03:00 PM (CDT)

    EMEA: CPX 2024 Recap

    Thu 02 May 2024 @ 11:00 AM (SGT)

    APAC: What's new in R82

    Tue 23 Apr 2024 @ 11:00 AM (EDT)

    East US: What's New in R82

    Thu 25 Apr 2024 @ 11:00 AM (SGT)

    APAC: CPX 2024 Recap

    Tue 30 Apr 2024 @ 03:00 PM (CDT)

    EMEA: CPX 2024 Recap

    Thu 02 May 2024 @ 11:00 AM (SGT)

    APAC: What's new in R82
    CheckMates Events