<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: ClusterXL HA issue in Firewall and Security Management</title>
    <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265104#M52184</link>
    <description>&lt;P&gt;If I were you, would install recommended one, take 119, but either way, what&amp;nbsp;&lt;a href="https://community.checkpoint.com/t5/user/viewprofilepage/user-id/16383"&gt;@Vincent_Bacher&lt;/a&gt;&amp;nbsp;said, makes total sense, at least to me.&lt;/P&gt;</description>
    <pubDate>Thu, 11 Dec 2025 12:32:06 GMT</pubDate>
    <dc:creator>the_rock</dc:creator>
    <dc:date>2025-12-11T12:32:06Z</dc:date>
    <item>
      <title>ClusterXL HA issue</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265084#M52168</link>
      <description>&lt;P&gt;Hi all,&lt;BR /&gt;&lt;BR /&gt;This has now happened a few times in last 6 months. The Standby firewall doesnot receive the CCP packets and marks Sync interface as down. The cluster goes into a split brain scenario.&lt;BR /&gt;It resolves itself in less than a min. All BGP peers are re-established. Any idea why is this happening ?&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Note:&lt;/STRONG&gt; Sys_admin installed Threat Prevention policy right after this. There were spike detective alerts for temain right before this happened ( can be totally unrealted ).&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;STRONG&gt;Active firewall&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Dec 11 01:53:31 2025 F1-2 spike_detective: spike info: type: cpu, cpu core: 42, top consumer: fwk0_dev_57, start time: 11/12/25 01:53:18, spike duration (sec): 12, initial cpu usage: 91, average cpu usage: 74, perf taken: 0&lt;/P&gt;&lt;P&gt;Dec 11 01:54:37 2025 F1-2 spike_detective: spike info: type: thread, thread id: 115061, thread name: temain, start time: 11/12/25 01:54:30, spike duration (sec): 6, initial cpu usage: 100, average cpu usage: 100, perf taken: 1&lt;/P&gt;&lt;P&gt;Dec 11 01:55:27 2025 F1-2 fwk: CLUS-210300-2: Remote member 1 (state STANDBY -&amp;gt; DOWN) | Reason: Interface is down (Cluster Control Protocol packets are not received)&lt;BR /&gt;Dec 11 01:55:27 2025 F1-2 fwk: CLUS-114402-2: State change: ACTIVE -&amp;gt; STANDBY | Reason: Member state has been changed after returning from ACTIVE/ACTIVE scenario (remote cluster member 1 has higher priority)&lt;/P&gt;&lt;P&gt;Dec 11 01:55:27 2025 F1-2 fwk: CLUS-210305-2: Remote member 1 (state DOWN -&amp;gt; ACTIVE(!)) | Reason: Interface is down (Cluster Control Protocol packets are not received)&lt;BR /&gt;Dec 11 01:55:27 2025 F1-2 fwk: CLUS-210300-2: Remote member 1 (state ACTIVE(!) -&amp;gt; DOWN) | Reason: Interface is down (Cluster Control Protocol packets are not received)&lt;BR /&gt;Dec 11 01:55:27 2025 F1-2 fwk: CLUS-114704-2: State change: STANDBY -&amp;gt; ACTIVE | Reason: No other ACTIVE members have been found in the cluster&lt;/P&gt;&lt;P&gt;Dec 11 01:55:27 2025 F1-2 fwk: CLUS-100102-2: Failover member 1 -&amp;gt; member 2 | Reason: Available on member 1&lt;BR /&gt;Dec 11 01:55:27 2025 F1-2 fwk: CLUS-214802-2: Remote member 1 (state DOWN -&amp;gt; STANDBY) | Reason: There is already an ACTIVE member in the cluster&lt;BR /&gt;Dec 11 01:55:27 2025 F1-2 fwk: CLUS-211700-2: Remote member 1 (state STANDBY -&amp;gt; DOWN) | Reason: ROUTED PNOTE&lt;BR /&gt;Dec 11 01:55:27 2025 F1-2 fwk: CLUS-100201-2: Failover member 2 -&amp;gt; member 1 | Reason: Member state has been changed after returning from ACTIVE/ACTIVE scenario (remote cluster member 1 has higher priority)&lt;BR /&gt;Dec 11 01:55:27 2025 F1-2 fwk: CLUS-120105-2: routed PNOTE ON&lt;BR /&gt;Dec 11 01:55:27 2025 F1-2 fwk: CLUS-111705-2: State change: ACTIVE -&amp;gt; ACTIVE(!) | Reason: ROUTED PNOTE&lt;/P&gt;&lt;P&gt;Dec 11 01:55:28 2025 F1-2 fwk: CLUS-120105-2: routed PNOTE OFF&lt;BR /&gt;Dec 11 01:55:28 2025 F1-2 fwk: CLUS-114904-2: State change: ACTIVE(!) -&amp;gt; ACTIVE | Reason: Reason for ACTIVE! alert has been resolved&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;STRONG&gt;Standby Firewall&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Dec 11 01:55:24 2025 F1-1 fwk: CLUS-110300-1: State change: STANDBY -&amp;gt; DOWN | Reason: Interface Sync is down (Cluster Control Protocol packets are not received)&lt;/P&gt;&lt;P&gt;Dec 11 01:55:25 2025 F1-1 fwk: CLUS-216400-1: Remote member 2 (state ACTIVE -&amp;gt; LOST) | Reason: Timeout Control Protocol packet expired member declared as DEAD&lt;BR /&gt;Dec 11 01:55:25 2025 F1-1 fwk: CLUS-116505-1: State change: DOWN -&amp;gt; ACTIVE(!) | Reason: All other machines are dead (timeout), Interface Sync is down (Cluster Control Protocol packets are not received)&lt;/P&gt;&lt;P&gt;Dec 11 01:55:25 2025 F1-1 fwk: CLUS-100201-1: Failover member 2 -&amp;gt; member 1 | Reason: Available on member 2&lt;BR /&gt;Dec 11 01:55:27 2025 F1-1 fwk: CLUS-214802-1: Remote member 2 (state LOST -&amp;gt; STANDBY) | Reason: There is already an ACTIVE member in the cluster&lt;BR /&gt;Dec 11 01:55:27 2025 F1-1 fwk: CLUS-110305-1: State change: ACTIVE! -&amp;gt; DOWN | Reason: Interface Sync is down (Cluster Control Protocol packets are not received)&lt;/P&gt;&lt;P&gt;Dec 11 01:55:27 2025 F1-1 fwk: CLUS-214904-1: Remote member 2 (state STANDBY -&amp;gt; ACTIVE) | Reason: Reason for ACTIVE! alert has been resolved&lt;BR /&gt;Dec 11 01:55:27 2025 F1-1 fwk: CLUS-114802-1: State change: DOWN -&amp;gt; STANDBY | Reason: There is already an ACTIVE member in the cluster (member 2)&lt;/P&gt;&lt;P&gt;Dec 11 01:55:27 2025 F1-1 fwk: CLUS-120105-1: routed PNOTE ON&lt;BR /&gt;Dec 11 01:55:27 2025 F1-1 fwk: CLUS-111700-1: State change: STANDBY -&amp;gt; DOWN | Reason: ROUTED PNOTE&lt;/P&gt;&lt;P&gt;Dec 11 01:55:27 2025 F1-1 fwk: CLUS-100102-1: Failover member 1 -&amp;gt; member 2 | Reason: Interface Sync is down (Cluster Control Protocol packets are not received)&lt;BR /&gt;Dec 11 01:55:27 2025 F1-1 routed[168442]: [routed] ERROR: cpcl_recv: Failed to receive cluster message header, connection will need to be reestablished. errno = 104 (Connection reset by peer)&lt;BR /&gt;Dec 11 01:55:27 2025 F1-1 routed[168442]: [routed] ERROR: cpcl_recv: deleting peer task 0x8f1aee4 due to failure to read from the socket&lt;BR /&gt;Dec 11 01:56:02 2025 F1-1 fwk: CLUS-120105-1: routed PNOTE OFF&lt;BR /&gt;Dec 11 01:56:02 2025 F1-1 fwk: CLUS-114802-1: State change: DOWN -&amp;gt; STANDBY | Reason: There is already an ACTIVE member in the cluster (member 2)&lt;/P&gt;</description>
      <pubDate>Thu, 11 Dec 2025 10:44:13 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265084#M52168</guid>
      <dc:creator>NiladriSarkar</dc:creator>
      <dc:date>2025-12-11T10:44:13Z</dc:date>
    </item>
    <item>
      <title>Re: ClusterXL HA issue</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265085#M52169</link>
      <description>&lt;P&gt;I see this suspicious messages:&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;Dec 11 01:53:31 2025 F1-2 spike_detective: spike info: type: cpu, cpu core: 42, top consumer: fwk0_dev_57, start time: 11/12/25 01:53:18, spike duration (sec): 12, initial cpu usage: 91, average cpu usage: 74, perf taken: 0

Dec 11 01:54:37 2025 F1-2 spike_detective: spike info: type: thread, thread id: 115061, thread name: temain, start time: 11/12/25 01:54:30, spike duration (sec): 6, initial cpu usage: 100, average cpu usage: 100, perf taken: 1&lt;/LI-CODE&gt;
&lt;P&gt;&lt;BR /&gt;question is why this device consumes so much cpu. I guess it's VSX and maybe to be analysed what exactly caused the spike and to consider an adjustment of the VS core assignment.&lt;/P&gt;</description>
      <pubDate>Thu, 11 Dec 2025 10:56:38 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265085#M52169</guid>
      <dc:creator>Vincent_Bacher</dc:creator>
      <dc:date>2025-12-11T10:56:38Z</dc:date>
    </item>
    <item>
      <title>Re: ClusterXL HA issue</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265088#M52170</link>
      <description>&lt;P&gt;It is not a VSX. The CPU spikes are short lived.. mostly for TEMAIN threads. Its a 28600 box and not being over utilized. Will investigate the cpu issue anyways. thanks&lt;/P&gt;</description>
      <pubDate>Thu, 11 Dec 2025 11:07:31 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265088#M52170</guid>
      <dc:creator>NiladriSarkar</dc:creator>
      <dc:date>2025-12-11T11:07:31Z</dc:date>
    </item>
    <item>
      <title>Re: ClusterXL HA issue</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265092#M52173</link>
      <description>&lt;P&gt;ClusterXL typically has split brain prevention mechanisms so either is overwhelmed or their is some Layer-2 issue.&lt;/P&gt;
&lt;P&gt;What is the topology of the sync interface? Is it a bond, are there intermediate switches etc.&lt;/P&gt;</description>
      <pubDate>Thu, 11 Dec 2025 11:42:28 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265092#M52173</guid>
      <dc:creator>Chris_Atkinson</dc:creator>
      <dc:date>2025-12-11T11:42:28Z</dc:date>
    </item>
    <item>
      <title>Re: ClusterXL HA issue</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265093#M52174</link>
      <description>&lt;P&gt;I see the point Vince is making. That could absolutely happen due to CPU spike.&lt;/P&gt;</description>
      <pubDate>Thu, 11 Dec 2025 11:55:41 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265093#M52174</guid>
      <dc:creator>the_rock</dc:creator>
      <dc:date>2025-12-11T11:55:41Z</dc:date>
    </item>
    <item>
      <title>Re: ClusterXL HA issue</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265100#M52180</link>
      <description>&lt;P&gt;not a bond interface.. and no switch in between. they are directly connected. The cable was replaced after we saw this issue earlier.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 11 Dec 2025 12:25:22 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265100#M52180</guid>
      <dc:creator>NiladriSarkar</dc:creator>
      <dc:date>2025-12-11T12:25:22Z</dc:date>
    </item>
    <item>
      <title>Re: ClusterXL HA issue</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265102#M52182</link>
      <description>&lt;P&gt;Which version / JHF are we working with?&lt;/P&gt;</description>
      <pubDate>Thu, 11 Dec 2025 12:29:43 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265102#M52182</guid>
      <dc:creator>Chris_Atkinson</dc:creator>
      <dc:date>2025-12-11T12:29:43Z</dc:date>
    </item>
    <item>
      <title>Re: ClusterXL HA issue</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265103#M52183</link>
      <description>&lt;P&gt;its 81.20 take 113&lt;/P&gt;</description>
      <pubDate>Thu, 11 Dec 2025 12:30:21 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265103#M52183</guid>
      <dc:creator>NiladriSarkar</dc:creator>
      <dc:date>2025-12-11T12:30:21Z</dc:date>
    </item>
    <item>
      <title>Re: ClusterXL HA issue</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265104#M52184</link>
      <description>&lt;P&gt;If I were you, would install recommended one, take 119, but either way, what&amp;nbsp;&lt;a href="https://community.checkpoint.com/t5/user/viewprofilepage/user-id/16383"&gt;@Vincent_Bacher&lt;/a&gt;&amp;nbsp;said, makes total sense, at least to me.&lt;/P&gt;</description>
      <pubDate>Thu, 11 Dec 2025 12:32:06 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265104#M52184</guid>
      <dc:creator>the_rock</dc:creator>
      <dc:date>2025-12-11T12:32:06Z</dc:date>
    </item>
    <item>
      <title>Re: ClusterXL HA issue</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265105#M52185</link>
      <description>&lt;P&gt;yup, thank you. Will check on the CPU usage and plan to install t119.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 11 Dec 2025 12:33:57 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265105#M52185</guid>
      <dc:creator>NiladriSarkar</dc:creator>
      <dc:date>2025-12-11T12:33:57Z</dc:date>
    </item>
    <item>
      <title>Re: ClusterXL HA issue</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265106#M52186</link>
      <description>&lt;P&gt;Good idea since T119 has some CXL fixes&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 11 Dec 2025 12:35:59 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265106#M52186</guid>
      <dc:creator>Chris_Atkinson</dc:creator>
      <dc:date>2025-12-11T12:35:59Z</dc:date>
    </item>
    <item>
      <title>Re: ClusterXL HA issue</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265107#M52187</link>
      <description>&lt;P&gt;right ! thanks&lt;BR /&gt;&lt;BR /&gt;PRJ-62301, PMTR-115027 ClusterXL In ClusterXL High Availability (HA), in some scenarios, the Active cluster member stops sending Cluster Control Protocol (CCP) heartbeats, and the Standby member may misinterpret this as an Interface Active Check (IAC) failure.&lt;/P&gt;</description>
      <pubDate>Thu, 11 Dec 2025 12:40:34 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265107#M52187</guid>
      <dc:creator>NiladriSarkar</dc:creator>
      <dc:date>2025-12-11T12:40:34Z</dc:date>
    </item>
    <item>
      <title>Re: ClusterXL HA issue</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265108#M52188</link>
      <description>&lt;P&gt;You can also follow below for historical data:&lt;/P&gt;
&lt;P&gt;&lt;A href="https://community.checkpoint.com/t5/Security-Gateways/How-to-view-cpview-history-file-on-other-machine/td-p/162600" target="_blank"&gt;https://community.checkpoint.com/t5/Security-Gateways/How-to-view-cpview-history-file-on-other-machine/td-p/162600&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;or&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;cpview -t and then press t again&lt;/P&gt;</description>
      <pubDate>Thu, 11 Dec 2025 12:43:57 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265108#M52188</guid>
      <dc:creator>the_rock</dc:creator>
      <dc:date>2025-12-11T12:43:57Z</dc:date>
    </item>
    <item>
      <title>Re: ClusterXL HA issue</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265109#M52189</link>
      <description>&lt;P&gt;I feel it will improve the situation, for sure.&lt;/P&gt;</description>
      <pubDate>Thu, 11 Dec 2025 12:46:32 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265109#M52189</guid>
      <dc:creator>the_rock</dc:creator>
      <dc:date>2025-12-11T12:46:32Z</dc:date>
    </item>
    <item>
      <title>Re: ClusterXL HA issue</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265112#M52191</link>
      <description>&lt;P&gt;Perhaps I didn't express myself clearly enough as a non-native English speaker, but thank you for the flowers.&lt;/P&gt;</description>
      <pubDate>Thu, 11 Dec 2025 14:09:27 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265112#M52191</guid>
      <dc:creator>Vincent_Bacher</dc:creator>
      <dc:date>2025-12-11T14:09:27Z</dc:date>
    </item>
    <item>
      <title>Re: ClusterXL HA issue</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265113#M52192</link>
      <description>&lt;P&gt;You absolutely did, I got all you had to say. Dont worry, English is not my first language either lol&lt;/P&gt;</description>
      <pubDate>Thu, 11 Dec 2025 14:11:29 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/ClusterXL-HA-issue/m-p/265113#M52192</guid>
      <dc:creator>the_rock</dc:creator>
      <dc:date>2025-12-11T14:11:29Z</dc:date>
    </item>
  </channel>
</rss>

