<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: High CPU but no failover in Firewall and Security Management</title>
    <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/High-CPU-but-no-failover/m-p/231067#M44513</link>
    <description>&lt;P&gt;That's not silly at all, its exactly what we saw as well some cores, particularly core 0 was at 99%.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;It could be done via a bash script but I don't really want to go down that path.&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Tue, 29 Oct 2024 01:05:42 GMT</pubDate>
    <dc:creator>Ryan_Ryan</dc:creator>
    <dc:date>2024-10-29T01:05:42Z</dc:date>
    <item>
      <title>High CPU but no failover</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/High-CPU-but-no-failover/m-p/231064#M44510</link>
      <description>&lt;P&gt;Hi, we had two scenarios over the past 3 months where primary cluster member high cpu (98%+) on all cores and was dropping connections and causing general network chaos for several hours, but the issue was it was 'alive enough' to respond to corexl heartbeats and remain active for 3 hours till it finally failed over, the load was so bad we couldn't even establish an ssh session to it.&lt;/P&gt;
&lt;P&gt;Is there anyway we can have this type of resource exhaustion cause a pnote and failover?&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;They are R81.20, will be patching to latest jumbo this week.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 29 Oct 2024 00:16:22 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/High-CPU-but-no-failover/m-p/231064#M44510</guid>
      <dc:creator>Ryan_Ryan</dc:creator>
      <dc:date>2024-10-29T00:16:22Z</dc:date>
    </item>
    <item>
      <title>Re: High CPU but no failover</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/High-CPU-but-no-failover/m-p/231065#M44511</link>
      <description>&lt;P&gt;I know this may sound silly what I will say, but I had seen cases where cpu is at 99% and failover does NOT happen. Honestly, I have no clue if there is an official "threshold" for things like this, but I had never seen one. Not sure if updating ti jumbo 89 (recommended take) would fix your issue, but worth a try.&lt;/P&gt;
&lt;P&gt;Andy&lt;/P&gt;</description>
      <pubDate>Tue, 29 Oct 2024 00:48:50 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/High-CPU-but-no-failover/m-p/231065#M44511</guid>
      <dc:creator>the_rock</dc:creator>
      <dc:date>2024-10-29T00:48:50Z</dc:date>
    </item>
    <item>
      <title>Re: High CPU but no failover</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/High-CPU-but-no-failover/m-p/231067#M44513</link>
      <description>&lt;P&gt;That's not silly at all, its exactly what we saw as well some cores, particularly core 0 was at 99%.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;It could be done via a bash script but I don't really want to go down that path.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 29 Oct 2024 01:05:42 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/High-CPU-but-no-failover/m-p/231067#M44513</guid>
      <dc:creator>Ryan_Ryan</dc:creator>
      <dc:date>2024-10-29T01:05:42Z</dc:date>
    </item>
    <item>
      <title>Re: High CPU but no failover</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/High-CPU-but-no-failover/m-p/231069#M44514</link>
      <description>&lt;P&gt;It depends on why the CPU is at 99%. If it's at 99% because there's a load issue, the load simply moves to the other cluster member upon failover and nothing is resolved - in fact it's probably made worse due to the extra overhead of the failover occurring, causing a bigger/worse outage. So we don't necessarily want to code in load related failovers.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Would suggest that if you have out-of-band access to the gateway (LOM/console) set up, you may have an easier time getting in to the CLI to check things out as you don't have to try wrestle an SSH connection in. Worst case, if you have LOM access you can power cycle the gateway to force the failover.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;In R82, setting up an ElasticXL cluster could also help, as you will want to size a 2-gateway EXL cluster such that neither gateway is utilised more than 40%, to maintain HA. This way, a load related resource utilisation spike is absorbed by having extra overhead there. You also may find that if you have at least one member behaving nicely you can set the other one down from there, depending on the circumstance.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 29 Oct 2024 02:10:07 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/High-CPU-but-no-failover/m-p/231069#M44514</guid>
      <dc:creator>emmap</dc:creator>
      <dc:date>2024-10-29T02:10:07Z</dc:date>
    </item>
    <item>
      <title>Re: High CPU but no failover</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/High-CPU-but-no-failover/m-p/231070#M44515</link>
      <description>&lt;P&gt;Yes that is a fair call the load may just get shifted around, in this case it wasnt traffic generated load but I do understand the chaos of flipping back and forward if it was load generated.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;As these are vm's we only have access to console via vsphere which we also could not get into due to the network outage (working on ways to get around that for next time)&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 29 Oct 2024 02:57:46 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/High-CPU-but-no-failover/m-p/231070#M44515</guid>
      <dc:creator>Ryan_Ryan</dc:creator>
      <dc:date>2024-10-29T02:57:46Z</dc:date>
    </item>
    <item>
      <title>Re: High CPU but no failover</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/High-CPU-but-no-failover/m-p/231091#M44521</link>
      <description>&lt;P&gt;Maybe shut a switch port that is connected to this firewall? If active unit has less interfaces up it will do failover.&lt;/P&gt;
&lt;P&gt;Failover based on high load I have never seen and would not recommend.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Maybe this is something:&lt;/P&gt;
&lt;P&gt;Management Data Plane Separation (MDPS)&lt;/P&gt;
&lt;P&gt;&lt;A href="https://support.checkpoint.com/results/sk/sk138672" target="_blank"&gt;https://support.checkpoint.com/results/sk/sk138672&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 29 Oct 2024 08:31:55 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/High-CPU-but-no-failover/m-p/231091#M44521</guid>
      <dc:creator>Lesley</dc:creator>
      <dc:date>2024-10-29T08:31:55Z</dc:date>
    </item>
    <item>
      <title>Re: High CPU but no failover</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/High-CPU-but-no-failover/m-p/231101#M44523</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.checkpoint.com/t5/user/viewprofilepage/user-id/14416"&gt;@Ryan_Ryan&lt;/a&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;CheckPoint CUL mode (Cluster Under Load)&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;As I know, there is a threshold at 80% where the CUL mode is enabled. During this mode, the cluster state freezes.&lt;/P&gt;
&lt;P&gt;check this SK:&amp;nbsp;&lt;A href="https://support.checkpoint.com/results/sk/sk92723" target="_blank"&gt;https://support.checkpoint.com/results/sk/sk92723&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;To summarize it, I don't think this kind of situaton triggers a failover.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;a href="https://community.checkpoint.com/t5/user/viewprofilepage/user-id/38213"&gt;@the_rock&lt;/a&gt;&amp;nbsp;What is your opinion?&lt;/P&gt;
&lt;P&gt;Akos&lt;/P&gt;</description>
      <pubDate>Tue, 29 Oct 2024 09:53:33 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/High-CPU-but-no-failover/m-p/231101#M44523</guid>
      <dc:creator>AkosBakos</dc:creator>
      <dc:date>2024-10-29T09:53:33Z</dc:date>
    </item>
    <item>
      <title>Re: High CPU but no failover</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/High-CPU-but-no-failover/m-p/231114#M44526</link>
      <description>&lt;P&gt;Well, makes sense, though I dont know if there is an official article or statement somewhere that says what thresholds are there for clustering processes to trigger failover (ie processes from command cphaprob -l list).&lt;/P&gt;
&lt;P&gt;Because, lets be realistic and logical...IF cpu reaches say 80%, to me personally, thats good enough reason for fw to failover. Cause lets be honest, Im sure IT admin for a big bank would not feel overly comfortable having fw under 80% cpu load keep processing the traffic for a very long time...&lt;/P&gt;
&lt;P&gt;But again, just my thinking.&lt;/P&gt;
&lt;P&gt;Andy&lt;/P&gt;</description>
      <pubDate>Tue, 29 Oct 2024 10:35:30 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/High-CPU-but-no-failover/m-p/231114#M44526</guid>
      <dc:creator>the_rock</dc:creator>
      <dc:date>2024-10-29T10:35:30Z</dc:date>
    </item>
    <item>
      <title>Re: High CPU but no failover</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/High-CPU-but-no-failover/m-p/231124#M44527</link>
      <description>&lt;P&gt;For what its worth, I had seen cases where say if this happened on 15000 series appliances, failover did NOT happen, but if it was 6000 series, it would have happened, so clearly it has to do with how powerful appliances are or how many cores they have.&lt;/P&gt;
&lt;P&gt;Andy&lt;/P&gt;</description>
      <pubDate>Tue, 29 Oct 2024 11:08:56 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/High-CPU-but-no-failover/m-p/231124#M44527</guid>
      <dc:creator>the_rock</dc:creator>
      <dc:date>2024-10-29T11:08:56Z</dc:date>
    </item>
  </channel>
</rss>

