<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Management Server is stuck - user is unable to run any command, seen many times! in Firewall and Security Management</title>
    <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217193#M65089</link>
    <description>&lt;P&gt;Incredibly curious, indeed. &amp;nbsp;Have you been able to get the hotfix mentioned in that SK?&lt;/P&gt;
&lt;P&gt;I happened to check one of my customers, and I also see them with numerous defunct CPD processes. &amp;nbsp;Theirs is a CloudGuard management server, but I have many other customers with the same deployment (with same Azure template and VM size). &amp;nbsp;I went through a bunch of logs and didn't find any smoking guns. &amp;nbsp;I found some concerning logs, but other customers have the same, without issue.&lt;/P&gt;
&lt;P&gt;I'm going to request that hotfix from TAC for my one customer, like yours. &amp;nbsp;Looks like we have the same bug.&amp;nbsp;&lt;span class="lia-unicode-emoji" title=":pensive_face:"&gt;😔&lt;/span&gt;&lt;/P&gt;</description>
    <pubDate>Tue, 11 Jun 2024 21:15:09 GMT</pubDate>
    <dc:creator>Duane_Toler</dc:creator>
    <dc:date>2024-06-11T21:15:09Z</dc:date>
    <item>
      <title>Management Server is stuck - user is unable to run any command, seen many times!</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217104#M65074</link>
      <description>&lt;P&gt;Hello team,&amp;nbsp;&lt;/P&gt;
&lt;P&gt;recently we stumbled over three issues on three totaly independet customer who run into this issue:&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;The MGMT server stopps to execute all kind of Check Point commands.&lt;BR /&gt;only "cpwd_admin list" worked and showed all processes as "E" not "T".&lt;/P&gt;
&lt;P&gt;Even "reboot" or "init6" stop to work.&lt;BR /&gt;only a power cycle via VmWare or similar is possible to regain control.&lt;BR /&gt;&lt;BR /&gt;if the MGMT is down, the Check Point CA is down, which is very unheathly for all VPN tunnels from the same MGMT based on certificates.&lt;BR /&gt;"invalid certificate" messages are then shown in the log.&lt;BR /&gt;&lt;BR /&gt;&lt;STRONG&gt;cpm.elg says:&lt;/STRONG&gt;&lt;BR /&gt;&lt;EM&gt;&lt;STRONG&gt;08/06/24 08:09:44,534 ERROR tracker.dataLog.TrackerDataSenderSvcImp [taskExecutor-31]: AuditLogsToTrackerSender: Unable to connect fwm (down), Exception: Could not receive Message.&lt;/STRONG&gt;&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;and it throws a ton on java errors in cpm.elg.&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;we saw this on three different customers, all on R81.20 HFA 53/65&lt;BR /&gt;&lt;BR /&gt;and yes there is that sk:&lt;BR /&gt;&lt;A href="https://support.checkpoint.com/results/sk/sk173405" target="_blank"&gt;https://support.checkpoint.com/results/sk/sk173405&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;it did not work for me to kill "autoupdater" ...&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;anybody noticed the same?&lt;BR /&gt;we have two CP cases ongoing!&lt;BR /&gt;&lt;BR /&gt;best regards&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2024 10:41:46 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217104#M65074</guid>
      <dc:creator>Thomas_Eichelbu</dc:creator>
      <dc:date>2024-06-11T10:41:46Z</dc:date>
    </item>
    <item>
      <title>Re: Management Server is stuck - user is unable to run any command, seen many times!</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217112#M65075</link>
      <description>&lt;P&gt;You could consider to disable the CRL check (less secure). It is a workaround during the time you figure it out with TAC.&lt;/P&gt;
&lt;P&gt;At least it will maybe give you some rest if you are not able to power cycle the unit right away.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://support.checkpoint.com/results/sk/sk21156" target="_blank"&gt;https://support.checkpoint.com/results/sk/sk21156&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2024 12:26:23 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217112#M65075</guid>
      <dc:creator>Lesley</dc:creator>
      <dc:date>2024-06-11T12:26:23Z</dc:date>
    </item>
    <item>
      <title>Re: Management Server is stuck - user is unable to run any command, seen many times!</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217117#M65076</link>
      <description>&lt;P&gt;Hello Lesley,&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;well yes i know that, but thats not the problem itself ...&lt;/P&gt;
&lt;P&gt;the problem is, the &lt;STRONG&gt;MGMT becomes unusable&lt;/STRONG&gt;, since no services run and &lt;STRONG&gt;no CP commands can be executed&lt;/STRONG&gt;.&lt;BR /&gt;Thats the root issue that lead to the outcome of an unreachable CRL ...&amp;nbsp;&lt;BR /&gt;And when the MGMT stops working you have no more chance to apply your SK&amp;nbsp;&lt;A href="https://support.checkpoint.com/results/sk/sk21156" target="_blank" rel="noopener noreferrer"&gt;sk21156&lt;/A&gt;&amp;nbsp;since, you cannot connect to MGMT database anymore to push policy &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2024 12:46:01 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217117#M65076</guid>
      <dc:creator>Thomas_Eichelbu</dc:creator>
      <dc:date>2024-06-11T12:46:01Z</dc:date>
    </item>
    <item>
      <title>Re: Management Server is stuck - user is unable to run any command, seen many times!</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217118#M65077</link>
      <description>&lt;P&gt;I understand that. And SK can be done before the problem occurs. Atleast the tunnels will stay up&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2024 12:50:17 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217118#M65077</guid>
      <dc:creator>Lesley</dc:creator>
      <dc:date>2024-06-11T12:50:17Z</dc:date>
    </item>
    <item>
      <title>Re: Management Server is stuck - user is unable to run any command, seen many times!</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217122#M65078</link>
      <description>&lt;P&gt;Does guidbedit load or that does not work either? I assume rebooting the mgmt does not make a difference?&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2024 14:11:27 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217122#M65078</guid>
      <dc:creator>the_rock</dc:creator>
      <dc:date>2024-06-11T14:11:27Z</dc:date>
    </item>
    <item>
      <title>Re: Management Server is stuck - user is unable to run any command, seen many times!</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217132#M65079</link>
      <description>&lt;P&gt;Check your cp_mgmt SIC certificate. &amp;nbsp;The "fwm" is down and the certificate errors you stated indicate you may have a problem there.&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;cpca_client lscert -kind SIC |grep cp_mgmt -A 2
&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2024 15:08:33 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217132#M65079</guid>
      <dc:creator>Duane_Toler</dc:creator>
      <dc:date>2024-06-11T15:08:33Z</dc:date>
    </item>
    <item>
      <title>Re: Management Server is stuck - user is unable to run any command, seen many times!</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217133#M65080</link>
      <description>&lt;P&gt;Good command!&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2024 15:11:33 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217133#M65080</guid>
      <dc:creator>the_rock</dc:creator>
      <dc:date>2024-06-11T15:11:33Z</dc:date>
    </item>
    <item>
      <title>Re: Management Server is stuck - user is unable to run any command, seen many times!</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217137#M65081</link>
      <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;well i see some expired certificates, but the majority of the certificates is still valid.&lt;BR /&gt;sorry i cannot post much of the output since it all contains personal data and so on ...&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;[Expert@ABCDEF:0]# cpca_client lscert -kind SIC |grep cp_mgmt -A 2&lt;/EM&gt;&lt;BR /&gt;&lt;EM&gt;Subject = CN=cp_mgmt,O=ABCDEF.X.X.X.X.Y.Y.Y.4cn4gu&lt;/EM&gt;&lt;BR /&gt;&lt;EM&gt;Status = Valid Kind = SIC Serial = 7622 DP = 0&lt;/EM&gt;&lt;BR /&gt;&lt;EM&gt;Not_Before: Tue May 25 14:22:35 2021 Not_After: Mon May 25 14:22:35 2026&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;But if the SIC certificate is expired or revoked or anything but valid the MGMT would not stay completely down. And i still could run any CP commands on cli.&lt;BR /&gt;And if the SIC certificate is invalid a reboot would not help here ...&amp;nbsp;&lt;/P&gt;
&lt;P&gt;As i said the MGMT server is not running nor any services nor running any Check Point CLI commands works.&lt;BR /&gt;Iam pretty sure it will be stuck by tomorrow again ... we will see.&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2024 15:27:42 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217137#M65081</guid>
      <dc:creator>Thomas_Eichelbu</dc:creator>
      <dc:date>2024-06-11T15:27:42Z</dc:date>
    </item>
    <item>
      <title>Re: Management Server is stuck - user is unable to run any command, seen many times!</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217140#M65082</link>
      <description>&lt;P&gt;&lt;a href="https://community.checkpoint.com/t5/user/viewprofilepage/user-id/24246"&gt;@Thomas_Eichelbu&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hi,&lt;BR /&gt;are there a lot of zombie processes running on the server? Our server currently has this error and the server is therefore unusable. A restart will temporarily fix the error&lt;/P&gt;&lt;P&gt;best regards&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2024 15:55:49 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217140#M65082</guid>
      <dc:creator>Pauli</dc:creator>
      <dc:date>2024-06-11T15:55:49Z</dc:date>
    </item>
    <item>
      <title>Re: Management Server is stuck - user is unable to run any command, seen many times!</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217141#M65083</link>
      <description>&lt;P&gt;Ok that's good to rule out. &amp;nbsp;Have you done the handful of sanity checks as well? &amp;nbsp;I would expect you have, but again just to rule them out:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;check disk space&lt;/LI&gt;
&lt;LI&gt;check OS logs to make sure nothing weird is there&lt;/LI&gt;
&lt;LI&gt;Since it's a VM, make sure the hypervisor host is ok:
&lt;UL&gt;
&lt;LI&gt;datastore disk space&lt;/LI&gt;
&lt;LI&gt;datastore access to the SAN or local storage&lt;/LI&gt;
&lt;LI&gt;hypervisor RAM&lt;/LI&gt;
&lt;/UL&gt;
&lt;/LI&gt;
&lt;LI&gt;Run CPM doctor ($FWDIR/scripts/run_cpmdoc.sh) when the host is functioning normally&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If you are able to run OS commands, but not Check Point commands, then that does sound like an issue with the registry file like the SK indicated.&lt;/P&gt;
&lt;P&gt;Try this, too: &amp;nbsp;If the host is functioning now, do a controlled reboot just to see how it behaves. &amp;nbsp;Since you have a pattern of the host misbehaving on an interval, see if this controlled reboot "buys" you more time for that interval before the next occurrence of the issue. Then look at the CPM debug topics and enable debug for Solr and webservices. &amp;nbsp;These may give you an additional clue while you wait on TAC.&lt;/P&gt;
&lt;P&gt;&lt;A href="https://support.checkpoint.com/results/sk/sk115557" target="_blank"&gt;https://support.checkpoint.com/results/sk/sk115557&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;Likewise, you may also want to do a separate debug of CPD:&lt;BR /&gt;&lt;A href="https://support.checkpoint.com/results/sk/sk86320" target="_blank"&gt;https://support.checkpoint.com/results/sk/sk86320&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;If it exhibits the issue again, a close examination of the CPM debug &amp;nbsp;*should* point to the issue at the moment it occurs.&lt;/P&gt;
&lt;P&gt;As a heads-up: TAC may give you the solr_cure process as part of the troubleshooting (sk140394, but it's a TAC internal SK).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2024 15:56:42 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217141#M65083</guid>
      <dc:creator>Duane_Toler</dc:creator>
      <dc:date>2024-06-11T15:56:42Z</dc:date>
    </item>
    <item>
      <title>Re: Management Server is stuck - user is unable to run any command, seen many times!</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217142#M65084</link>
      <description>&lt;P&gt;What did TAC come back with?&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2024 16:05:49 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217142#M65084</guid>
      <dc:creator>the_rock</dc:creator>
      <dc:date>2024-06-11T16:05:49Z</dc:date>
    </item>
    <item>
      <title>Re: Management Server is stuck - user is unable to run any command, seen many times!</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217146#M65085</link>
      <description>&lt;P&gt;Hello, oh yes many Zombies ...&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;HCP from&amp;nbsp; customer B&lt;/STRONG&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Bild.png" style="width: 999px;"&gt;&lt;img src="https://community.checkpoint.com/t5/image/serverpage/image-id/26193i47F992714731C181/image-size/large?v=v2&amp;amp;px=999" role="button" title="Bild.png" alt="Bild.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;grep for Zombies on Customer A&lt;/STRONG&gt;, here HCP doesnt run, it dies&amp;nbsp; on licence check, a different story ... maybe?&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;
&lt;P&gt;endless rows of:&lt;BR /&gt;&lt;EM&gt;[Expert@ABCDEF:0]# ps aux | grep Z | more&lt;/EM&gt;&lt;BR /&gt;&lt;EM&gt;USER PID %CPU %MEM VSZ RSS TTY STAT START TIME COMMAND&lt;/EM&gt;&lt;BR /&gt;&lt;EM&gt;admin 4653 0.0 0.0 0 0 ? Z 08:38 0:00 [cpd] &amp;lt;defunct&amp;gt;&lt;/EM&gt;&lt;BR /&gt;&lt;EM&gt;admin 4654 0.0 0.0 0 0 ? Z 08:38 0:00 [cpd] &amp;lt;defunct&amp;gt;&lt;/EM&gt;&lt;BR /&gt;&lt;EM&gt;admin 4655 0.0 0.0 0 0 ? Z 08:38 0:00 [cpd] &amp;lt;defunct&amp;gt;&lt;/EM&gt;&lt;BR /&gt;&lt;EM&gt;admin 4657 0.0 0.0 0 0 ? Z 08:38 0:00 [cpd] &amp;lt;defunct&amp;gt;&lt;/EM&gt;&lt;BR /&gt;&lt;EM&gt;admin 4658 0.0 0.0 0 0 ? Z 08:38 0:00 [cpd] &amp;lt;defunct&amp;gt;&lt;/EM&gt;&lt;BR /&gt;&lt;EM&gt;admin 4659 0.0 0.0 0 0 ? Z 08:38 0:00 [cpd] &amp;lt;defunct&amp;gt;&lt;/EM&gt;&lt;BR /&gt;&lt;EM&gt;admin 4661 0.0 0.0 0 0 ? Z 08:38 0:00 [cpd] &amp;lt;defunct&amp;gt;&lt;/EM&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;customer C, the lucky guy!&lt;BR /&gt;&lt;/STRONG&gt;no Zombies,&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;[Expert@HIJKLMNO:0]# ps aux | grep Z | more&lt;BR /&gt;USER PID %CPU %MEM VSZ RSS TTY STAT START TIME COMMAND&lt;BR /&gt;admin 24765 0.0 0.0 2652 572 pts/1 S+ 18:44 0:00 grep --color=auto Z&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;A href="https://support.checkpoint.com/results/sk/sk182370" target="_blank"&gt;https://support.checkpoint.com/results/sk/sk182370&lt;/A&gt;&amp;nbsp;is about Zombis on CPD ...&lt;BR /&gt;&lt;BR /&gt;maybe an license issues, on customer A &amp;amp; B is see licensed issued for different IP´s on the SMS ...&lt;BR /&gt;(usage of aliases and so on )&lt;BR /&gt;Customer C has licensees issued only for its own real IP.&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2024 16:56:32 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217146#M65085</guid>
      <dc:creator>Thomas_Eichelbu</dc:creator>
      <dc:date>2024-06-11T16:56:32Z</dc:date>
    </item>
    <item>
      <title>Re: Management Server is stuck - user is unable to run any command, seen many times!</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217147#M65086</link>
      <description>&lt;P&gt;nothing so far ...&lt;BR /&gt;iam still waiting.&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2024 16:28:36 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217147#M65086</guid>
      <dc:creator>Thomas_Eichelbu</dc:creator>
      <dc:date>2024-06-11T16:28:36Z</dc:date>
    </item>
    <item>
      <title>Re: Management Server is stuck - user is unable to run any command, seen many times!</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217148#M65087</link>
      <description>&lt;P&gt;this are all good points.&lt;BR /&gt;&lt;BR /&gt;CPM Doctor did not show any negative things, all green.&lt;BR /&gt;disk space is all good.&lt;BR /&gt;didnt run SOLR Cure yet&lt;BR /&gt;since iam not controlling the VMware infrastructure, i rely on third party to check this.&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2024 16:33:04 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217148#M65087</guid>
      <dc:creator>Thomas_Eichelbu</dc:creator>
      <dc:date>2024-06-11T16:33:04Z</dc:date>
    </item>
    <item>
      <title>Re: Management Server is stuck - user is unable to run any command, seen many times!</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217149#M65088</link>
      <description>&lt;P&gt;Ok, fair enough. As far as zombie processes, I know that usually fixed by doing cpstop; cpstart or reboot, but does not sound that would do much here. And since you said it does happen on R81.20 jumbo 65 as well, they cant really ask you to install any other jumbo hotfix. Now, to comment for cpm doctor, if that does not show any errors, tells me most likely database is clean.&lt;/P&gt;
&lt;P&gt;Just wondering, how much ram is there on these servers, any idea?&lt;/P&gt;
&lt;P&gt;Andy&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2024 16:36:22 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217149#M65088</guid>
      <dc:creator>the_rock</dc:creator>
      <dc:date>2024-06-11T16:36:22Z</dc:date>
    </item>
    <item>
      <title>Re: Management Server is stuck - user is unable to run any command, seen many times!</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217193#M65089</link>
      <description>&lt;P&gt;Incredibly curious, indeed. &amp;nbsp;Have you been able to get the hotfix mentioned in that SK?&lt;/P&gt;
&lt;P&gt;I happened to check one of my customers, and I also see them with numerous defunct CPD processes. &amp;nbsp;Theirs is a CloudGuard management server, but I have many other customers with the same deployment (with same Azure template and VM size). &amp;nbsp;I went through a bunch of logs and didn't find any smoking guns. &amp;nbsp;I found some concerning logs, but other customers have the same, without issue.&lt;/P&gt;
&lt;P&gt;I'm going to request that hotfix from TAC for my one customer, like yours. &amp;nbsp;Looks like we have the same bug.&amp;nbsp;&lt;span class="lia-unicode-emoji" title=":pensive_face:"&gt;😔&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2024 21:15:09 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217193#M65089</guid>
      <dc:creator>Duane_Toler</dc:creator>
      <dc:date>2024-06-11T21:15:09Z</dc:date>
    </item>
    <item>
      <title>Re: Management Server is stuck - user is unable to run any command, seen many times!</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217240#M65090</link>
      <description>&lt;P&gt;Well i had a long phone call with TAC to summarize all things ...&lt;BR /&gt;But he didn't said much about the zombies. Maybe they are not so horrible as they sound. At least he didn't paid much attention to it.&lt;BR /&gt;&lt;BR /&gt;And i got no Hotfix for the CDP zombies as mention in SK&amp;nbsp;&lt;A href="https://support.checkpoint.com/results/sk/sk182370" target="_blank" rel="noopener noreferrer"&gt;sk182370.&lt;/A&gt;&lt;BR /&gt;Honestly i didn't requested one.&lt;BR /&gt;&lt;BR /&gt;So i have opened two cases for two customer, they are ongoing. lets see what TAC will find out.&lt;/P&gt;</description>
      <pubDate>Wed, 12 Jun 2024 06:34:14 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217240#M65090</guid>
      <dc:creator>Thomas_Eichelbu</dc:creator>
      <dc:date>2024-06-12T06:34:14Z</dc:date>
    </item>
    <item>
      <title>Re: Management Server is stuck - user is unable to run any command, seen many times!</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217303#M65091</link>
      <description>&lt;P&gt;Hey, any new updates or nothing yet?&lt;/P&gt;
&lt;P&gt;Andy&lt;/P&gt;</description>
      <pubDate>Wed, 12 Jun 2024 17:50:59 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217303#M65091</guid>
      <dc:creator>the_rock</dc:creator>
      <dc:date>2024-06-12T17:50:59Z</dc:date>
    </item>
    <item>
      <title>Re: Management Server is stuck - user is unable to run any command, seen many times!</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217807#M65092</link>
      <description>&lt;P&gt;We have the same thing with 1 of our CMA's on our MDS.&amp;nbsp; At least for us, a reboot of the MDS will fix things and we will be fine for another 6-10 days until we have the CPD problem.&amp;nbsp; Working with TAC, the one engineer said that T150 was supposed to contain a fix for CPD with defunct status.&amp;nbsp; I assume this might be the same hotfix that&amp;nbsp;sk182370 mentions.&amp;nbsp; We are currently running T141.&lt;/P&gt;&lt;P&gt;The first time we had this problem was May 30, before we installed T141.&amp;nbsp; T141 was recommended at the time the hotfixes for the vpn problem came out, so we went with that&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;With CPD in the defunct status, we had the SIC issues too and we started to have the vpn tunnels start failing as I would say it would have been over the 24 hours that the remote firewall had checked in with the mgmt server to check SIC.&lt;/P&gt;</description>
      <pubDate>Mon, 17 Jun 2024 16:30:30 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217807#M65092</guid>
      <dc:creator>Chris_Wilson</dc:creator>
      <dc:date>2024-06-17T16:30:30Z</dc:date>
    </item>
    <item>
      <title>Re: Management Server is stuck - user is unable to run any command, seen many times!</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217812#M65093</link>
      <description>&lt;P&gt;My customer with this issue hasn't had the CPD defunct process situation escalate to total server outage yet. &amp;nbsp;I have Nagios monitoring the system process counts frequently, so I am able to get to it and restart CPD with the "cpwd_admin" commands in a controlled state.&lt;/P&gt;
&lt;P&gt;If you're desperate, make yourself a cron job to do it, too.&lt;/P&gt;
&lt;P&gt;EDIT: I made a real script today that will do everything we need (MDS top-level, MDS per-domain, SMS, EPM, SME) and posted it in the ToolBox:&lt;/P&gt;
&lt;P&gt;&lt;A href="https://community.checkpoint.com/t5/Scripts/Restart-CPD-script/m-p/217862/highlight/true#M1159" target="_blank" rel="noopener"&gt;https://community.checkpoint.com/t5/Scripts/Restart-CPD-script/m-p/217862/highlight/true#M1159&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;[Expert@cpmgmt01:0]# ./cpd_restart.sh -h
  cpd_restart.sh:  Restart CPD process on Multi-Domain server and Security/Endpoint management

  Usage: ./cpd_restart.sh  [ -d [ ALL | &amp;lt;specific domain server&amp;gt; ] | [ -h ]

  Options:
    d     Specify a single domain management server (CMA) or special word ALL for all domain
          servers listed in "mdsstat" output (Optional; only relevant for MDS)
    h     This help

  If no argument is given, then the top level CPD process is restarted (for the MDS itself,
  Security Management server, or Endpoint Management server)&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Run it with a "-d ..." &amp;nbsp;to restart CPD on a given domain server if that's your troublesome one, or "-d ALL" to restart CPD on all domain servers. &amp;nbsp;This only restarts CPD and leaves the other processes alone, so there's no outage. &amp;nbsp;It uses the same methods that Check Point's own scripts use (shameless stole the commands out of $MDSDIR/scripts/cpshared). &amp;nbsp;This ensures CPD restart is done the correct way and gets re-attached to CPWD for monitoring.&lt;/P&gt;
&lt;P&gt;If you just have a single Security Management server, then don't give any arguments and it'll just restart the one process, or the MDS root CPD process.&lt;/P&gt;
&lt;P&gt;Put that script in /home/admin, chmod 755, then set a job in CLISH:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;&amp;gt; add cron job CPD_Restart command "/home/admin/cpd_restart.sh" recurrence hourly hours all at 00 

&amp;gt; show cron job CPD_Restart recurrence 
Every day at every hour at the 00 minutes.&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;or for MDS:&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;&amp;gt; add cron job CPD_Restart command "/home/admin/cpd_restart.sh" recurrence hourly hours all at 00 
&amp;gt; add cron job CPD_Restart_domains command "/home/admin/cpd_restart.sh -d ALL" recurrence hourly hours all at 05

&amp;gt; show cron job CPD_Restart recurrence 
Every day at every hour at the 00 minutes.
&amp;gt; show cron job CPD_Restart_domains recurrence 
Every day at every hour at the 05 minutes.&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 18 Jun 2024 02:24:12 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/Management-Server-is-stuck-user-is-unable-to-run-any-command/m-p/217812#M65093</guid>
      <dc:creator>Duane_Toler</dc:creator>
      <dc:date>2024-06-18T02:24:12Z</dc:date>
    </item>
  </channel>
</rss>

